NEWS from INTEGRATION - using&developing

Use of the Evaluation System including a new search tool "SOLR_SEARCH" and an area for YOUR specific model data
Added by Christopher Kadow almost 9 years ago

NEWS from 19.07.2013

Many projects within MiKlip have joined the Evaluation System and the MiKlip server: they use it and have brought in analyses (MesoTel, EnsDiVal) and data (VADY, MesoTel).

The Basic User Guide (BUG) by INTEGRATION gives an overview of what has been developed or plugged in and explains how to load and use the evaluation system. You can find it here:

Keeping both aspects of our work in mind, using&developing, today we want to pick up two areas of interest to you.



The MiKlip server and the mounted DKRZ ESGF node combine into a huge database with direct access to over 50 model systems and 2 million files (no need to download all the data from the ESGF or from slow tape archives!). That is why standardization is so important: it lets you find everything and stay comparable with the international community. INTEGRATION combined (linked) the different areas into one


and developed the search tool "find_files"; both were already presented at the MiKlip Status Seminar 2013. But any ordinary search is too slow to run through that amount of data.

Therefore INTEGRATION developed the new search system "solr_search" for the ongoing evaluation system and for scientists working on the MiKlip server. It has the same syntax as "find_files".

"solr_search" is a very fast search tool with user-friendly tab completion.

Looking for the BASELINE1 runs?

[b324031@miklip04 ~]$ module load evaluation_system

[b324031@miklip04 ~]$ solr_search ->PRESS TABULATOR
cmor_table=      ensemble=        institute=       project=         time_frequency=  
data_type=       experiment=      model=           realm=           variable=

[b324031@miklip04 ~]$ solr_search data_type= ->PRESS TABULATOR
baseline0     baseline1     cmip5         observations  reanalysis

[b324031@miklip04 ~]$ solr_search data_type=baseline1 ->PRESS TABULATOR
cmor_table=      experiment=      realm=           variable=        
ensemble=        model=           time_frequency=

[b324031@miklip04 ~]$ solr_search data_type=baseline1 model=mpi ->PRESS TABULATOR
mpi-esm-lr  mpi-esm-mr  

[b324031@miklip04 ~]$ solr_search data_type=baseline1 model=mpi-esm-lr ->PRESS TABULATOR
cmor_table=      ensemble=        realm=           time_frequency=  variable=      

[b324031@miklip04 ~]$ solr_search data_type=baseline1 model=mpi-esm-lr variable= ->PRESS TABULATOR
Display all 100 possibilities? (y or n)
areacella      clivi          dissoc         fddtdisi       hfls           intpn2         no3
areacello      clt            dms            ffire          hfsithermds    intpp          npp
baresoilfrac   clw            dpco2          fgco2          hfss           lai            o2
basin          clwvi          dpo2           fgdms          hfx            landcoverfrac  o2min
bfe            co2mass        epc100         fgo2           hfxba          masso          omldamax
bsi            co3            epcalc100      fluc           hfxdiff        mfo            omlmax
burntarea      co3satcalc     epfe100        frc            hfy            mlotst         orog
c3pftfrac      cropfrac       epsi100        frfe           hfyba          mlotstsq       pasturefrac
c4pftfrac      csoil          evspsbl        frn            hfydiff        mrro           pbfe
calc           cveg           fco2nat        fsfe           hur            mrros          pbo...
cct            dcalc          fddtalk        fveglitter     hus            mrso           
chl            deptho         fddtdic        gpp            intdic         mrsofc         
cl             detoc          fddtdife       grassfrac      intpbfe        msftbarot      
cli            dfe            fddtdin        graz           intpbsi        msftmyz        
clitter        dissic         fddtdip        hfds           intpcalcite    nbp

etc etc etc

Looking for models to compare with your temperature results?

solr_search variable=ta model= ->PRESS TABULATOR
access1-0        cesm1-bgc        ec-earth         giss-e2-h-cc     ipsl-cm5a-lr     mpi-esm-p
access1-3        cesm1-cam5       fio-esm          giss-e2-r        ipsl-cm5a-mr     mri-agcm3-2h
bcc-csm1-1       cesm1-fastchem   geos-5           giss-e2-r-cc     ipsl-cm5b-lr     mri-cgcm3
bnu-esm          cesm1-waccm      gfdl-cm3         hadcm3           miroc4h          noresm1-m
canam4           cmcc-cesm        gfdl-esm2g       hadgem2-a        miroc5           noresm1-me
cancm4           cmcc-cm          gfdl-esm2m       hadgem2-ao       miroc-esm        
canesm2          cmcc-cms         gfdl-hiram-c180  hadgem2-cc       miroc-esm-chem   
ccsm4            cnrm-cm5         gfdl-hiram-c360  hadgem2-es       mpi-esm-lr       
cdas             csiro-mk3-6-0    giss-e2-h        inmcm4           mpi-esm-mr       

etc etc etc

Do we also have observations for temperature?

[b324031@miklip04 ~]$ solr_search data_type=observations variable=ta ->PRESS ENTER

SOLR_SEARCH can be used when developing analyses: it gives direct access to datasets through their metadata and makes ensemble handling in CMOR/CMIP5 datasets much easier! It will also be the basis of the hybrid "shell/web/operational" solution of the evaluation system, to ensure fast and effective use of the evaluation system and the MiKlip server. The system is in the beta phase and ready to be filled with the regional data as well as the next general MiKlip runs.
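
As a minimal sketch of ensemble handling in a script, the helper below counts files per ensemble member from a list of paths. It rests on two assumptions that are not confirmed above: that solr_search prints one file path per line, and that file names follow the CMIP5 pattern <variable>_<cmor_table>_<model>_<experiment>_<ensemble>[_<time range>].nc. The name count_per_ensemble is hypothetical and the sample paths are made up.

```shell
# count_per_ensemble: read file paths on stdin, print "<ensemble> <count>".
# Assumption: the 5th "_"-separated field of the file name is the ensemble
# member, as in the CMIP5 file name pattern.
count_per_ensemble() {
    awk -F/ '
        NF == 0 { next }                      # skip empty lines
        {
            n = split($NF, part, "_")         # $NF is the file name
            if (n < 5) next                   # not a CMIP5-style name
            member = part[5]
            sub(/\.nc$/, "", member)          # fixed fields have no time range
            count[member]++
        }
        END { for (member in count) print member, count[member] }
    ' | sort
}

# Made-up sample paths standing in for real search results.
printf '%s\n' \
    '/made/up/tas_Amon_MPI-ESM-LR_decadal2000_r1i1p1_200101-201012.nc' \
    '/made/up/tas_Amon_MPI-ESM-LR_decadal2000_r2i1p1_200101-201012.nc' \
    '/made/up/pr_Amon_MPI-ESM-LR_decadal2000_r1i1p1_200101-201012.nc' |
    count_per_ensemble
```

On the MiKlip server the same helper could be fed by a real query, e.g. solr_search data_type=baseline1 variable=tas | count_per_ensemble, provided the output format matches the assumption.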

More detailed information:


Our project combined different areas of data in


This concerns data that do not belong to the general MiKlip runs. Where can you put them so that they can be compared with the general MiKlip data?

We set up a projectdata directory:


If you have CMORized data, e.g. in your scratch directory, send us an email and we will link it into the system.

What does a CMORized structure mean?

Let's have a look at user data from MESOTEL Module C - b324034


This is a link: b324034 -> /scratch/b324034/archive/CMIP5/. As you can see, the user decided to keep the standard CMIP5 CMOR settings to test his data; to do the same, just use the switch in the MPI-ESM for post-processing.

When we go deeper into the structure we find e.g.


As you can see, the data must follow a specific directory structure. Of course you have already had a look at CMOR :) If not, here is a short description of what your data structure needs for linking in, based on the example above.
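
For orientation, here is a sketch of the directory layout defined by the CMIP5 Data Reference Syntax (DRS), which is what "CMORized" usually refers to. Treat it as an assumption about the general CMIP5 convention, not as a copy of the layout on the MiKlip server, which may differ in details. drs_dir is a hypothetical helper, and all facet values (including the version) are illustrative.

```shell
# drs_dir ACTIVITY PRODUCT INSTITUTE MODEL EXPERIMENT FREQUENCY REALM \
#         TABLE ENSEMBLE VERSION VARIABLE
# Print the directory below which the NetCDF files of one variable live.
# Assumption: facet order as in the CMIP5 Data Reference Syntax.
drs_dir() {
    if [ "$#" -ne 11 ]; then
        echo "drs_dir: expected 11 facets, got $#" >&2
        return 1
    fi
    # Join all facets with "/" in the order given; the subshell keeps the
    # IFS change local.
    ( IFS=/; printf '%s\n' "$*" )
}

# Illustrative values only.
drs_dir cmip5 output1 MPI-M MPI-ESM-LR decadal2000 mon atmos Amon r1i1p1 v20120529 tas

# Files inside that directory would then be named like
#   <variable>_<table>_<model>_<experiment>_<ensemble>_<time range>.nc
```

Most of these directory levels correspond to the keys that solr_search offers on tab completion (institute, model, experiment, time_frequency, realm, cmor_table, ensemble, variable), which is why a fixed structure is needed before data can be linked in.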







And now the different stories come together as one!

To look for your data with solr_search, just run:

solr_search project=b324034 experiment=decadal2000 variable= -> PRESS TAB
Display all 100 possibilities? (y or n) -> n
solr_search project=b324034 experiment=decadal2000 variable=tas time_frequency=mon -> PRESS ENTER

And that means you can also ask solr_search for data to compare with, here baseline1-LR!

solr_search project=b324034 project=baseline1 experiment=dec*2000 variable=tas time_frequency=mon model=mpi-esm-lr
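
The dec*2000 in the command above is a wildcard. As a small sketch of how such a glob behaves, the helper below tests values against a pattern with the shell's own matching. This only illustrates glob semantics; it is not how solr_search evaluates wildcards internally, and facet_matches is a hypothetical name.

```shell
# facet_matches PATTERN VALUE
# Succeed when VALUE matches the shell glob PATTERN.
facet_matches() {
    case "$2" in
        $1) return 0 ;;   # unquoted on purpose: $1 is used as a pattern
        *)  return 1 ;;
    esac
}

for experiment in decadal1990 decadal2000 historical; do
    if facet_matches 'dec*2000' "$experiment"; then
        echo "match: $experiment"
    fi
done
```

One practical point: an unquoted * on the command line is first offered to the shell's filename expansion. If a file in the current directory happens to match experiment=dec*2000, the shell replaces the argument before solr_search sees it; writing 'experiment=dec*2000' in quotes avoids that.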

If you want to bring in other data, such as reanalysis, to test something, you can of course use this too. E.g.:


For help with getting your specific data CMORized, please have a look at:


There you will find an example for decadal, historical, reanalysis and observation standardization.

This is the idea: bring the data together instead of changing the tools for different datasets. All ongoing projects that develop analyses will use this structure. If we can manage that in MiKlip, we will have a fast, effective, growing, and open evaluation system.

Please contact us if you have any questions or suggestions.

Christopher Kadow -
Sebastian Illing -