Ticket #1004 (assigned Issue)
No common key between TDS-DRS data and CIM instances
|Reported by:||spascoe||Owned by:||gerry|
|Component:||WP6 - CMIP5 Questionnaire||Version:|
|Keywords:||Cc:||charlotte, bryan, gerry|
Description (last modified by spascoe) (diff)
As discussed on the telco 2012-01-05.
To merge data and metadata and to link to the CIM simulation instances, we need a common key between the TDS-DRS oriented data and the CIM instances. If a common key does not exist we need to find another way of achieving the link.
There are 2 problems to overcome.
1. Experiment mapping
The Decadal experiment name in CIM has been collapsed into a single name whereas DRS is using a set of names of the form decadalXXXX where XXXX is a start year.
For instance the NCAR Gateway currently lists these decadal experiments for CMIP5. Dataset count in parentheses:
decadal1959 (123), decadal1960 (1061), decadal1961 (54) decadal1962 (54), decadal1963 (54), decadal1964 (177), decadal1965 (1104), decadal1966 (54), decadal1967 (54), decadal1968 (39), decadal1969 (177), decadal1970 (1104), decadal1971 (54), decadal1972 (54), decadal1973 (54), decadal1974 (177), decadal1975 (1104), decadal1976 (54), decadal1977 (31), decadal1978 (186), decadal1979 (310), decadal1980 (1204), decadal1981 (195), decadal1982 (163), decadal1983 (195), decadal1984 (286), decadal1985 (1188), decadal1986 (163), decadal1987 (163), decadal1988 (163), decadal1989 (274), decadal1990 (1193), decadal1991 (163), decadal1992 (163), decadal1993 (195), decadal1994 (286), decadal1995 (1191), decadal1996 (195), decadal1997 (163), decadal1998 (198), decadal1999 (287), decadal2000 (1190), decadal2001 (499), decadal2002 (499), decadal2003 (531), decadal2004 (619), decadal2005 (1183), decadal2006 (519), decadal2007 (441), decadal2008 (442), decadal2009 (193), decadal2010 (262)
2. Ensemble RIP to Simulation mapping
DRS has no concept of a simulation. We might assume that we could map simulations to DRS like this:
cim-simulation == (drs-institute, drs-model, drs-experiment)
However, this assumes all ensemble members for this model/experiment are in the same simulation. This is not the case (do we have examples?). Two CIM records may refer to the same institute/model/experiment but with different collections of ensemble rip values. Therefore in general:
cim-simulation == (drs-institute, drs-model, drs-experiment, [drs-ensemble, drs-ensemble, ...])
This cannot be represented as a single key without some syntax for a collection of ensembles. E.g. a wild-card or comma-separated list. Alternatively somewhere there needs to be a 1-to-many mapping between cim-simulation and drs-ensemble.
- Description modified (diff)
- Summary changed from Questionnaire decadal experiment names are not compatible with DRS experiment names to No common key between TDS-DRS data and CIM instances