TY - JOUR
T1 - The Chado Natural Diversity module
T2 - a new generic database schema for large-scale phenotyping and genotyping data
AU - Jung, Sook
AU - Menda, Naama
AU - Redmond, Seth
AU - Buels, Robert M.
AU - Friesen, Maren
AU - Bendana, Yuri
AU - Sanderson, Lacey Anne
AU - Lapp, Hilmar
AU - Lee, Taein
AU - MacCallum, Bob
AU - Bett, Kirstin E.
AU - Cain, Scott
AU - Clements, Dave
AU - Mueller, Lukas A.
AU - Main, Dorrie
PY - 2011/12
Y1 - 2011/12
N2 - Linking phenotypic with genotypic diversity has become a major requirement for basic and applied genome-centric biological research. To meet this need, a comprehensive database backend for efficiently storing, querying and analyzing large experimental data sets is necessary. Chado, a generic, modular, community-based database schema is widely used in the biological community to store information associated with genome sequence data. To meet the need to also accommodate large-scale phenotyping and genotyping projects, a new Chado module called Natural Diversity has been developed. The module strictly adheres to the Chado remit of being generic and ontology driven. The flexibility of the new module is demonstrated in its capacity to store any type of experiment that either uses or generates specimens or stock organisms. Experiments may be grouped or structured hierarchically, whereas any kind of biological entity can be stored as the observed unit, from a specimen to be used in genotyping or phenotyping experiments, to a group of species collected in the field that will undergo further lab analysis. We describe details of the Natural Diversity module, including the design approach, the relational schema and use cases implemented in several databases.
AB - Linking phenotypic with genotypic diversity has become a major requirement for basic and applied genome-centric biological research. To meet this need, a comprehensive database backend for efficiently storing, querying and analyzing large experimental data sets is necessary. Chado, a generic, modular, community-based database schema is widely used in the biological community to store information associated with genome sequence data. To meet the need to also accommodate large-scale phenotyping and genotyping projects, a new Chado module called Natural Diversity has been developed. The module strictly adheres to the Chado remit of being generic and ontology driven. The flexibility of the new module is demonstrated in its capacity to store any type of experiment that either uses or generates specimens or stock organisms. Experiments may be grouped or structured hierarchically, whereas any kind of biological entity can be stored as the observed unit, from a specimen to be used in genotyping or phenotyping experiments, to a group of species collected in the field that will undergo further lab analysis. We describe details of the Natural Diversity module, including the design approach, the relational schema and use cases implemented in several databases.
UR - https://www.scopus.com/pages/publications/84862168904
U2 - 10.1093/database/bar051
DO - 10.1093/database/bar051
M3 - Article
C2 - 22120662
AN - SCOPUS:84862168904
SN - 1758-0463
VL - 2011
JO - Database: the Journal of Biological Databases and Curation
JF - Database: the Journal of Biological Databases and Curation
M1 - bar051
ER -