Using Phenotype Ontologies in Human GVF

From SO Wiki
Jump to: navigation, search

This page describes a set of best practices for using phenotype ontologies in Human GVF. The goal is to support linkage of genomic features to computationally operational phenotype annotations.


Contents

Using IDs vs Term Names vs Comments

GVF allows you to use IDs or label strings to indicate the phenotype. The ID should be used at all times. Formatting for each ontology source ID is indicated below. If a specific enough ontology term cannot be found, use the comment field to enter a free text description and choose a higher level ontology term ID for the ID field. Then go to the respective tracker and request that the term be added (see below for specifics).


Which Phenotype Ontology to use?

If we take "phenotype" to be inclusive of traits, diseases, pathological features, etc then there are a number of choices. The main ontologies are listed below, together with examples.



Human Phenotype Ontology (HP)

In OBO Library? YES

Scope: Any human phenotype. Currently focuses on morphological abnormalities, but is being extended to cover other domains and is under active development.

Tracker: https://sourceforge.net/p/obo/human-phenotype-requests/

Example:

An individual with "Cafe-au-lait" spots would use this ontology class (http://purl.obolibrary.org/obo/HP_0000957): and the term in the GVF file would be indicated as HP:0000957

##phenotype-description Term=HP:0000957;Ontology=http://purl.obolibrary.org/obo/hp.obo


Disease Ontology (DO)

In OBO Library? YES

Scope: All human diseases.

Example:

An individual with Parkinson’s disease would be recorded in the DO as http://purl.obolibrary.org/obo/DOID_14330

The ID should be recorded in the GVF as DOID:14330

##phenotype-description Term=DO:14330;Ontology=http://purl.obolibrary.org/obo/doid.obo


OMIM

OMIM may be more suited to recording if the individual has a particular genetic disorder. A description of OMIM ID schema can be found here: http://omim.org/help/faq

An individual with GYRATE ATROPHY OF CHOROID AND RETINA (http://omim.org/entry/258870) would be recorded as: OMIM:258870




SNOMED-CT

In OBO Library? NO

SNOMED-CT has "findings" and "disorder" sub-hierarchies, as well as disease, which can be used to indicate the phenotype. Many electronic health care systems use SNOMED-CT, so there may be phenotype records available for individuals already using this vocabulary. Note that SNOMED-CT is not open - there are ongoing discussions about SNOMED transitioning to an open system. For now, bear in mind that if you use SNOMED it may restrict the ability of some people to do analyses that make use of the ontology structure (though mappings to HPO are available from the HPO team) If you use SNOMED, use the SCTID ID space.

Example: An individual with male hypogonadism would be recorded as:

SCTID:48723006

Information about SNOMED-CT is available here: http://www.ihtsdo.org/



ICD-9

In OBO Library? NO

Like SNOMED, this is frequently used in EHRs to record billing codes and diagnoses.

For describing human phenotypes, HPO may be more suitable, but if you have access only to ICD-9 encoded phenotype data, please use the following ID format.

Example:

For a patient with ‘cleft lip, record the ID as follows:

ICD9CM:749.1

Note that ICD9 is being replaced by newer versions ICD10 and ICD11 as described here: http://www.who.int/classifications/icd/revision/en/



Mammalian Pathology Ontology (MPATH)

In OBO Library? YES

Scope:

Pathological physical entities and processes. MPATH is focused on actual pathological entities, and may be most suitable for genotyping of pathological tissue samples.

Example:

An individual with truncoconal septal defect http://purl.obolibrary.org/obo/MPATH_619 would be recorded as MPATH:619 in the GVF file.

##phenotype-description Term=MPATH:619;Ontology=http://purl.obolibrary.org/obo/mpath.obo

Mapping between phenotype terminology terms

Medgen http://www.ncbi.nlm.nih.gov/medgen provides a slice of the UMLS for the purposes of annotating data in the context of ClinVar. Medgen therefore contains mappings to other resources, such as MeSH and HPO. Since MedGen is already a mapping, we recommend using one of the source ontologies for the annotations in the GVF.

Medical Subject Headings

MeSH (Medical Subject Headings) is the NLM controlled vocabulary thesaurus used for indexing articles for PubMed. Homepage for MeSH: http://www.nlm.nih.gov/mesh/meshhome.html and a browser here: http://www.nlm.nih.gov/cgi/mesh/2013/MB_cgi In particular, consider annotation using the disease ( C ) tree. An example of usage would be for Aphakia, which would be recorded as MESH:C11.510.103

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox