The aim of the GENCODE Genes project (Harrow et al., 2006) is to produce a set of highly accurate annotations of evidence-based gene features on the human reference genome. This includes the identification of all protein-coding loci with associated alternative splice variants, non-coding with transcript evidence in the public databases (NCBI/EMBL/DDBJ) and pseudogenes. A high quality set of gene structures is necessary for many research studies such as comparative or evolutionary analyses, or for experimental design and interpretation of the results.
The GENCODE Genes tracks display the high-quality manual annotations merged with evidence-based automated annotations across the entire human genome. The GENCODE gene set presents a full merge between HAVANA manual annotation and Ensembl automatic annotation. Priority is given to the manually curated HAVANA annotation using predicted Ensembl annotations when there are no corresponding manual annotations. With each release, there is an increase in the number of annotations that have undergone manual curation. This annotation was carried out on the GRCh38 (hg38) genome assembly.
For more information on the different gene tracks, see our Genes FAQ.
These are multi-view composite tracks that contain differing data sets (views). Instructions for configuring multi-view tracks are here. Only some subtracks are shown by default. The user can select which subtracks are displayed via the display controls on the track details pages. Further details on display conventions and data interpretation are available in the track descriptions.
GENCODE Genes and its associated tables can be explored interactively using the REST API, the Table Browser or the Data Integrator. The GENCODE data files for hg38 are available in our downloads directory as wgEncodeGencode* files in genePred format. All the tables can also be queried directly from our public MySQL servers, with instructions on this method available on our MySQL help page as well as on our blog.
GENCODE version 46 corresponds to Ensembl 112.
GENCODE version 45 corresponds to Ensembl 111.
GENCODE version 44 corresponds to Ensembl 110.
GENCODE version 43 corresponds to Ensembl 109.
GENCODE version 42 corresponds to Ensembl 108.
GENCODE version 41 corresponds to Ensembl 107.
GENCODE version 40 corresponds to Ensembl 106.
GENCODE version 39 corresponds to Ensembl 105.
GENCODE version 38 corresponds to Ensembl 104.
GENCODE version 37 corresponds to Ensembl 103.
GENCODE version 36 corresponds to Ensembl 102.
GENCODE version 35 corresponds to Ensembl 101.
GENCODE version 34 corresponds to Ensembl 100.
GENCODE version 33 corresponds to Ensembl 99.
GENCODE version 30 corresponds to Ensembl 96.
GENCODE version 29 corresponds to Ensembl 94.
GENCODE version 28 corresponds to Ensembl 92.
GENCODE version 27 corresponds to Ensembl 90.
GENCODE version 26 corresponds to Ensembl 88.
GENCODE version 24 corresponds to Ensembl 84.
GENCODE version 23 corresponds to Ensembl 81. GENCODE version 22 corresponds to Ensembl 79. GENCODE version 20 corresponds to Ensembl 76.See also: The GENCODE Project Release History.