Description

The NIH Genotype-Tissue Expression (GTEX) project was created to establish a sample and data resource for studies on the relationship between genetic variation and gene expression in multiple human tissues. This track hub shows RNA-seq read coverage signal graphs of GTEx samples in the GTEx midpoint milestone data release (V6, October 2015). A total of 7572 tracks representing samples from 570 adult post-mortem individuals are included.

Display Conventions

The individual sample tracks are organized by gender (Female Donors, Male Donors) and age, in 6 tracks, where samples can be selected from a donor vs. tissue matrix.

An alternative organization is provided by the 'By Tissues' supertrack set, which consists of a track containing all samples for each of the 53 tissues, with a selection matrix of donor gender and age.

Methods

Tissue samples were obtained using the GTEx standard operating procedures for informed consent and tissue collection, in conjunction with the National Cancer Institute Biorepositories and Biospecimen. All tissue specimens were reviewed by pathologists to characterize and verify organ source. Images from stained tissue samples can be viewed via the NCI histopathology viewer. The Qiagen PAXgene non-formalin tissue preservation product was used to stabilize tissue specimens without cross-linking biomolecules.

The Illumina TruSeq protocol was used to create an unstranded polyA+ library sequenced on the Illumina HiSeq 2000 platform to produce 76-bp paired end reads at a depth averaging 50M aligned reads per sample.

Sequence reads for this track were aligned to the hg38/GRCh38 human genome using STAR2 assisted by the GENCODE v24 transcriptome definition. The read coverage signal graphs were produced using the --outWig option of STAR2. Read mapping was performed at UCSC by the Computational Genomics lab, using the Toil pipeline. The resulting bedGraph files were converted to bigWig format for display. The bedGraph files were coordinate converted to the hg19/GRCh37 assembly via the UCSC liftOver tool.

Subject and Sample Characteristics

The scientific goal of the GTEx project required that the donors and their biospecimen present with no evidence of disease. The tissue types collected were chosen based on their clinical significance, logistical feasibility and their relevance to the scientific goal of the project and the research community. Postmortem samples were collected from non-diseased donors with ages ranging from 20 to 79. 34.4% of donors were female and 65.6% male.

Additional summary plots of GTEx sample characteristics are available at the GTEx Portal Tissue Summary page.

Credits

Samples were collected by the GTEx Consortium. RNA-seq was performed by the GTEx Laboratory, Data Analysis and Coordinating Center (LDACC) at the Broad Institute. John Vivian, Melissa Cline, and Benedict Paten of the UCSC Computational Genomics lab were responsible for the sequence read mapping and signal file generation used to produce this hub. Kate Rosenbloom and Parisa Nejad of the UCSC Genome Browser group were responsible for data file post-processing and track hub configuration.

Contacts

For questions about the GTEx data, contact the GTEx LDACC at the Broad Institute. For questions about this track hub, contact the UCSC Genome Browser mailing list.

References

GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013 Jun;45(6):580-5. PMID: 23715323; PMC: PMC4010069

Carithers LJ, Ardlie K, Barcus M, Branton PA, Britton A, Buia SA, Compton CC, DeLuca DS, Peter-Demchok J, Gelfand ET et al. A Novel Approach to High-Quality Postmortem Tissue Procurement: The GTEx Project. Biopreserv Biobank. 2015 Oct;13(5):311-9. PMID: 26484571; PMC: PMC4675181

Melé M, Ferreira PG, Reverter F, DeLuca DS, Monlong J, Sammeth M, Young TR, Goldmann JM, Pervouchine DD, Sullivan TJ et al. Human genomics. The human transcriptome across tissues and individuals. Science. 2015 May 8;348(6235):660-5. PMID: 25954002; PMC: PMC4547472

DeLuca DS, Levin JZ, Sivachenko A, Fennell T, Nazaire MD, Williams C, Reich M, Winckler W, Getz G. RNA-SeQC: RNA-seq metrics for quality control and process optimization. Bioinformatics. 2012 Jun 1;28(11):1530-2. PMID: 22539670; PMC: PMC3356847