Tabula Muris is a compendium of single cell transcriptome data from the model organism Mus musculus, containing nearly 100,000 cells from 20 organs and tissues. The data allow for direct and controlled comparison of gene expression in cell types shared between tissues, such as immune cells from distinct anatomical locations.
This track shows the results from FACS sorted cells sequenced with the SmartSeq2 protocol, as it has much higher transcript coverage. The sequencing data comprises more than 2TB and was summarized into a track at UCSC.
As indicated by the "..." after its name, this is a 'super track', a container for subtracks. There are three different subtracks:
BAM files were provided by the data submitters, one (single end) or two files (paired end) per cell. The BAM alignments were used as submitted. They were merged with "samtools merge" into a single BAM file per cluster. The readgroup (RG) BAM tag indicates the original cell.
From the resulting merged BAM file, coverage was obtained using "wiggletools coverage" a tool written by Daniel Zerbino and the result was converted with the UCSC tool "wigToBigWig".
Also on the merged BAM file, the software IntronProspector was run with default settings. It retains reads with a gap longer than 70 bp and shorter than 500 kbp and merges them into annotated splice junctions.
The merged BAM files, coverage bigWig files and splice junctions in bigBed format can be downloaded from the /gbdb fileserver.
Since the splice junction .bigBed files have their scores capped at 1000, the original IntronProspector .bed files are available in the same track hub directory. You can also find there *.calls.tsv files with more details about each junction, e.g. the number of uniquely mapping reads.
WiggleTools was written by Daniel Zerbino, IntronProspector was written by Mark Diekhans, track hubs were written to a large extent by Brian Raney and colleages at the UCSC Genome Browser. Track creation was done by Max Haeussler and tested by Jairo Navarro.
Zerbino DR, Johnson N, Juettemann T, Wilder SP, Flicek P. WiggleTools: parallel processing of large collections of genome-wide datasets for visualization and statistical analysis. Bioinformatics. 2014 Apr 1;30(7):1008-9. PMID: 24363377; PMC: PMC3967112
Mark Diekhans, IntronProspector GitHub Repository. Github 2018
The Tabula Muris Consortium, Stephen R. Quake, Tony Wyss-Coray, Spyros Darmanis: Single-cell transcriptomic characterization of 20 organs and tissues from individual mice creates a Tabula Muris. bioRxiv preprint March 2018, accepted paper in Nature 2018 (562) p.367-372