Common name: human
Taxonomic name: Homo sapiens, taxonomy ID: 9606
Sequencing/Assembly provider ID: T2T Consortium
Assembly date: 24 Jan 2022
Assembly type: haploid
Assembly level: Complete Genome
Biosample: (n/a)
Assembly accession ID: GCA_009914755.4
Assembly FTP location: GCA/009/914/755/GCA_009914755.4_T2T-CHM13v2.0
Total assembly nucleotides: 3,117,292,070
Assembly contig count: 25
N50 size: 150,617,247

Data file downloads

GCA_009914755.4.fa.gz fasta sequence with NCBI GenBank sequence names
GCA_009914755.4.2bit UCSC 2bit sequence file with NCBI GenBank sequence names
GCA_009914755.4.chromAlias.txt chromAlias file to relate chromosome names
GCA_009914755.4.chrNames.fa.gz fasta sequence with chrN sequence names
GCA_009914755.4.chrNames.2bit UCSC 2bit sequence file with chrN sequence names
GCF_009914755.1_T2T-CHM13v2.0.110.20220412.gtf.gz NCBI RefSeq genes GTF file version 110.20220412
GCA_009914755.4_T2T-CHM13v2.0.augustus.gtf.gz gene GTF file
GCA_009914755.4_T2T-CHM13v2.0.xenoRefGene.gtf.gz gene GTF file
catLiftOffGenesV1.gff3.gz gene GFF3 file
catLiftOffGenesV1.gtf.gz gene GTF file
pre-computed indices for alignment programs: bowtie2, bwa-mem2, hisat2, minimap2
explore the hub directory at: hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/
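
The hub directory path above appears to follow a pattern: the accession prefix, then the nine accession digits split into three groups of three, then the full accession. A minimal sketch of that derivation follows. The function names hub_path and hub_file_url are hypothetical helpers, the pattern is inferred from this one accession only, and it is an assumption that each listed file sits at the top level of the hub directory.

```shell
#!/bin/sh
# Sketch only: hub_path and hub_file_url are hypothetical helper names.
# The path pattern is inferred from the single accession on this page.

# Print the hub-relative path for an accession such as GCA_009914755.4.
hub_path() {
    acc=$1
    prefix=${acc%%_*}          # text before the underscore, e.g. GCA
    digits=${acc#*_}           # text after the underscore, e.g. 009914755.4
    digits=${digits%%.*}       # drop the version suffix, e.g. 009914755
    d1=$(printf '%s' "$digits" | cut -c1-3)
    d2=$(printf '%s' "$digits" | cut -c4-6)
    d3=$(printf '%s' "$digits" | cut -c7-9)
    printf '%s/%s/%s/%s/%s\n' "$prefix" "$d1" "$d2" "$d3" "$acc"
}

# Print the https URL for one file, assuming it sits at the top of the hub directory.
hub_file_url() {
    file=$2
    printf 'https://hgdownload.soe.ucsc.edu/hubs/%s/%s\n' "$(hub_path "$1")" "$file"
}
```

For example, hub_file_url GCA_009914755.4 GCA_009914755.4.chromAlias.txt prints a URL that could then be passed to wget or curl to fetch that single file.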

Copy this entire assembly hub for local use

This download is intended only for running this assembly hub within your own institution, for example when firewall restrictions prevent access to this data.
To download this assembly data, use this rsync command:

  rsync -a -P \
    rsync://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/ \
      ./GCA_009914755.4/

which creates the local directory: ./GCA_009914755.4/
or this wget command:

  wget --timestamping -m -nH -x --cut-dirs=6 -e robots=off -np -k \
    --reject "index.html*" -P "GCA_009914755.4" \
       https://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/

which creates a local directory: ./GCA_009914755.4/

The downloaded data directory includes a hub.txt file to use for your local track hub instance.
In the genome browser menus, choose My Data -> Track Hubs,
then select the My Hubs tab and enter the URL of this hub.txt file to attach this assembly hub to a genome browser.
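
Before entering a URL in the My Hubs tab, it can help to confirm that the local copy actually contains hub.txt. The sketch below does that and prints the URL to paste. The function name local_hub_url is hypothetical, and the base URL is an assumption standing in for wherever your own web server publishes the downloaded directory.

```shell
#!/bin/sh
# Sketch only: local_hub_url is a hypothetical helper name.

# Check that the downloaded directory holds hub.txt, then print its URL.
#   $1  local copy of the hub, e.g. ./GCA_009914755.4
#   $2  assumed URL under which your web server publishes that directory
local_hub_url() {
    dir=$1
    base_url=$2
    if [ ! -f "$dir/hub.txt" ]; then
        echo "hub.txt not found in $dir" >&2
        return 1
    fi
    # Strip one trailing slash from the base URL so the result has no "//".
    printf '%s/hub.txt\n' "${base_url%/}"
}
```

The printed URL is the value to enter in the My Hubs tab. The function only checks the local file, not whether the web server is reachable from the genome browser.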

The html/GCA_009914755.4_T2T-CHM13v2.0.description.html page describes this assembly for your users.
This web page with these instructions is an instance of the html/GCA_009914755.4_T2T-CHM13v2.0.description.html file.

blat service

A blat service is available for this genome assembly. When viewing this assembly in the genome browser, access the blat service via the Tools -> Blat item in the blue navigation bar menu.

For command-line use, access the blat service with the gfClient command.
See also: hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/ to download command line binaries.

To operate this locally, you will need the GCA_009914755.4.2bit file from:

  https://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/

This file can be obtained with rsync:

  rsync -a -P \
    rsync://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/GCA_009914755.4.2bit ./

Run the query from the working directory that contains the GCA_009914755.4.2bit file. For example, a DNA query with your DNA sequence in the file someDna.fa, with the result written to the file GCA_009914755.4.someDna.psl:

  gfClient -t=dna -q=dna -genome=GCA_009914755.4 -genomeDataDir=GCA/009/914/755/GCA_009914755.4 \
    dynablat-01.soe.ucsc.edu 4040 ./ someDna.fa GCA_009914755.4.someDna.psl

For a protein fasta query with your protein sequence in the file someProtein.faa, with the result written to the file GCA_009914755.4.someProtein.psl:

  gfClient -t=dnax -q=prot -genome=GCA_009914755.4 -genomeDataDir=GCA/009/914/755/GCA_009914755.4 \
    dynablat-01.soe.ucsc.edu 4040 ./ someProtein.faa GCA_009914755.4.someProtein.psl
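
The two gfClient commands above differ only in their -t and -q flags, so they can be wrapped in one small helper, and the resulting PSL file can be reduced to a short hit list. The sketch below is one possible way to do that. The function names gf_type_flags, blat_query and psl_summary are hypothetical and are not part of the UCSC tools. The summary assumes the standard 21-column PSL layout, in which column 1 is the match count, column 9 the strand, column 10 the query name, column 14 the target name, and columns 16 and 17 the zero-based target start and end.

```shell
#!/bin/sh
# Sketch only: the helper names below are hypothetical, not UCSC tools.

GENOME=GCA_009914755.4
DATA_DIR=GCA/009/914/755/GCA_009914755.4

# Print the -t/-q flags used in the two examples above for a query type.
gf_type_flags() {
    case "$1" in
        dna)     printf '%s\n' '-t=dna -q=dna' ;;
        protein) printf '%s\n' '-t=dnax -q=prot' ;;
        *)       echo "unknown query type: $1" >&2; return 1 ;;
    esac
}

# Run one query against the service.
# Needs network access, gfClient on PATH, and the .2bit file in ./
#   $1 query type (dna or protein), $2 query fasta, $3 output PSL file
blat_query() {
    qtype=$1; query=$2; out=$3
    flags=$(gf_type_flags "$qtype") || return 1
    # $flags is left unquoted on purpose so it splits into two arguments.
    gfClient $flags -genome="$GENOME" -genomeDataDir="$DATA_DIR" \
        dynablat-01.soe.ucsc.edu 4040 ./ "$query" "$out"
}

# Print one line per alignment: query, target:start-end, strand, matches.
# PSL header lines are skipped because their first field is not a number.
# The target start is shifted by one to give a 1-based position.
psl_summary() {
    awk -F'\t' 'NF == 21 && $1 ~ /^[0-9]+$/ {
        printf "%s\t%s:%d-%d\t%s\t%d\n", $10, $14, $16 + 1, $17, $9, $1
    }' "$1"
}
```

A typical use would be blat_query dna someDna.fa GCA_009914755.4.someDna.psl followed by psl_summary GCA_009914755.4.someDna.psl. The target:start-end form can be pasted into the browser position box, provided the sequence names match those of the .2bit file that was queried.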

Search the assembly:

By position or search term: Use the "position or search term" box to find areas of the genome associated with many different attributes, such as a specific chromosomal coordinate range; mRNA, EST, or STS marker names; or keywords from the GenBank description of an mRNA. More information, including sample queries.
By gene name: Type a gene name into the "search term" box, choose your gene from the drop-down list, then press "submit" to go directly to the assembly location associated with that gene. More information.
By track type: Click the "track search" button to find Genome Browser tracks that match specific selection criteria. More information.