Common name: human
Taxonomic name: Homo sapiens, taxonomy ID: 9606
Sequencing/Assembly provider ID: T2T Consortium
Assembly date: 24 Jan 2022
Assembly type: haploid
Assembly level: Complete Genome
Biosample: (n/a)
Assembly accession ID: GCA_009914755.4
Assembly FTP location: GCA/009/914/755/GCA_009914755.4_T2T-CHM13v2.0
Total assembly nucleotides: 3,117,292,070
Assembly contig count: 25
N50 size: 150,617,247


Data file downloads


Copy this entire assembly hub for local use

This download is only needed if you want to run this assembly hub within your institution, for example because firewall restrictions block access to this data.
To download this assembly data, use this rsync command:

  rsync -a -P \
    rsync://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/ \
      ./GCA_009914755.4/
which creates the local directory: ./GCA_009914755.4/
or this wget command:
  wget --timestamping -m -nH -x --cut-dirs=6 -e robots=off -np -k \
    --reject "index.html*" -P "GCA_009914755.4" \
       https://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/
which creates a local directory: ./GCA_009914755.4/
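
The directory path used in both commands appears to follow from the accession itself: the prefix, then the nine digits split into groups of three, then the full accession. The following is a minimal sketch of that mapping, inferred only from the paths shown on this page; the function name accession_to_hub_path is hypothetical and not part of any UCSC tool.

```shell
# Hypothetical helper (not a UCSC tool): derive the hub directory path
# from an assembly accession, following the pattern seen on this page.
accession_to_hub_path() {
  acc="$1"                      # e.g. GCA_009914755.4
  prefix="${acc%%_*}"           # text before the underscore: GCA
  digits="${acc#*_}"            # text after the underscore: 009914755.4
  digits="${digits%%.*}"        # drop the version suffix: 009914755
  d1=$(printf '%s' "$digits" | cut -c1-3)
  d2=$(printf '%s' "$digits" | cut -c4-6)
  d3=$(printf '%s' "$digits" | cut -c7-9)
  printf '%s/%s/%s/%s/%s\n' "$prefix" "$d1" "$d2" "$d3" "$acc"
}

hub_path=$(accession_to_hub_path GCA_009914755.4)
echo "https://hgdownload.soe.ucsc.edu/hubs/$hub_path/"
```

For this assembly the printed URL matches the one used in the wget command above.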

The downloaded directory includes a hub.txt file to use for your local track hub instance.
In the genome browser menus, go to My Data -> Track Hubs and select the My Hubs tab,
then enter the URL of this hub.txt file to attach this assembly hub to a genome browser.

The html/GCA_009914755.4_T2T-CHM13v2.0.description.html page describes this assembly for your users.
This web page, including these instructions, is an instance of that html/GCA_009914755.4_T2T-CHM13v2.0.description.html file.
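
Before pointing a genome browser at a local copy, it may help to confirm that the download completed. The sketch below checks only for the two files this page names at the top level of the hub directory (hub.txt and the .2bit file); the function name check_hub_dir is hypothetical, and a complete hub contains many more files than these.

```shell
# Hypothetical sanity check (not a UCSC tool): report which of the
# expected top-level files are missing from a downloaded hub directory.
check_hub_dir() {
  dir="$1"        # local hub directory, e.g. ./GCA_009914755.4
  acc="$2"        # assembly accession, e.g. GCA_009914755.4
  missing=0
  for f in hub.txt "$acc.2bit"; do
    if [ ! -f "$dir/$f" ]; then
      echo "missing: $dir/$f"
      missing=1
    fi
  done
  return "$missing"   # 0 when both files are present, 1 otherwise
}

# Example usage after the rsync or wget download:
#   check_hub_dir ./GCA_009914755.4 GCA_009914755.4 && echo "hub looks usable"
```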

See also: track hub help documentation.


blat service

A blat service is available for this genome assembly. When viewing this assembly in the genome browser, access the blat service via the Tools -> Blat item in the blue navigation bar menu.

To query the blat service from a local command line, use the gfClient command.
See also: hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/ to download command line binaries.

To operate this locally, you will need the GCA_009914755.4.2bit file from:

  https://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/
which can be obtained with rsync via:
  rsync -a -P \
    rsync://hgdownload.soe.ucsc.edu/hubs/GCA/009/914/755/GCA_009914755.4/GCA_009914755.4.2bit ./
With the GCA_009914755.4.2bit file in the working directory where you run the command, here is an example DNA query with your DNA sequence in the file someDna.fa and the result written to the file GCA_009914755.4.someDna.psl:
  gfClient -t=dna -q=dna -genome=GCA_009914755.4 -genomeDataDir=GCA/009/914/755/GCA_009914755.4 \
    dynablat-01.soe.ucsc.edu 4040 ./ someDna.fa GCA_009914755.4.someDna.psl
For a protein fasta query with your protein sequence in the file someProtein.faa and the result written to the file GCA_009914755.4.someProtein.psl:
  gfClient -t=dnax -q=prot -genome=GCA_009914755.4 -genomeDataDir=GCA/009/914/755/GCA_009914755.4 \
    dynablat-01.soe.ucsc.edu 4040 ./ someProtein.faa GCA_009914755.4.someProtein.psl
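
The result files are in PSL format, which is tab-separated with 21 columns per alignment. As a rough sketch of how to get a quick overview of a result, the function below prints the query name, target name, target start, target end, and match count for each alignment row, skipping any header lines. The function name psl_summary is hypothetical; the column positions assume the standard PSL layout (matches in column 1, qName in column 10, tName in column 14, tStart and tEnd in columns 16 and 17).

```shell
# Hypothetical helper (not a UCSC tool): summarize alignments in a PSL file.
# Prints: qName, tName, tStart, tEnd, matches (tab-separated).
psl_summary() {
  # Keep only rows with 21 tab-separated fields whose first field is a
  # number; this drops header lines if the file has them.
  awk -F'\t' -v OFS='\t' \
    'NF == 21 && $1 ~ /^[0-9]+$/ { print $10, $14, $16, $17, $1 }' "$1"
}

# Example usage on the DNA query result named above:
#   psl_summary GCA_009914755.4.someDna.psl | sort -k5,5nr | head
```

Sorting on the fifth output column, as in the usage comment, lists the alignments with the most matching bases first.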


Search the assembly: