Share this genome browser with the link: https://http_host/h/GCF_020216065.1
Common name: high G+C Gram-positive bacteria
Taxonomic name: Nocardia rosealba, taxonomy ID: 2878563
Sequencing/Assembly provider ID: School of life science
Assembly date: 02 Oct 2021
Assembly type: na
Assembly level: Scaffold
Biosample: SAMN21620047
Assembly accession ID: GCF_020216065.1
Assembly FTP location: GCF/020/216/065/GCF_020216065.1_ASM2021606v1
Total assembly nucleotides: 6,405,167
Assembly contig count: 17
N50 size: 704,561
Clawson, H., Lee, B.T., Raney, B.J. et al.
"GenArk: towards a million UCSC genome browsers.
Genome Biol 24, 217 (2023).
https://doi.org/10.1186/s13059-023-03057-x
This download is only for the purpose of using this assembly hub in
your institution which may have firewall access restrictions to this
data.
To download this assembly data, use this rsync command:
rsync -a -P \ rsync://hgdownload.soe.ucsc.edu/hubs/GCF/020/216/065/GCF_020216065.1/ \ ./GCF_020216065.1/which creates the local directory: ./GCF_020216065.1/
wget --timestamping -m -nH -x --cut-dirs=6 -e robots=off -np -k \ --reject "index.html*" -P "GCF_020216065.1" \ https://hgdownload.soe.ucsc.edu/hubs/GCF/020/216/065/GCF_020216065.1/which creates a local directory: ./GCF_020216065.1/
There is an included hub.txt file in that download
data directory to use for your local track hub instance.
Using the genome browser menus: My Data -> Track Hubs
select the My Hubs tab to enter a URL
to this hub.txt file to attach this assembly hub to a genome browser.
The html/GCF_020216065.1_ASM2021606v1.description.html page is information for your users to
describe this assembly.
This web page with these instructions
is an instance of the html/GCF_020216065.1_ASM2021606v1.description.html file.
See also: track hub help documentation.
There is blat service available for this genome assembly. When viewing this assembly in the genome browser, access the blat service via the Tools -> Blat blue navigation bar menu item.
For local command line blat service, access
the blat service via the gfClient command line operation.
See also:
hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/ to download command line
binaries.
To operate this locally, you will need the GCF_020216065.1.2bit file from:
https://hgdownload.soe.ucsc.edu/hubs/GCF/020/216/065/GCF_020216065.1/Which can be obtained with rsync via:
rsync -a -P rsync://hgdownload.soe.ucsc.edu/hubs/GCF/020/216/065/GCF_020216065.1/GCF_020216065.1.2bit ./With that GCF_020216065.1.2bit file in your working directory where you run this command, for example, a DNA query with your DNA sequence in the file: someDna.fa with result in the file: GCF_020216065.1.someDna.psl
gfClient -t=dna -q=dna -genome=GCF_020216065.1 -genomeDataDir=GCF/020/216/065/GCF_020216065.1 dynablat-01.soe.ucsc.edu 4040 ./ someDna.fa GCF_020216065.1.someDna.pslFor a protein fasta query with your protein sequence in the file: someProtein.faa with result in the file: GCF_020216065.1.someProtein.psl
gfClient -t=dnax -q=prot -genome=GCF_020216065.1 -genomeDataDir=GCF/020/216/065/GCF_020216065.1 dynablat-01.soe.ucsc.edu 4040 ./ someProtein.faa GCF_020216065.1.someProtein.psl
Search the assembly: