Share this genome browser with the link: https://http_host/h/GCF_900101475.1
Common name: a-proteobacteria
Taxonomic name: Ruegeria marina, taxonomy ID: 639004
Sequencing/Assembly provider ID: DOE - JOINT GENOME INSTITUTE
Assembly date: 21 Oct 2016
Assembly type: na
Assembly level: Scaffold
Biosample: SAMN04488239
Assembly accession ID: GCF_900101475.1
Assembly FTP location: GCF/900/101/475/GCF_900101475.1_IMG-taxon_2617270884_annotated_assembly
Total assembly nucleotides: 4,995,422
Assembly contig count: 51
N50 size: 200,448
Clawson, H., Lee, B.T., Raney, B.J. et al.
"GenArk: towards a million UCSC genome browsers.
Genome Biol 24, 217 (2023).
https://doi.org/10.1186/s13059-023-03057-x
This download is only for the purpose of using this assembly hub in
your institution which may have firewall access restrictions to this
data.
To download this assembly data, use this rsync command:
rsync -a -P \ rsync://hgdownload.soe.ucsc.edu/hubs/GCF/900/101/475/GCF_900101475.1/ \ ./GCF_900101475.1/which creates the local directory: ./GCF_900101475.1/
wget --timestamping -m -nH -x --cut-dirs=6 -e robots=off -np -k \ --reject "index.html*" -P "GCF_900101475.1" \ https://hgdownload.soe.ucsc.edu/hubs/GCF/900/101/475/GCF_900101475.1/which creates a local directory: ./GCF_900101475.1/
There is an included hub.txt file in that download
data directory to use for your local track hub instance.
Using the genome browser menus: My Data -> Track Hubs
select the My Hubs tab to enter a URL
to this hub.txt file to attach this assembly hub to a genome browser.
The html/GCF_900101475.1_IMG-taxon_2617270884_annotated_assembly.description.html page is information for your users to
describe this assembly.
This web page with these instructions
is an instance of the html/GCF_900101475.1_IMG-taxon_2617270884_annotated_assembly.description.html file.
See also: track hub help documentation.
There is blat service available for this genome assembly. When viewing this assembly in the genome browser, access the blat service via the Tools -> Blat blue navigation bar menu item.
For local command line blat service, access
the blat service via the gfClient command line operation.
See also:
hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/ to download command line
binaries.
To operate this locally, you will need the GCF_900101475.1.2bit file from:
https://hgdownload.soe.ucsc.edu/hubs/GCF/900/101/475/GCF_900101475.1/Which can be obtained with rsync via:
rsync -a -P rsync://hgdownload.soe.ucsc.edu/hubs/GCF/900/101/475/GCF_900101475.1/GCF_900101475.1.2bit ./With that GCF_900101475.1.2bit file in your working directory where you run this command, for example, a DNA query with your DNA sequence in the file: someDna.fa with result in the file: GCF_900101475.1.someDna.psl
gfClient -t=dna -q=dna -genome=GCF_900101475.1 -genomeDataDir=GCF/900/101/475/GCF_900101475.1 dynablat-01.soe.ucsc.edu 4040 ./ someDna.fa GCF_900101475.1.someDna.pslFor a protein fasta query with your protein sequence in the file: someProtein.faa with result in the file: GCF_900101475.1.someProtein.psl
gfClient -t=dnax -q=prot -genome=GCF_900101475.1 -genomeDataDir=GCF/900/101/475/GCF_900101475.1 dynablat-01.soe.ucsc.edu 4040 ./ someProtein.faa GCF_900101475.1.someProtein.psl
Search the assembly: