This directory contains the May 2005 Zv5 assembly of the zebrafish genome 
(UCSC version danRer3) obtained from the Wellcome Trust Sanger Institute 
and produced by a collaboration between the Wellcome Trust Sanger Institute 
in Cambridge, UK, the Max Planck Institute for Developmental Biology in 
Tuebingen, Germany, the Netherlands Institute for Developmental Biology 
(Hubrecht Laboratory), Utrecht, The Netherlands and Yi Zhou and Leonard 
Zon from the Children's Hospital in Boston, Massachusetts.

Files included in this directory:

  - chr*.fa.gz: gzip compressed FASTA sequence of each chromosome.
    Repeats (from RepeatMasker and Tandem Repeats Finder) 
    are in lower case while non-repeating sequence is in upper case.
    RepeatMasker version open-3.0 was run with the RepBase libraries
    RepBase Update 9.04, RM database version 20040702, with the addition of
    the zebunc.ref (Zebrafish Unclassified) repeats library from RepBase 9.06.
  
  - scaffold*.fa.gz: gzip compressed FASTA sequence of individual scaffolds 
    for chrNA and chrUn. These are repeat-masked as described above.

  - md5sum.txt: MD5 checksums for the files in this directory.
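
Because repeats are marked only by letter case, the masked portion of a
sequence can be measured by counting lower-case bases. The following is a
minimal Python sketch of that idea, not a UCSC tool; the function names
masked_fraction and masked_fraction_gz are hypothetical, and any local file
path passed to them is an assumption about where you saved the download.

```python
import gzip
import io


def masked_fraction(fasta_handle):
    """Return (lowercase_bases, total_bases) for a FASTA text stream.

    Header lines (starting with '>') are skipped; every other line is
    treated as sequence. Lower-case letters are counted as repeat-masked.
    """
    lower = 0
    total = 0
    for line in fasta_handle:
        if line.startswith(">"):
            continue
        seq = line.strip()
        total += len(seq)
        lower += sum(1 for base in seq if base.islower())
    return lower, total


def masked_fraction_gz(path):
    """Same as masked_fraction, reading a gzip compressed FASTA file."""
    with gzip.open(path, "rt") as handle:
        return masked_fraction(handle)


# Small in-memory demonstration (made-up sequence, not real data).
demo = io.StringIO(">chrDemo\nACGTacgtNN\nacgtACGT\n")
lower, total = masked_fraction(demo)
print(lower, total)  # prints "8 18"
```

For a downloaded file, masked_fraction_gz("chrM.fa.gz") would return the
same pair of counts for that chromosome.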

------------------------------------------------------------------
If you plan to download a large file or multiple files from this 
directory, we recommend you use ftp rather than downloading the files 
via our website. To do so, ftp to hgdownload.cse.ucsc.edu, then go to 
the directory goldenPath/danRer3/chromosomes. To download multiple files, 
use the "mget" command:

    mget <filename1> <filename2> ...
    - or -
    mget -a (to download all the files in the directory)
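
The same download can be scripted. The sketch below uses Python's standard
ftplib with the host and directory named above, then compares each file
against md5sum.txt. It makes several assumptions: that the server accepts
anonymous FTP logins, that md5sum.txt uses the usual md5sum layout of
"<digest>  <filename>" per line, and that the function names fetch,
parse_md5sum and md5_of_file (all hypothetical) suit your needs.

```python
import hashlib
from ftplib import FTP

HOST = "hgdownload.cse.ucsc.edu"
REMOTE_DIR = "goldenPath/danRer3/chromosomes"


def fetch(filenames, host=HOST, remote_dir=REMOTE_DIR):
    """Download each named file into the current directory.

    Assumes the server allows anonymous FTP login.
    """
    with FTP(host) as ftp:
        ftp.login()  # anonymous login
        ftp.cwd(remote_dir)
        for name in filenames:
            with open(name, "wb") as out:
                ftp.retrbinary("RETR " + name, out.write)


def parse_md5sum(text):
    """Map filename -> digest, assuming '<digest>  <filename>' lines."""
    sums = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2:
            digest, name = parts
            # md5sum marks binary-mode entries with a leading '*'.
            sums[name.lstrip("*")] = digest.lower()
    return sums


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    md5 = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
    return md5.hexdigest()


# Example use (commented out because it needs network access):
# fetch(["chrM.fa.gz", "md5sum.txt"])
# expected = parse_md5sum(open("md5sum.txt").read())
# assert md5_of_file("chrM.fa.gz") == expected["chrM.fa.gz"]
```

Reading files in chunks keeps memory use low for the larger archives such
as chrNA.fa.gz.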

The Zv5 zebrafish sequence data were produced by the Zebrafish Sequencing 
Group at the Sanger Institute and can be obtained directly from 
ftp://ftp.ensembl.org/pub/assembly/zebrafish/Zv5release/. All sequence data 
are made available before scientific publication with the understanding that 
the groups involved in generating the data intend to publish the initial 
large-scale analyses of the dataset. This will include a summary detailing 
the data that have been generated and key features of the genome identified 
from genomic assembly and clone mapping/sequencing. Any redistribution of 
the data should carry this notice. 

  Name                    Last modified      Size  Description
  chrNA.fa.gz             2005-08-04 14:57    72M
  scaffoldNA.fa.gz        2005-08-04 14:59    72M
  scaffoldUn.fa.gz        2005-08-04 14:59    54M
  chrUn.fa.gz             2005-08-04 14:57    54M
  chr5.fa.gz              2005-08-04 14:57    22M
  chr19.fa.gz             2005-08-04 14:57    22M
  chr14.fa.gz             2005-08-04 14:56    21M
  chr7.fa.gz              2005-08-04 14:57    18M
  chr20.fa.gz             2005-08-04 14:57    17M
  chr1.fa.gz              2005-08-04 14:56    17M
  chr23.fa.gz             2005-08-04 14:57    17M
  chr16.fa.gz             2005-08-04 14:57    16M
  chr18.fa.gz             2005-08-04 14:57    16M
  chr17.fa.gz             2005-08-04 14:57    15M
  chr22.fa.gz             2005-08-04 14:57    15M
  chr2.fa.gz              2005-08-04 14:57    15M
  chr13.fa.gz             2005-08-04 14:56    14M
  chr3.fa.gz              2005-08-04 14:57    14M
  chr15.fa.gz             2005-08-04 14:56    14M
  chr8.fa.gz              2005-08-04 14:57    13M
  chr9.fa.gz              2005-08-04 14:57    13M
  chr11.fa.gz             2005-08-04 14:56    13M
  chr21.fa.gz             2005-08-04 14:57    12M
  chr10.fa.gz             2005-08-04 14:56    12M
  chr12.fa.gz             2005-08-04 14:56    11M
  chr24.fa.gz             2005-08-04 14:57    10M
  chr4.fa.gz              2005-08-04 14:57    10M
  chr6.fa.gz              2005-08-04 14:57   9.9M
  chr25.fa.gz             2005-08-04 14:57   8.7M
  chrM.fa.gz              2005-08-04 14:57   5.3K
  md5sum.txt              2005-08-04 15:36   1.3K