This directory contains the Nov. 2005 freeze of the D. yakuba genome
(droYak2) produced by the Genome Sequencing Center at Washington 
University School of Medicine in St. Louis.

Files included in this directory:

  - chr*.fa.gz: compressed FASTA sequence of each chromosome.
    Repeats (from RepeatMasker and Tandem Repeat Finder) 
    are in lower case while non-repeating sequence is in upper case.
    RepeatMasker Nov. 2005 (open-3-1-2) version with RepBase libraries: 
    RepBase Update 9.11, RM database version 20050112

  - md5sum.txt: checksums of files in this directory.


------------------------------------------------------------------
If you plan to download a large file or multiple files from this 
directory, we recommend that you use ftp rather than downloading the 
files via our website. To do so, ftp to hgdownload.cse.ucsc.edu, then 
go to the directory goldenPath/droYak2/chromosomes. To download multiple 
files, use the "mget" command:

    mget <filename1> <filename2> ...
    - or -
    mget -a (to download all the files in the directory)

The D. yakuba sequence is made freely available before scientific 
publication by The Genome Sequencing Center, WUSTL School of Medicine with 
the following understanding: 

1. The data may be freely downloaded, used in analyses, and repackaged in
   databases. 
2. Users are free to use the data in scientific papers analyzing particular 
   genes and regions if the providers of these data (Genome Sequencing 
   Center, WUSTL School of Medicine) are properly acknowledged. 
3. The Drosophila yakuba analysis group is aiming to publish an initial 
   analysis of the D. yakuba genome sequence in 2005 (submitted in early 
   2005) that will include descriptions of the assembly, genome landscape, 
   comparative analysis and initial gene content. People who would like to 
   coordinate other genome-wide analysis with this work should contact 
   Richard K. Wilson, Genome Sequencing Center Director, Washington 
   University School of Medicine. We welcome a coordinated approach to 
   describing this community resource. 
4. Any redistribution of the data should carry this notice. 
      Name                    Last modified      Size  Description
Parent Directory - chr2L.fa.gz 2005-11-15 16:25 6.8M chr2L_random.fa.gz 2005-11-15 16:25 1.2M chr2R.fa.gz 2005-11-15 16:26 6.4M chr2R_random.fa.gz 2005-11-15 16:26 14K chr2h.fa.gz 2005-11-15 16:26 262K chr2h_random.fa.gz 2005-11-15 16:26 1.1M chr3L.fa.gz 2005-11-15 16:26 7.3M chr3L_random.fa.gz 2005-11-15 16:26 1.3M chr3R.fa.gz 2005-11-15 16:26 8.7M chr3R_random.fa.gz 2005-11-15 16:26 815K chr3h.fa.gz 2005-11-15 16:26 70K chr3h_random.fa.gz 2005-11-15 16:26 273K chr4.fa.gz 2005-11-15 16:25 427K chr4_random.fa.gz 2005-11-15 16:25 9.3K chrM.fa.gz 2005-11-15 16:25 5.0K chrU.fa.gz 2005-11-15 16:25 7.5M chrUh.fa.gz 2005-11-15 16:26 202K chrX.fa.gz 2005-11-15 16:25 6.6M chrX_random.fa.gz 2005-11-15 16:25 515K chrXh.fa.gz 2005-11-15 16:26 53K chrYh.fa.gz 2005-11-15 16:26 32K md5sum.txt 2005-11-15 16:26 1.0K