Genomic data of the Puerto Rican Parrot (Amazona vittata) from a locally funded project.

Dataset type: Genomic
Data released on September 11, 2012

Oleksyk TK; Guiblet W; Pombert JF; Valentin R; Martinez-Cruzado JC (2012): Genomic data of the Puerto Rican Parrot (Amazona vittata) from a locally funded project. GigaScience.


These data represent the first assembly of a genome sequence for a critically endangered parrot (Amazona vittata) endemic to the United States, and also the first genome of a species from the diverse and ecologically important genus Amazona native to South America and the Caribbean. One sample has been selected from the non-reproductive female at Rio Abajo Breeding Facility in Puerto Rico (IACUC#201109.1), and sequenced on Illumina HiSeq platform with both fragment and paired-end sequencing approaches, resulting in a total of 42,479,499,706 bases. We predicted a total coverage depth of 26.89X of the parrot’s genome: 17.08X coverage for the short fragment reads, and 9.8X coverage for the mate pairs. The sequencing was initiated with the construction of two genome libraries: a short fragment library (~300 bp inserts) for sequencing the majority of the genome, and a long fragment library (~2.5 Kb inserts) to generate scaffolds to be used to order and assemble contigs derived from the short fragment library. The Illumina paired-end and mate-pairs reads were assembled together with Ray (, with the k-mer defined iteratively. In total, given that the genome size is predicted to be 1.58Gb, with the total scaffold length of 1,184, 594,388 bp, the overall coverage of the genome is around 76%, a value that might be slightly overestimated given that some of the scaffolds may be overlapping but could not be assembled. Filtering followed by assembly resulted in 259,423 contigs (N50=6,983 bp, longest = 75,003 bp), which was further combined into 148,255 scaffolds (N50 = 19,470, longest = 206,462 bp). The database contains all of the contigs, scaffolds, corresponding assembly parameters, and the annotations for the known repeats and coding sequences. The assembled scaffolds allow basic genomic annotation and comparative analyses with other available avian whole-genome sequences.

Additional details

Read the peer-reviewed publication(s):

(PubMed: 23587420)
(PubMed: 23587100)
(PubMed: 23587407)

Additional information:

Accessions (data generated as part of this study):

BioProject: PRJNA171587

Sample IDTaxonomic IDCommon NameGenbank NameScientific NameSample Attributes
A. vittata241585Puerto Rican parrotPuerto Rican parrotAmazona vittata Sex:female [PATO:0000383]
Displaying 1-1 of 1 Sample(s).

File NameSample IDData TypeFile FormatSizeRelease Date 
A. vittataOtherEXCEL54.41 MB2012-09-14
A. vittataOtherTEXT32.23 MB2012-09-14
A. vittataGenome sequenceFASTA568.04 MB2012-09-11
A. vittataSequence assemblyFASTA421.38 MB2012-09-11
A. vittataTabular dataTEXT2.06 MB2012-09-11
A. vittataGenome sequenceFASTA338.52 MB2012-09-11
A. vittataOtherTEXT0.31 KB2012-09-11
A. vittataOtherTEXT78.15 KB2012-09-11
A. vittataOtherTEXT0.68 KB2012-09-11
A. vittataOtherTEXT4.15 KB2012-09-11
Displaying 1-10 of 24 File(s).
Date Action