Data released on September 11, 2012

Genomic data of the Puerto Rican Parrot (Amazona vittata) from a locally funded project.

Guiblet, W.; Martinez-Cruzado, J.-C.; Oleksyk, T. K.; Pombert, J.-F.; Valentin, R. (2012): Genomic data of the Puerto Rican Parrot (Amazona vittata) from a locally funded project. GigaScience.

These data represent the first assembly of a genome sequence for a critically endangered parrot (Amazona vittata) endemic to the United States, and also the first genome of a species from the diverse and ecologically important genus Amazona, native to South America and the Caribbean. One sample was taken from a non-reproductive female at the Rio Abajo Breeding Facility in Puerto Rico (IACUC#201109.1) and sequenced on the Illumina HiSeq platform with both fragment and paired-end sequencing approaches, resulting in a total of 42,479,499,706 bases. We predicted a total coverage depth of 26.89X of the parrot’s genome: 17.08X coverage for the short fragment reads, and 9.8X coverage for the mate pairs. The sequencing was initiated with the construction of two genome libraries: a short fragment library (~300 bp inserts) for sequencing the majority of the genome, and a long fragment library (~2.5 kb inserts) to generate scaffolds used to order and assemble contigs derived from the short fragment library. The Illumina paired-end and mate-pair reads were assembled together with Ray, with the k-mer size defined iteratively. Given a predicted genome size of 1.58 Gb and a total scaffold length of 1,184,594,388 bp, the assembly covers around 76% of the genome, a value that might be slightly overestimated because some of the scaffolds may overlap but could not be assembled together. Filtering followed by assembly resulted in 259,423 contigs (N50 = 6,983 bp, longest = 75,003 bp), which were further combined into 148,255 scaffolds (N50 = 19,470 bp, longest = 206,462 bp). The database contains all of the contigs, scaffolds, corresponding assembly parameters, and the annotations for the known repeats and coding sequences. The assembled scaffolds allow basic genomic annotation and comparative analyses with other available avian whole-genome sequences.
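The depth and coverage figures quoted above follow from simple ratios, and N50 is a standard summary of contig or scaffold lengths. The Python sketch below is illustrative only and is not part of the dataset: the function names are hypothetical, the genome size is the rounded 1.58 Gb estimate from the description, and the list passed to `n50` is toy data rather than real contig lengths. With the rounded genome size, the depth comes out at about 26.89X and the assembled fraction at roughly three quarters of the genome, close to the reported value of around 76%.

```python
def coverage_depth(total_bases, genome_size):
    """Average sequencing depth: sequenced bases divided by genome size."""
    return total_bases / genome_size


def assembled_fraction(total_scaffold_length, genome_size):
    """Share of the predicted genome represented by the scaffolds."""
    return total_scaffold_length / genome_size


def n50(lengths):
    """Length L such that sequences of length >= L hold at least half of all bases."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("n50 needs at least one sequence length")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Figures taken from the dataset description; the genome size is a rounded estimate.
GENOME_SIZE = 1.58e9
TOTAL_BASES = 42_479_499_706
TOTAL_SCAFFOLD_LENGTH = 1_184_594_388

print(f"depth: {coverage_depth(TOTAL_BASES, GENOME_SIZE):.2f}X")
print(f"assembled: {assembled_fraction(TOTAL_SCAFFOLD_LENGTH, GENOME_SIZE):.1%}")

# Toy lengths (hypothetical), only to show how N50 is computed.
print("toy N50:", n50([2, 3, 4, 5, 6]))
```

To reproduce the reported N50 values, the same `n50` function could be applied to the sequence lengths read from the contig and scaffold FASTA files in this dataset.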

Read the peer-reviewed publication(s):

Oleksyk, T. K., Pombert, J.-F., Siu, D., Mazo-Vargas, A., Ramos, B., Guiblet, W., … Martinez-Cruzado, J.-C. (2012). A locally funded Puerto Rican parrot (Amazona vittata) genome sequencing project increases avian data and advances young researcher education. GigaScience, 1(1). doi:10.1186/2047-217x-1-14 (PubMed: 23587420)
O’Brien, S. J. (2012). Erratum to: Genome empowerment for the Puerto Rican parrot – Amazona vittata. GigaScience, 1(1). doi:10.1186/2047-217x-1-17 (PubMed: 23587100)
O’Brien, S. J. (2012). Genome empowerment for the Puerto Rican parrot – Amazona vittata. GigaScience, 1(1). doi:10.1186/2047-217x-1-13 (PubMed: 23587407)

Accessions (data included in GigaDB):

BioProject: PRJNA171587


Samples:

Sample ID | Taxonomic ID | Common Name | Genbank Name | Scientific Name | Sample Attributes
A. vittata | 241585 | Puerto Rican parrot | Puerto Rican parrot | Amazona vittata | Sex: female

Displaying 1-1 of 1 Sample(s).

Files:

File Name | Sample ID | Data Type | File Format | Size | Release Date
 | A. vittata | Other | EXCEL | 54.41 MB | 2012-09-14
 | A. vittata | Other | TEXT | 32.23 MB | 2012-09-14
 | A. vittata | Genome sequence | FASTA | 568.04 MB | 2012-09-11
 | A. vittata | Sequence assembly | FASTA | 421.38 MB | 2012-09-11
 | A. vittata | Tabular data | TEXT | 2.06 MB | 2012-09-11
 | A. vittata | Genome sequence | FASTA | 338.52 MB | 2012-09-11
 | A. vittata | Other | TEXT | 0.31 KB | 2012-09-11
 | A. vittata | Other | TEXT | 78.15 KB | 2012-09-11
 | A. vittata | Other | TEXT | 0.68 KB | 2012-09-11
 | A. vittata | Other | TEXT | 4.15 KB | 2012-09-11
Displaying 1-10 of 24 File(s).