Supporting data for "Keemei: cloud-based validation of tabular bioinformatics file formats in Google Sheets".

Dataset type: Software
Data released on June 14, 2016

Rideout JR; Chase JH; Boylen E; Ackermann GL; Gonzalez A; Knight R; Caporaso JG (2016): Supporting data for "Keemei: cloud-based validation of tabular bioinformatics file formats in Google Sheets". GigaScience.


Bioinformatics software often requires human-generated tabular text files as input and have specific requirements for how those data are formatted. Users frequently manage these data in spreadsheet programs, which is convenient for researchers who are compiling the requisite information because the spreadsheet programs can easily be used on different platforms including laptops and tablets, and because they provide a familiar interface. It is increasingly common for many different researchers to be involved in compiling these data, including study coordinators, clinicians, lab technicians, and bioinformaticians. As a result, many research groups are shifting toward using cloud-based spreadsheet programs, such as Google Sheets, which support concurrent editing of a single spreadsheet by different users working on different platforms. Often most of the researchers who are entering data will not be familiar with the formatting requirements of the bioinformatics programs that will be used, so validating and correcting file formats is often a bottleneck prior to beginning bioinformatics analysis.
We present Keemei, a Google Sheets Add-on for validating tabular files used in bioinformatics analyses. Keemei is available free of charge from Google’s Chrome Web Store. Keemei can be installed and run on any web browser supported by Google Sheets. Keemei currently supports validation of two widely used tabular bioinformatics formats, the QIIME sample metadata mapping file format, and the Spatially Referenced Genetic Data (SRGD) format, but is designed to easily support the addition of others.
Included in this GigaDB dataset are the archival code and test data as reviewed at the time of publication, for the most recent version of the application please visit the Keemei website.

Additional details

Read the peer-reviewed publication(s):

Additional information:

File NameSample IDData TypeFile FormatSizeRelease Date 
Tabular dataEXCEL5.39 MB2016-06-14
GitHub archivearchive1.93 MB2016-06-14
ReadmeTEXT0.95 KB2016-06-14
Displaying 1-3 of 3 File(s).
Date Action
June 14, 2016 Dataset publish
August 23, 2016 Manuscript Link added : 10.1186/s13742-016-0133-6