Sharing data in the cloud

Dataset type: Imaging, Neuroscience
Data released on October 31, 2016

O’Connor D; Clark DJ; Milham MP; Craddock RC (2016): Sharing data in the cloud GigaScience Database. http://dx.doi.org/10.5524/100233

DOI10.5524/100233

Cloud computing resources, such as Amazon Web Services (AWS), provide pay-as-you-go access to high-performance computer resources and dependable data storage solutions for performing large scale analyses of neuroimaging data . These are particularly attractive for researchers at small universities and in developing countries who lack the wherewithal to maintain their own high performance computing systems. The objective of this project is to upload data from the 1000 Functional Connectomes Project (FCP) and International Neuroimaging Datasharing Initiatives (INDI) grass-roots data sharing initiatives into a Public S3 Bucket that has been generously provided by AWS.
The entirety of the CoRR, ABIDE, ACPI, and ADHD-200 data collections and ENKIRS data for 427 individuals were uploaded during the OHBM Hackathon event. The data are available as individual files to make it easily index able by database infrastructures such as COINs LORIS and others. Additionally, this makes it easy for the users to download just the data that they want. The data in the bucket can be browsed and downloaded using a GUI based S3 file transfer software such as Cyberduck, or using the Boto python library.
The data is structured as follows: bucketname/data/Projects/ProjectName/DataType. For example you can access raw data from the ENKI-RS, by specifying the following path in CyberDuck: https://s3.amazon.com/fcp-indi/data/Projects/RocklandSample/RawData.
Uploading data shared through the FCP and INDI initiatives improves its accessibility for cloud-based and local computation. Future efforts for this project will include uploading the remainder of the FCP and INDI data and organizing the data in the new brain imaging data structure (BIDS) format.

Additional details

Read the peer-reviewed publication(s):

Craddock, R. C., Bellec, P., Margules, D. S., Nichols, B. N., Pfannmöller, J. P., Badhwar, A., … Cipollini, B. (2016). 2015 Brainhack Proceedings. GigaScience, 5(S1), 1–26. doi:10.1186/s13742-016-0147-0

Related datasets:

doi:10.5524/100233 IsPartOf doi:10.5524/100215

Additional information:

https://github.com/DaveOC90/INDI-Organization-Scripts





Displaying 1-1 of 1 File(s).
Date Action
October 31, 2016 Dataset publish
November 17, 2016 Manuscript Link added : 10.1186/s13742-016-0147-0