From March 31 to April 1, the National Human Genome Research Institute (NHGRI) is holding a meeting to explore the use of cloud computing for the storage, management, and analysis of genomic data, including the computing issues and the implementation for biological analyses. I wanted to make this site available for those who will not be in attendance but who want to contribute (post comments here or via Twitter (@garystiehr)).
The agenda contains presentations and discussion surrounding some specific topics:
- an overview of various cloud offerings as well as a survey of some genomics-specific cloud computing pilot projects
- an overview of some of the associated technical questions:
- transmitting large genomic data sets
- computational architecture of cloud environments
- cloud security and relevant NIH data privacy policies
Further, a general discussion session will be held regarding supporting genomic analyses using cloud computing:
- What are the key challenges for genomic analyses within a cloud?
- Under what conditions are cloud environments better than local clusters of computers?
- What types of analyses of large genomic datasets are clouds appropriate for currently and in the near future?
- What would be needed for cloud environments to work better for genomic analyses?
- What features of a cloud environment are appropriate for genome repositories to use?
- What alternatives to cloud environments exist or may be developed, and under what conditions would these be better than clouds?
- Are data standards needed - if so, what areas are they needed?
- What best practices are needed?
Please, join in the discussion on the topics above and help identify if there are principals from within the cloud computing arena that can be used to advance the state of genomics research. Post comments here or via Twitter (@garystiehr).