Arvados Open Source Platform for Managing and Processing Genomic Data Now Available on Microsoft Azure

Press Release | CUROVERSE, INC. | February 10, 2016

Curoverse Provides a New Solution for Gaining Insight from Terabytes to Petabytes of Sequencing Data on Azure

Boston, MA (PRWEB) February 10, 2016-Curoverse announced today that the open-source, multi-platform Arvados system is now available on Microsoft Azure. On Azure, Arvados provides capabilities for managing and processing genomic and health data at petabyte scale.

“We saw clear demand for Microsoft Azure from major institutions collecting and analyzing genomic data,” said Adam Berrey chief executive officer at Curoverse. “Microsoft Azure offers powerful capabilities and broad institutional support for research and pharmaceutical organizations, so we are excited to collaborate with Microsoft to deliver Arvados on Azure.”

The Arvados platform is being adopted by major genomic sequencing centers in the US, Europe, and Asia to meet the big data management and processing issues created by next generation sequencing as well as the analysis of related data such as RNA, microbiome, and imaging. Curoverse is working with Microsoft to help customers take advantage of Azure capabilities such as scalable compute, rapid deployment of new clusters, control over data locality, and high-speed connectivity to research institutions. The company is also collaborating with Microsoft to develop new genomic pipeline benchmarking tools that will be available on Azure.

“We are committed to empowering scientists and health professionals to solve the world’s biggest challenges by offering choice and flexibility of technology,” said David Heckerman, Distinguished Scientist at Microsoft Corp. “As a service layer on top of Azure, Arvados accelerates scientific discovery and clinical diagnostics using genomic and health data. The Arvados platform builds on Azure with advanced capabilities for managing large collections of files and reliably running reproducible computational workflows.”

Unlike software-as-a-service solutions for managing and processing genomic data, Arvados can be installed in an organization’s existing Microsoft cloud infrastructure or on-premises computing infrastructure. As a result, organizations using Arvados do not need to transfer their data or code outside of systems they control. In addition to control and security, this gives customers access to the most competitive cloud pricing and complete transparency. Curoverse provides a subscription service for maintaining, supporting, and operating Arvados clusters in the cloud and on-premises.

The source code for Arvados can be downloaded now from the Arvados project ( or on Github (, and users can try the solution using a trial cluster operated by Curoverse on Azure (


Curoverse is an investor-backed software company and the largest contributor to the free and open source Arvados big data platform. Curoverse provides a subscription service to ensure the success of organizations using Arvados in the cloud and on-premises. The company is based in Massachusetts and more information is available at