A Cloud Infrastructure for Optimization of a Massive Parallel Sequencing Workflow

Authors: Francesco Abate, Andrea Acquaviva, L. Mossucca, R. Provenzano, Olivier Terzo
Publisher:

ABOUT BOOK

Massive Parallel Sequencing is a term used to describe several revolutionary approaches to DNA sequencing, the so-called Next Generation Sequencing technologies. These technologies generate millions of short sequence fragments in a single run and can be used to measure levels of gene expression and to identify novel splice variants of genes, allowing more accurate analysis. The proposed solution is novel in two respects. First, the read mapping algorithm has been optimized so that its processes can run in parallel. Second, an architecture has been implemented that consists of a Grid platform composed of physical nodes, a Virtual platform composed of virtual nodes set up on demand, and a scheduler that integrates the two platforms.
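
The scheduling idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the names split_reads, schedule, Node, slots_per_node and max_virtual are hypothetical, and the policy (fill physical Grid nodes first, start virtual nodes only when those are full) is an assumption made for illustration. The read mapping step itself is not shown; reads are only divided into chunks that could be mapped in parallel.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """A worker node; 'virtual' marks nodes created on demand (hypothetical model)."""
    name: str
    virtual: bool
    chunks: List[List[str]] = field(default_factory=list)


def split_reads(reads: List[str], chunk_size: int) -> List[List[str]]:
    """Divide the reads into fixed-size chunks that can be mapped independently."""
    return [reads[i:i + chunk_size] for i in range(0, len(reads), chunk_size)]


def schedule(chunks, physical, slots_per_node, max_virtual):
    """Assign chunks to physical nodes first, adding virtual nodes on demand.

    Assumed policy, for illustration only:
      1. use any existing node that still has a free slot (physical nodes come first);
      2. otherwise start a new virtual node, up to max_virtual;
      3. otherwise queue the chunk on the least-loaded node.
    """
    nodes = [Node(name, virtual=False) for name in physical]
    for chunk in chunks:
        target = next((n for n in nodes if len(n.chunks) < slots_per_node), None)
        if target is None:
            n_virtual = sum(n.virtual for n in nodes)
            if n_virtual < max_virtual:
                target = Node(f"vm-{n_virtual}", virtual=True)
                nodes.append(target)
            else:
                target = min(nodes, key=lambda n: len(n.chunks))
        target.chunks.append(chunk)
    return nodes


if __name__ == "__main__":
    reads = [f"read{i}" for i in range(10)]
    plan = schedule(split_reads(reads, 2), ["grid-0", "grid-1"],
                    slots_per_node=1, max_virtual=2)
    for node in plan:
        kind = "virtual" if node.virtual else "physical"
        print(node.name, kind, len(node.chunks))
```

In this sketch virtual nodes appear only once the physical nodes are saturated, which mirrors the "set up on demand" wording of the description; a real scheduler would also track job completion and release virtual nodes when they are idle.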
