Where Biology Meets Big Data
Computational Challenges In High-Throughput DNA Sequencing
High-throughput DNA sequencing has created a revolution in biology and has caused an exponential growth in the amount of biological data currently being generated. This presents challenges in many fields ranging from biochemistry to computer science to mathematics and statistics. In a comprehensive presentation, Michael Weinstein explains how sequence data is generated, some of the standard methods for storing DNA sequence data, one of the earliest challenges of how to align random short segments of DNA to a template genome, and where this technology is likely going in the future.