The Disco MapReduce Framework

Summary

Chris Mueller from Life Technologies introduces us to Disco, a MapReduce framework built in Python and Erlang.

Showing that Hadoop is not alone in the MapReduce world, Chris reviews the basic MapReduce paradigm, its dataflow, and file and job distribution. He then explains the Disco Distributed Filesystem (DDFS) before turning to use-case scenarios in next-generation genomic sequencing.
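
For readers new to the paradigm the talk reviews, here is a minimal single-process sketch of the map, shuffle, and reduce steps, using word counting as the example. It is plain Python and does not use Disco's API or DDFS. The function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. A real framework would run the map and reduce steps in parallel across many machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values under their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    pairs = chain.from_iterable(map_phase(line) for line in lines)
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())
```

Calling word_count on a list of text lines returns a dictionary that maps each lowercased word to the number of times it occurs. The same three-step shape applies to other counting tasks, as long as the map step emits key/value pairs and the reduce step combines the values for one key.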