2012 IEEE 8th International Conference on E-Science (2009)
Oxford, United Kingdom
Dec. 9, 2009 to Dec. 11, 2009
Applying high level parallel runtimes to data/compute intensive applications is becoming increasingly common. The simplicity of the MapReduce programming model and the availability of open source MapReduce runtimes such as Hadoop, are attracting more users to the MapReduce programming model. Recently, Microsoft has released DryadLINQ for academic use, allowing users to experience a new programming model and a runtime that is capable of performing large scale data/compute intensive analyses. In this paper, we present our experience in applying DryadLINQ for a series of scientific data analysis applications, identify their mapping to the DryadLINQ programming model, and compare their performances with Hadoop implementations of the same applications.
Cloud Computing, MapReduce, DryadLINQ, Hadoop
Thilina Gunarathne, Jaliya Ekanayake, Nelson Araujo, Christophe Poulain, Roger Barga, Atilla Soner Balkir, Geoffrey Fox, "DryadLINQ for Scientific Analyses", 2012 IEEE 8th International Conference on E-Science, vol. 00, no. , pp. 329-336, 2009, doi:10.1109/e-Science.2009.53