MapReduce for the Cell B.E. Architecture
| Sorted by Date | Classified by Publication Type | Classified by Project |
Marc de Kruijf and Karthikeyan Sankaralingam. MapReduce for the Cell B.E. Architecture. IBM Journal of Research and Development, 53(5), 2009.
Download
(unavailable)
Abstract
MapReduce is a simple and flexible parallel programming model proposedby Google for large-scale distributed data processing. In this paper,we present a design and prototype implementation of MapReduce for theCell Broadband Engine Architecture (CBEA). The MapReduce modelprovides a simple machine abstraction that shi elds users fromparallelization and other distributed programming complications. Thegoal of this paper is t o describe the tradeoffs in the design of theruntime and demonstrate the potential for high performance. We studythe basic characteristics of the MapReduce model and identify threetypes of MapReduce applications: map dominated, partition dominated,and sort dominated. We evaluate our runtime performance, scalability,an d efficiency for microbenchmarks representing each of theseapplication types as well as for complete applic ations. We find thatmap-dominated applications map well to the CBEA and that our prototypesustains high pe rformance on these applications. Forpartition-dominated and sort-dominated applications, we analyzeruntime performance, identify sources of inefficiency, and proposeseveral future enhancements to significantly imp roveperformance. Overall, we find that the simplicity and efficiency ofthe model make it an attractive too l for programming Cell BroadbandEngine processor-based platforms.
BibTeX
@article{ibmjr09:MapreduceCell2007, AUTHOR = {Marc de Kruijf and Karthikeyan Sankaralingam}, TITLE = "{MapReduce for the Cell B.E. Architecture}", abstract = { MapReduce is a simple and flexible parallel programming model proposed by Google for large-scale distributed data processing. In this paper, we present a design and prototype implementation of MapReduce for the Cell Broadband Engine<AE> Architecture (CBEA). The MapReduce model provides a simple machine abstraction that shi elds users from parallelization and other distributed programming complications. The goal of this paper is t o describe the tradeoffs in the design of the runtime and demonstrate the potential for high performance. We study the basic characteristics of the MapReduce model and identify three types of MapReduce applications: map dominated, partition dominated, and sort dominated. We evaluate our runtime performance, scalability, an d efficiency for microbenchmarks representing each of these application types as well as for complete applic ations. We find that map-dominated applications map well to the CBEA and that our prototype sustains high pe rformance on these applications. For partition-dominated and sort-dominated applications, we analyze runtime performance, identify sources of inefficiency, and propose several future enhancements to significantly imp rove performance. Overall, we find that the simplicity and efficiency of the model make it an attractive too l for programming Cell Broadband Engine processor-based platforms. }, journal = {IBM Journal of Research and Development}, volume = {53}, number = {5}, year = {2009}, bib_pubtype = {Journal}, bib_rescat = {proj-opinion} }
Generated by bib.pl (written by Patrick Riley ) on Sun Sep 26, 2021 16:14:28 time=1207019082