August 25, 2008
MapReduce links
For whatever reason, I seem to be making the peripheral posts about MapReduce tonight before getting to the meat of the issues. So be it. There’s a rich set of links out there about MapReduce, and here are some of the best of them:
- Aster Data introduced MapReduce integrated into its SQL data warehouse DBMS tonight. Aster’s site features an excellent white paper.
- Exactly the same is true of Greenplum.
- Google Labs offers the seminal MapReduce research paper. It also has a broken link to an associated slide presentation, which fortunately is available here.
- One can get a good sense of MapReduce by reading up on the open source implementation Hadoop.
- In particular, this list of Hadoop applications is the longest list of MapReduce applications I know of (ahead even of Google’s long internal list).
- Joel Spolsky explained the core MapReduce concept a couple of years ago.
Some of our recent links about MapReduce
- The integration of MapReduce with SQL data warehousing
- Three major applications of MapReduce
- Another application of MapReduce
- Sound bites about MapReduce
Comments
8 Responses to “MapReduce links”
Leave a Reply
Google’s paper wasn’t seminal. “MapReduce” is 30 years old. Did you read the paper? Nothing ground-breaking.
The only thing Google did was write the code. However, in my opinion that’s high praise–talk is cheap. Google writes code; everybody else daydreams.
Google’s work is seminal in that they woke the world up to the benefits of this old idea.
CAM
There is a coding tutorial available at this link in the middle of the page: http://www.greenplum.com/resources/mapreduce/
Key things to note about Greenplum’s MR implementation:
– It’s very similar in form and expression to Google and Hadoop
– Extensions for Joins and Pipelined task execution
– Native parallel file access
– Parallelism is full and transparent to the programmer
In summary: we have implemented MapReduce within which you can write SQL, Perl, Python and many more languages. It is straightforward use MR programs written for Hadoop or Google and port them to Greenplum.
[…] Data was one of two MPP analytic DBMS vendors to announce MapReduce support last […]
[…] Other links about MapReduce […]
[…] Другие ссылки о MapReduce […]
[…] Другие ссылки о MapReduce […]
[…] pickle — but not one of my own making — I knocked heads, and persuaded both vendors to announce MapReduce at the same time, namely the following Monday. Lots of publicity ensued for both vendors, and everybody was […]