Is MapReduce a good underpinning for next-gen scientific DBMS?
Back in November, Mike Stonebraker suggested that there’s a need for database management advances to serve “big science”. He said:
Obviously, the best solution to these … problems would be to put everything in a next-generation DBMS — one capable of keeping track of data, metadata, and lineage. Supporting the latter would require all operations on the data to be done inside the DBMS with user-defined functions — Postgres-style.
Then he went on to give examples of failings in a prior effort, including that it didn't support the right computations and data transformations.
Meanwhile, Google has started a program to host terabyte-scale scientific databases for free.
If the issue is that different scientific projects need different kinds of specialized indexing, it sure seems as if MapReduce would be a good way to populate those indexes in the first place. Banging data into indexes is what MapReduce was designed for, and indeed seems to be the core of what MapReduce does in production at Google today. That said, getting data into indexes is the beginning of DBMS design and operation, not the end.
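To make the "banging data into indexes" point concrete, here is a rough, purely illustrative sketch of the idea, not Google's actual code or API: a toy map step that emits (term, document) pairs, a shuffle that groups them by term, and a reduce step that collapses each group into a postings list. The tiny corpus, tokenizer, and function names are all made up for the example.

```python
from collections import defaultdict
import re

# Toy corpus standing in for scientific records; purely illustrative.
DOCUMENTS = {
    "doc1": "galaxy survey data with redshift measurements",
    "doc2": "redshift catalog from the galaxy survey",
}

def map_phase(doc_id, text):
    """Map step: emit a (term, doc_id) pair for every token in a document."""
    for term in re.findall(r"\w+", text.lower()):
        yield term, doc_id

def shuffle(pairs):
    """Group intermediate pairs by key, as the MapReduce framework would between phases."""
    grouped = defaultdict(list)
    for term, doc_id in pairs:
        grouped[term].append(doc_id)
    return grouped

def reduce_phase(term, doc_ids):
    """Reduce step: collapse each term's postings into a sorted, deduplicated list."""
    return term, sorted(set(doc_ids))

# Run the toy pipeline: map every document, shuffle, then reduce per term.
intermediate = [pair for doc_id, text in DOCUMENTS.items()
                for pair in map_phase(doc_id, text)]
index = dict(reduce_phase(term, ids) for term, ids in shuffle(intermediate).items())

print(index["redshift"])   # -> ['doc1', 'doc2']
```

The point of the sketch is simply that index construction is an embarrassingly parallel batch job, which is exactly the shape of work MapReduce handles well; everything a DBMS does after the index exists (querying, metadata, lineage) is another matter.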