October 25, 2009

Reports of perfectly-balanced hardware configurations are greatly exaggerated

Data warehouse appliance and software appliance vendors like to claim that they’ve worked out just the right hardware configuration(s), and that a single configuration is correct for a fairly broad range of workloads. But there are a lot of reasons to be dubious about that. Specific vendor evidence includes:

What’s more, the claim never made a lot of sense anyway. With the rarest of exceptions, even a single data warehouse’s workload will contain different queries that strain different parts of the system in different ratios. Calculating the “ideal” hardware configuration for that single workload would be forbiddingly difficult. And even if one could calculate it, it almost surely would be different than another user’s “ideal” configuration. How a single hardware configuration can be “ideally balanced” for a broad class of use cases boggles the imagination.

Comments

6 Responses to “Reports of perfectly-balanced hardware configurations are greatly exaggerated”

  1. Kumar on October 25th, 2009 3:11 am

    DW appliance vendors in general do not claim that they have the perfect hardware ‘alignment’ with respect to storage, IO, CPU, and memory for all workloads. Each vendor has a sweet spot with respect to volume of data, distribution, and workload and that their default configuration(s) would work reasonably well for a given workload characteristics. The vendors goal is to beat their competition with price/performance ratios for a given workload (TPC-H). To me there is a difference between applying general algorithms to problems and specifically tunning the algorithm knowing the characteristics of the problem. While the general algorithm may not be the ‘ideal’ solution, there is in general a bigger cost for maintaining the hand tunned one.

  2. Curt Monash on October 25th, 2009 5:54 am

    To a first approximation, you’re right. Some vendors optimize for TPC-H, and advertise TPC-H results. Others optimize for real-life work, and advertise real-life successful customers.

    Be that digression as it may, I stand by my opinion that it’s worthwhile pointing out the difference between a naive interpretation of marketing claims on the one hand, and reality on the other.

  3. Rob Klopp on October 26th, 2009 1:51 pm

    I think that “ideal” only matters if it affects the price/performance of the system. That is, given some workload with an SLA, “ideal” is the cheapest system that satisfies the SLA.

    The advantage of a software-only solution that can “run their software in all sorts of hardware and storage environments” is only meaningful if it allows the hardware and storage environment to be tweaked to some price/performance advantage.

    If a hardware/software appliance is too course-grained… i.e. the CPU or storage appliance increments are big and expensive, then the software-only offerings will have an advantage because they can offer systems in between the appliance increments. The appliance vendors can mitigate this advantage only by taking a margin hit when an in-between solution is required.

    Full Disclosure: I work for Greenplum… but this note presents my personal opinion and does not reflect a company view…

  4. Teradata Transition On Course in Steady Quarter, With Exciting New Offerings Ahead « Market Strategies for IT Suppliers on November 6th, 2009 11:51 pm

    […] predictable enough to warrant such configuration specificity; Curt Monash discusses the issue here.) With its strong base of happy customers, Teradata can back that story with real world examples […]

  5. Confluence: Edmunds Central on November 23rd, 2009 2:37 pm

    Oracle Exadata…

    Technical Documentation Exadata Prepup Presentation (compiled by Suman) Exadata V2 for Data Warehousing.pptx Files provided by Greg Day, Principal DB/Grid Sales Consultant Exadata Technical White Paper exadatatechnicalwhitepaper.pdf…….

  6. Teradata hardware strategy and tactics | DBMS2 -- DataBase Management System Services on February 21st, 2010 6:37 am

    […] believe Teradata will go modular more emphatically than Teradata itself does, because I think doing so will meet users needs more effectively than if Teradata relies strictly on fixed appliance […]

Leave a Reply




Feed: DBMS (database management system), DW (data warehousing), BI (business intelligence), and analytics technology Subscribe to the Monash Research feed via RSS or email:

Login

Search our blogs and white papers

Monash Research blogs

User consulting

Building a short list? Refining your strategic plan? We can help.

Vendor advisory

We tell vendors what's happening -- and, more important, what they should do about it.

Monash Research highlights

Learn about white papers, webcasts, and blog highlights, by RSS or email.