May 17, 2012

Thoughts on “data science”

Teradata is paying me to join a panel on “data science” in downtown Boston, Tuesday May 22, at 3:00 pm. A planning phone call led me to jot down a few notes on the subject, which I’m herewith adapting into a blog post.

For starters, I have some concerns about the concepts of data science and data scientist. Too often, the term “data scientist” is used to suggest that one person needs to have strong skills both in analytics and in data management. But in reality, splitting those roles makes perfect sense. Further:

The leader in raising these issues is probably Neil Raden.

But there’s one respect in which I think the term “data science” is highly appropriate. In conventional science, gathering data is just as much of an accomplishment as analyzing it. Indeed, most Nobel Prizes are given for experimental results. Similarly, if you’re doing data science, you should be thinking hard about how to corral ever more useful data. Techniques include but are not limited to:

Comments

4 Responses to “Thoughts on “data science””

  1. Mark Stacey on May 17th, 2012 7:01 am

    Great feedback – I do think one place “data scientist” is appropriate is the scientist who is now using tech to collect data and do analysis.

    That is no different from before, except that with the ubiquity of sensors, gathering data about the physical world is easier.

    That holds in industrial processes, in running your car, even in a modern exercise monitor like the high-end Polar, Garmin and Suunto models: my Polar RS800CX has more instrumentation than my first car! (By count, about 3 times as much.)

    Pulling in data from these different types of sensors, and then applying statistical analysis methods – that’s data *science*

  2. R. Scott on May 17th, 2012 12:50 pm

    I think the essence of data science is the techniques and activities necessary to arrive at actionable insight.

  3. Alex on May 17th, 2012 2:47 pm

    Nice summary. I also think that in real life, in many cases, the collection/load/transformation etc. is actually done by “data engineers” (that seems to be the term in fashion, I guess) and the analysis after that by “data scientists”, but I could be wrong.

  4. Thomas W Dinsmore on May 18th, 2012 9:38 am

    “Data scientist” is how we refer to analysts who do not depend on user-friendly tools and vendor-defined OOTB “solutions”.

    For the record, none of the generic techniques cited — from retaining data previously discarded, to leveraging experimental design, to leveraging third party data — are new. Technology, however, has advanced the frontier of what is commercially viable.
