In a previous blog post I offered up two interpretations of the term 'data science'. These amounted to 1) 'the science of data' and 2) 'doing science with data'. If you read the earlier post you'll probably detect my mild irritation with the term when coupled with the second of these interpretations. Perhaps it's the redundancy, or maybe the implication that plain 'science' is somehow devoid of data. It may be both.
However, irritation aside, there is no doubt that the commoditisation of high-performance/distributed computing and data analysis tools is creating new avenues for scientific enquiry where data volumes may previously have been a barrier. Couple this with the increasing wealth and volume of open data available to researchers and we have unprecedented opportunities to 'do science with data'.