Paco Nathan is a Data Scientist at Concurrent in SF and a committer on the Cascading.org open source project. In this video he will introduce Cascading, then examine the concept of a "workflow" as an abstraction for integrating Hadoop with other systems. We'll show new features including support for SQL-92, PMML, plus an application manager. This presentation was given on February 12th at the Nokia offices in Chicago, IL.
To view the accompanying slides on slideshare: slideshare.net/pacoid/chicago-hadoop-users-group-enterprise-data-workflows