bigdata
Scooped by James Warren

The Road to Summingbird: Stream Processing at (Every) Scale // Speaker Deck


MongoDB is to NoSQL like MySQL to SQL — in the most harmful way

MongoDB is the "leading NoSQL database", yet it has so many problems that it seems to harm the NoSQL movement at large, just as MySQL did to SQL.

Lightning Talk on Cascalog

Just 19 slides, but Paul Lam manages to provide both a comparison of Cascalog and Hive, plus an overview of the most interesting bits of Cascalog.
Cascalog vs Hive
Cascalog Query Pipe Assembly
Highly...

What are the best blogs about data?

Posted on Quora long ago. See Peter Skomoroch's list below. We will publish our list of several hundred websites, ranked by popularity, later this month.

The Big Data Scientist's Skillset | SmartData Collective

Let's take a look at what a good Big Data Scientist (aka "the sexiest job in the 21st century") has to be capable of doing.

An introduction to Cloudera Impala - SQL on top of Hadoop

James Kinley (@jrkinley) gives an introduction to Cloudera Impala. Cloudera Impala provides fast, interactive SQL queries directly on your Apache Hadoop data...

Applying the Big Data Lambda Architecture

A look inside a Hadoop-based project that matches connections in social media by leveraging the highly scalable lambda architecture.

The sweet spot for Cassandra secondary indexing | Richard Low's blog


A Hadoop Alternative: Building a real-time data pipeline with Storm

With the tremendous growth of the online advertising industry, ad networks have to deal with a humongous amount of data to process. For years, Hadoop has been the de-facto technology used to aggreg...

HUG Meetup January 2013: Impala - Real-time Queries for Apache Hadoop

Speaker: Mark Grover, Software Engineer, Cloudera. The Cloudera Impala project is for the first time making scalable parallel database technology, which is the underpi...