Hadoop Recipes
521 views | +0 today
Follow
Hadoop Recipes
Quick Recipes related to Hadoop
Curated by Ashish Paliwal
Your new post is loading...
Your new post is loading...
Scooped by Ashish Paliwal
Scoop.it!

Crunching Data with Apache Crunch – Part 2

Crunching Data with Apache Crunch – Part 2 | Hadoop Recipes | Scoop.it
In Part 1, we saw the word count example. Lets built more on top of it. A very common use case of Word Count example would be to find, Top 100 words. Using MapReduce, you would use Secondary Sort a...
more...
No comment yet.
Scooped by Ashish Paliwal
Scoop.it!

MapReduce Patterns, Algorithms, and Use Cases

MapReduce Patterns, Algorithms, and Use Cases | Hadoop Recipes | Scoop.it
In this article I digested a number of MapReduce patterns and algorithms to give a systematic view of the different techniques that can be found on the web or scientific articles. Several practical...
more...
No comment yet.
Scooped by Ashish Paliwal
Scoop.it!

Hadoop Recipe – Using Custom Java Counters | Thread.currentThread().join()

Starting the Hadoop Recipe series, in which I shall pick up a topic and provide sample code around it.
more...
No comment yet.
Scooped by Ashish Paliwal
Scoop.it!

Hadoop Recipe – Implementing Custom Partitioner | Thread.currentThread().join()

Hadoop Recipe – Implementing Custom Partitioner | Thread.currentThread().join() | Hadoop Recipes | Scoop.it
This recipe is about implementing custom Parititoner A Partitioner in MapReduce world partitions the key space.
more...
No comment yet.
Scooped by Ashish Paliwal
Scoop.it!

Crunching Data with Apache Crunch – Part 1

Crunching Data with Apache Crunch – Part 1 | Hadoop Recipes | Scoop.it
Apache Crunch (incubating) is a Java library for writing, testing, and running MapReduce pipelines, based on Google's FlumeJava. Its goal is to make pipelines that are composed of many user-defined...
more...
No comment yet.
Scooped by Ashish Paliwal
Scoop.it!

Hadoop Distributed Cache | Thread.currentThread().join()

Hadoop has a distributed cache mechanism to make available file locally that may be needed by Map/Reduce jobs.
more...
No comment yet.
Scooped by Ashish Paliwal
Scoop.it!

Hadoop Recipe – Implementing Custom Writable | Thread.currentThread().join()

This Recipe is about implementing a custom Writable to be used in MapReduce code.
more...
No comment yet.