Log Processing
69 views | +0 today
Follow
Log Processing
Log Processing on BigData
Curated by Junghoon Shin
Your new post is loading...
Your new post is loading...
Rescooped by Junghoon Shin from Hadoop
Scoop.it!

What is Hadoop? | Java,maven,Hadoop,Pig,Hive tutorials with examples

What is Hadoop? | Java,maven,Hadoop,Pig,Hive tutorials with examples | Log Processing | Scoop.it
Hadoop analyze and process large amount of data i.e peta bytes of data in parallel with less time located in distributed environment.

Via Sylvain Kalache
more...
No comment yet.
Rescooped by Junghoon Shin from Bigdata Analytics Platform
Scoop.it!

gRaphics!

gRaphics! | Log Processing | Scoop.it

Luca Fenu provides #rstats code for this informative (and pretty!) visualization of the distribution of data across different axes.


Via Steven L. Johnson, Taehui Hong
more...
No comment yet.
Rescooped by Junghoon Shin from Hadoop
Scoop.it!

Realtime Analytics for Big Data: A Facebook Case Study

Realtime Analytics for Big Data: A Facebook Case Study | Log Processing | Scoop.it
Knowing what your users are doing on your site in real time and matching what they do with more targeted information transforms into better conversion rate and better user satisfaction, which means more money in the end.

Via Sylvain Kalache
more...
No comment yet.
Rescooped by Junghoon Shin from Cloud & Big Data Platform
Scoop.it!

Hadoop Distributed File System-2.0.0-alpha - Federation

HDFS has two main layers:

 

 - Namespace
  + Consists of directories, files and blocks
  + It supports all the namespace related file system operations such as create, delete, modify and list files and directories.

 

 - Block Storage Service has two parts
  + Block Management (which is done in Namenode)
Provides datanode cluster membership by handling registrations, and periodic heart beats.
Processes block reports and maintains location of blocks.
Supports block related operations such as create, delete, modify and get block location.
Manages replica placement and replication of a block for under replicated blocks and deletes blocks that are over replicated.
  + Storage - is provided by datanodes by storing blocks on the local file system and allows read/write access.
The prior HDFS architecture allows only a single namespace for the entire cluster. A single Namenode manages this namespace. HDFS Federation addresses limitation of the prior architecture by adding support multiple Namenodes/namespaces to HDFS file system.


Via Steve Hyounggi Min
more...
No comment yet.
Rescooped by Junghoon Shin from Hadoop
Scoop.it!

HBase Schema Design - Things you need to know

When designing schemas for HBase, be it from scratch or porting an existing application over from a relational database for example, there are a set of architectural constraints that a user should be aware of to avoid common pitfalls.

Via Sylvain Kalache
more...
No comment yet.