Bigdata Analytics Platform
Hadoop ecosystem and open source framework for bigdata analytics
Curated by Taehui Hong
Rescooped by Taehui Hong from Cloud & Big Data Platform

Elements of Scale: Composing and Scaling Data Platforms - ben stopford

This transcribed talk explores a range of data platforms through a lens of basic hardware and software tradeoffs.

Via Steve Hyounggi Min
Rescooped by Taehui Hong from Cloud & Bigdata Watching

Apache Kafka: Next Generation Distributed Messaging System

Apache Kafka is a distributed publish-subscribe messaging system. This article covers the architecture, features, and characteristics of the Kafka framework and how it compares with traditional messaging systems.

Via Wonil Lee Ph.D.
Rescooped by Taehui Hong from Scala & Cloud Playing

log.io distributed logging dashboard

At Arqspin we distribute our video-processing into many smaller steps that can run across many machines in parallel. This post discusses how we use log.io to visualize logs in real-time across all our machines. Here is an example from our Real-time...

Via Wonil Lee Ph.D.
Rescooped by Taehui Hong from Cloud & Bigdata Watching

Flurry Analytic Backend - Processing Terabytes of Data in Real-time

www.flurry.com, November 14, 2013. Anthony Watkins, Senior Director of Developer Relations: Processing Terabytes of Data in Real-Time (@flurrymobile, @antwatkins)

Via Wonil Lee Ph.D.
Rescooped by Taehui Hong from hi bigdata

The Big List of D3.js Examples

With nearly 1900 entries linking to websites or projects that use, enhance, and abstract D3, you might get an idea of its universality...


Via Jan Hesse, JerryJung
Rescooped by Taehui Hong from hi bigdata

Hadoop 2 - Going beyond MapReduce

Talk held at the NoSQL Cologne Usergroup on 04.12.2013. Agenda:
- Why Hadoop 2?
- HDFS 2
- YARN
- YARN Apps
- Write your own YARN App
- Stinger Initiative & Hive

Via JerryJung
Rescooped by Taehui Hong from hi bigdata

A Peek Inside Cisco's Hadoop Security Machine - Datanami

The Internet is the ultimate invention of man, a creation that will forever change how humans work, live, and play.

Via JerryJung
Rescooped by Taehui Hong from hi bigdata

Hadoop on OpenStack: Elastic Data Processing (EDP) with Savanna 0.3

Now that version 0.2 of Project Savanna is out, it’s time to start looking at what will be coming up in version 0.3. The goal for this next development phase is to provide elastic data processing (...

Via Simon Hunanyan, JerryJung
Simon Hunanyan's curator insight, August 28, 2013 6:36 AM

The next version 0.3 of Savanna will include both analytics as a service and elastic data processing (EDP).

Rescooped by Taehui Hong from Cloud & Big Data Platform

Google moves from MySQL to MariaDB

Jack Clark for The Register, quoting a talk at VLDB by Google senior systems engineer Jeremy Cole:
“We’re running primarily on [MySQL] 5.1 which is a little outdated, and so we’re moving to MariaDB 10.0...

Via Steve Hyounggi Min
Rescooped by Taehui Hong from Hadoop

storm at twitter

Talk given at Facebook's analytics@webscale conference. Covers Storm basics, system overview, architecture at Twitter, and current use cases. Featuring Hadoop


Via Sylvain Kalache
Rescooped by Taehui Hong from Cloud & Bigdata Watching

Apache Kafka 0.8 basic training - Verisign

Apache Kafka 0.8 basic training (120 slides) covering: 1. Introducing Kafka: history, Kafka at LinkedIn, Kafka adoption in the industry, why Kafka 2. Kafka cor…

Via Wonil Lee Ph.D.
Rescooped by Taehui Hong from hi bigdata

Something about Kafka - Why Kafka is so fast

This slide deck briefly explains why Kafka performs so fast.

Via Wonil Lee Ph.D., JerryJung
Rescooped by Taehui Hong from hi bigdata

DataTorrent - Hadoop's Most Powerful Platform for Real-Time Stream Analytics

DataTorrent is Hadoop's Most Powerful Platform for Real-Time Stream Analytics

Via Jaeboo Jeong, JerryJung
Rescooped by Taehui Hong from All about Software Technology

18 commands to monitor network bandwidth on Linux server

Here are some command line tools that can be used to analyse and monitor network bandwidth usage on your Linux server.

Via Steve Hyounggi Min
Rescooped by Taehui Hong from hi bigdata

Phoenix - A High Performance Open Source SQL Layer over HBase

Have a lot of data? Using or considering using Apache HBase (part of the Hadoop family) to store your data? Want to have your cake and eat it too? Phoenix is...

Via JerryJung
Rescooped by Taehui Hong from hi bigdata

Hadoop Tutorial - Geolocation Data

In this Hadoop tutorial video, we show how a trucking company can analyze geolocation data to reduce fuel costs and improve driver safety.

Via Wonil Lee Ph.D., JerryJung
Rescooped by Taehui Hong from BigData Hadoop Ecosystem

Impala and ANSI-92 SQL on Hadoop

The origins of Impala can be found in F1 – The Fault-Tolerant Distributed RDBMS Supporting Google’s Ad Business.

Via Charles Gerth
Rescooped by Taehui Hong from Cloud & Big Data Platform

Big data landscape version 2.0

Here's the second version of our big data landscape. Thoughts, questions, comments? We'd love to hear your feedback in the comments section here: http://wp.me/

Via Steve Hyounggi Min
Scooped by Taehui Hong

Netflix open sources its data traffic cop, Suro

Netflix has open sourced a tool called Suro that collects event data from disparate application servers before sending them to other data platforms such as Hadoop and Elasticsearch.
Rescooped by Taehui Hong from BigData Hadoop Ecosystem

Linux Today - Hadoop 2.0 Spins New Big Data YARN

Enterprise Apps Today: Building Hadoop 2.0 required four years of effort and involved a good deal of complexity.

Via Charles Gerth
Rescooped by Taehui Hong from BigData Hadoop Ecosystem

How To Analyze Geolocation Data with Hive and Hadoop

This demo walks through a Geolocation dataset from Uber and looks at how to explore the dataset to assess new product viability using Hive and Hadoop (RT @egwada: How To Analyze #Geolocation Data with #Hive and #Hadoop by .@hortonworks on #bigdata...

Via Charles Gerth