Big Data, Statistics, Machine Learning, Hadoop
Rescooped by Ryan Geonguk Ahn from Bigdata Analytics Platform

Hadoop on OpenStack: Elastic Data Processing (EDP) with Savanna 0.3

Now that version 0.2 of Project Savanna is out, it’s time to start looking at what will be coming up in version 0.3. The goal for this next development phase is to provide elastic data processing (...

Via Simon Hunanyan, JerryJung, Taehui Hong
Simon Hunanyan's curator insight, August 28, 2013 6:36 AM

The next version 0.3 of Savanna will include both analytics as a service and elastic data processing (EDP).

Rescooped by Ryan Geonguk Ahn from Bigdata Analytics Platform

MapReduce Algorithms - Understanding Data Joins Part 1

In this post we continue with our series of implementing the algorithms found in the Data-Intensive Text Processing with MapReduce book, this time discussing data joins. While we are going to discu...

Via JerryJung, Taehui Hong
Rescooped by Ryan Geonguk Ahn from Bigdata Analytics Platform

How To Analyze Geolocation Data with Hive and Hadoop

This demo walks through a geolocation dataset from Uber and shows how to explore it with Hive and Hadoop to assess new product viability.

Via Charles Gerth, Taehui Hong
Rescooped by Ryan Geonguk Ahn from Bigdata Analytics Platform

Linux Today - Hadoop 2.0 Spins New Big Data YARN

Enterprise Apps Today: Building Hadoop 2.0 required four years of effort and involved a good deal of complexity.

Via Charles Gerth, Taehui Hong
Rescooped by Ryan Geonguk Ahn from All about Software Technology

2013 HTML5 Roundup (Summary of HTML5 Trends in 2013)

These slides cover the 2013 trends in the HTML5 industry, W3C standards, and browser vendors.

Via Steve Hyounggi Min
Rescooped by Ryan Geonguk Ahn from Bigdata Analytics Platform

Netflix open sources its data traffic cop, Suro

Netflix has open sourced a tool called Suro that collects event data from disparate application servers before sending it to other data platforms such as Hadoop and Elasticsearch.

Via Taehui Hong
Rescooped by Ryan Geonguk Ahn from Cloud Storage, Distributed File System

Amazon Web Services Blog: Amazon S3: Multipart Upload

Can I ask you some questions? Have you ever been forced to repeatedly try to upload a file across an unreliable network connection? In most cases there's no easy way to pick up from where you left off and you...

Via Steve Hyounggi Min
Rescooped by Ryan Geonguk Ahn from Cloud Storage, Distributed File System

Our Experience of Creating Large Scale Log Search System Using ElasticSearch | Architects Zone

This post comes from Lee Jae Ik at the CUBRID Blog. At NHN we have a service called NELO (NHN Error Log System) to manage and search logs pushed to the...

Via Steve Hyounggi Min
Rescooped by Ryan Geonguk Ahn from Cloud Storage, Distributed File System

Elasticsearch - Devoxx France 2012 - English version

Elasticsearch presentation for Devoxx France 2012, English translation (feel free to correct my bad English ;-) ). French version is available here: http://www

Via Steve Hyounggi Min
Rescooped by Ryan Geonguk Ahn from Cloud Storage, Distributed File System

hello world » Building a Log Search System with elasticsearch

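elasticsearch is a distributed search engine that Shay Banon developed on top of Lucene. Because installation and server scaling are very easy, I would recommend adopting elasticsearch if the system you are developing needs a search feature. Because it is a distributed system, it has the advantage of coping easily when the volume of data to be searched grows.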


Via Steve Hyounggi Min