hi bigdata
2.6K views | +0 today
Follow
hi bigdata
big data project
Curated by JerryJung
Your new post is loading...
Your new post is loading...
Rescooped by JerryJung from Large-scale Incremental Processing
Scoop.it!

Edge Intelligence for IoT with Apache MiNiFi - Hortonworks

Edge Intelligence for IoT with Apache MiNiFi - Hortonworks | hi bigdata | Scoop.it
MiNiFI is a subproject of NiFi designed to solve the difficulties of managing and transmitting data feeds to and from the source of origin, often the first

Via Jaeboo Jeong
more...
No comment yet.
Scooped by JerryJung
Scoop.it!

Data Lake vs. Data Warehouse: Is the warehouse going under the lake?

Data Lake vs. Data Warehouse: Is the warehouse going under the lake? | hi bigdata | Scoop.it
Understand the differences between a data lake vs. data warehouse and find out if data lakes will replace a data warehouse or will they coexist.
more...
No comment yet.
Rescooped by JerryJung from Technology Innovations
Scoop.it!

Kafka in Action: 7 Steps to Real-Time Streaming From RDBMS to Hadoop - DZone Big Data

Here is an in-depth example of using Flume with Kafka to stream real-time RDBMS data into a Hive table on HDFS.
Via Tony Shan
more...
No comment yet.
Rescooped by JerryJung from Large-scale Incremental Processing
Scoop.it!

DataTorrent - Hadoop's Most Powerful Platform for
Real-Time Stream Analytics

DataTorrent - Hadoop's Most Powerful Platform for <br/>Real-Time Stream Analytics | hi bigdata | Scoop.it
DataTorrent is Hadoop's Most Powerful Platform for Real-Time Stream Analytics

Via Jaeboo Jeong
more...
No comment yet.
Rescooped by JerryJung from JavaScript for Line of Business Applications
Scoop.it!

Data Visualization with JavaScript

Data Visualization with JavaScript | hi bigdata | Scoop.it

If you’re developing web sites or web applications today, there’s a good chance you have data to communicate, and that data may be begging for a good visualization. But how do you know what kind of visualization is appropriate? And, even more importantly, how do you actually create one? Answers to those very questions are the core of this book. In the chapters that follow, we explore dozens of different visualizations and visualization techniques and tool kits. Each example discusses the appropriateness of the visualization (and suggests possible alternatives) and provides step-by-step instructions for including the visualization in your own web pages.


Contents:

IntroductionImplementation vs DesignCode vs. StylingSimple vs. ComplexReality vs. an Ideal WorldSource Code for ExamplesAcknowledgementsGraphing DataMaking Charts InteractiveIntegrating Charts in a PageCreating Specialized GraphsShowing TimelinesVisualizing Geographic DataCustom Visualizations with D3.jsBuilding Data-Driven Web ApplicationsManaging Data in the Browser



Via Jan Hesse
more...
No comment yet.
Rescooped by JerryJung from Cloud & Big Data Platform
Scoop.it!

Haeinsa is linearly scalable multi-row, multi-table transaction library for HBase. Haeinsa uses two-phase locking and optimistic concurrency control for implementing transaction. The isolation leve...

Haeinsa Overview (HBase Transaction Library)

Via Steve Hyounggi Min
more...
No comment yet.
Rescooped by JerryJung from Cloud & Bigdata Watching
Scoop.it!

Improving Hadoop Performance via Linux

Administering a Hadoop cluster isn't easy. Many Hadoop clusters suffer from Linux configuration problems that can negatively impact performance. With vast and …

Via Wonil Lee Ph.D.
more...
No comment yet.
Rescooped by JerryJung from Cloud & Bigdata Watching
Scoop.it!

Google adds a big data service and lots of monitoring to its cloud

Google adds a big data service and lots of monitoring to its cloud | hi bigdata | Scoop.it
Google rolled out a slew of new cloud services at I/O, including one called Dataflow that’s meant to put standard MapReduce to shame. It’s advertised a much simpler way to build data pipelines that can handle both batch processing and streaming data.

Via Wonil Lee Ph.D.
more...
No comment yet.
Rescooped by JerryJung from Large-scale Incremental Processing
Scoop.it!

Classifiying documents using Naive Bayes on Apache Spark / MLlib

Classifiying documents using Naive Bayes on Apache Spark / MLlib | hi bigdata | Scoop.it
In recent years, Apache Spark has gained in popularity as a faster alternative to Hadoop and it reached a major milestone last month by releasing the production ready version 1.0.0. It claims to be...

Via Jaeboo Jeong
more...
No comment yet.
Rescooped by JerryJung from Cloud & Bigdata Watching
Scoop.it!

Introducing R for Big Data with PivotalR | Pivotal P.O.V.

Introducing R for Big Data with PivotalR | Pivotal P.O.V. | hi bigdata | Scoop.it
Pivotal releases PivotalR to run R in-database & in-Hadoop - currently supported by PostreSQL, Greenplum, PivotalHD http://t.co/tFm7Ja22EK

Via Wonil Lee Ph.D.
more...
No comment yet.
Rescooped by JerryJung from Scala & Cloud Playing
Scoop.it!

Hadoop Tutorial - Geolocation Data

Hadoop Tutorial - Geolocation Data | hi bigdata | Scoop.it
In this Hadoop tutorial video, we show how a trucking company can analyze geolocation data to reduce fuel costs and improve driver safety.

Via Wonil Lee Ph.D.
more...
No comment yet.
Rescooped by JerryJung from Big Data GIS
Scoop.it!

Apache Spark, Spatial Functions and ArcGIS for Desktop

Apache Spark, Spatial Functions and ArcGIS for Desktop | hi bigdata | Scoop.it
A while back I watched with great fascination a webinar presented by UC Berkley amp lab on Spark and Shark. I wanted to spatially enable spark and has been on my todo list for a while.

Via Dahl Winters
more...
Exodusoversea's curator insight, February 5, 2014 12:58 AM

We are among top study abroad overseas education consultants in delhi

http://exodusoverseas.com/our-team.php

Exodusoversea's curator insight, February 5, 2014 12:59 AM

We are among top study abroad overseas education consultants in delhi

http://exodusoverseas.com/our-team.php

Dr. Vijay Srinivas A's curator insight, June 9, 2014 7:04 AM

Interesting angle to enhancing Spark, with spatial queries. 

Rescooped by JerryJung from Large-scale Incremental Processing
Scoop.it!

The current state of machine intelligence 3.0

The current state of machine intelligence 3.0 | hi bigdata | Scoop.it
Watching the appeal and applications of machine intelligence expand.
Via Jaeboo Jeong
more...
No comment yet.
Rescooped by JerryJung from Cloud & Bigdata Watching
Scoop.it!

Something about Kafka - Why Kafka is so fast

This slide briefly introduced the reason why kafka is so fast in performance.

Via Wonil Lee Ph.D.
more...
No comment yet.
Rescooped by JerryJung from Cloud & Bigdata Watching
Scoop.it!

Data-focused Docker clustering - ClusterHQ

Data-focused Docker clustering - ClusterHQ | hi bigdata | Scoop.it
This post is intended to start a conversation about how Docker should handle data volumes for distributed applications. Today we publicly launched Flocker 0.1, an open-source volume and container manager for Docker. Flocker 0.1 is an early model of how we believe storage and networking could be handled in a distributed system. Support for portable …

Via Wonil Lee Ph.D.
more...
No comment yet.
Rescooped by JerryJung from Large-scale Incremental Processing
Scoop.it!

Databricks Spark Reference Applications

Databricks Spark Reference Applications | hi bigdata | Scoop.it
Reference Applications demonstrating Apache Spark - brought to you by Databricks.

Via Jaeboo Jeong
more...
No comment yet.
Rescooped by JerryJung from Cloud & Bigdata Watching
Scoop.it!

Hadoop Summit 2014: Building a Self-Service Hadoop Platform at Linked…

Hadoop comprises the core of LinkedIn’s data analytics infrastructure and runs a vast array of our data products, including People You May Know, Endorsements, …

Via Wonil Lee Ph.D.
more...
No comment yet.
Rescooped by JerryJung from Cloud & Bigdata Watching
Scoop.it!

Interactive Analytics in Human Time

Interactive Analytics in Human Time S u p r e e t h R a o , S u n i l G u p t a ⎪ J u n e 4 , 2 0 1 4 2 0 1 4 H a d o o p S u m m i t , S a n J o s e , C a l i…

Via Wonil Lee Ph.D.
more...
No comment yet.
Scooped by JerryJung
Scoop.it!

Hadoop Operations Powered By ... Hadoop (Hadoop Summit 2014 Amsterdam)

At Spotify we collect huge volumes of data for many purposes. Reporting to labels, powering our product features, and analyzing user growth are some of our mos…
more...
No comment yet.
Rescooped by JerryJung from All about Software Technology
Scoop.it!

Grafana - The Graphite dashboard frontend, editor and graph composer

Grafana - The Graphite dashboard frontend, editor and graph composer | hi bigdata | Scoop.it
A unique graphite dashboard aimed to be a general purpose dashboard that looks nice and makes it easy to construct and edit dashboards through the UI. It also contains an advanced and unique graph editor and graphite target expression / function editor. Other notible features are fast client side rendering, select to zoom in, multiple y-axes and graph templating.

Via Steve Hyounggi Min
more...
No comment yet.
Rescooped by JerryJung from Scala & Cloud Playing
Scoop.it!

GP Tools: Geoprocessing for ArcGIS Using Amazon Elastic Map Reduce

GP Tools: Geoprocessing for ArcGIS Using Amazon Elastic Map Reduce | hi bigdata | Scoop.it

ArcGIS provides a large suite of tools for performing GIS tasks that range from simple buffers and polygon overlays to complex regression analysis and image classification. Tools can be chained together feeding the output of one tool into another.

 

The GP Tools for AWS are built to enable ArcGIS users to leverage Amazon Web Services through GP tools. This project leverages Elastic Map Reduce (EMR) the hosted hadoop framework, as well as, Simple Storage Service (S3) in AWS. In addition, the project leverages GIS Tools for Hadoop to geo-enable hadoop.


Via Dahl Winters, Wonil Lee Ph.D.
more...
No comment yet.