EEDSP
Follow
Find
14.6K views | +8 today
EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU, Distributed and Parallel Computing
Curated by Shiwon Cho
Your new post is loading...
Your new post is loading...
Scooped by Shiwon Cho
Scoop.it!

To Go from Big Data to Big Insight, Start with a Visual

To Go from Big Data to Big Insight, Start with a Visual | EEDSP | Scoop.it
Lessons on making data useful from inside The New York Times.

Although data visualization has produced some of the most captivating artistic displays in recent memory, some of which have found their way into exhibits at the New York Museum of Modern Art and countless art installations around the world, business leaders are asking: is data visualization actionable?

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Making Hadoop applications work in the Cloud: Five key guidelines - hadoopsphere.com

Making Hadoop applications work in the Cloud: Five key guidelines - hadoopsphere.com | EEDSP | Scoop.it
hadoopsphere.com: Making Hadoop applications work in the Cloud: Five key guidelines
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Addressing Big Data Security | The Big Data Hub

Addressing Big Data Security | The Big Data Hub | EEDSP | Scoop.it

Data security rules have changed in the age of big data. The V-Force (Volume, Velocity and Variety) has changed the landscape for data processing and storage in many organizations. Organizations are collecting, analyzing and making decisions based on analysis of massive amounts of data sets from various sources such as web logs, clickstream data and social media content to gain better insights about their customers. Their business and security in this process are becoming increasingly more important.

 

 

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

An example of MapReduce with rmr2

R can be connected with Hadoop through the rmr2 package. The core of this package is mapreduce() function that allows to write some custom MapReduce algorithms. The aim of this article is to show h...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

One Petabyte Red Hat Storage and GlusterFS. Project Overview | jread.us

One Petabyte Red Hat Storage and GlusterFS. Project Overview | jread.us | EEDSP | Scoop.it

In October 2012, I was fortunate enough to be given the task of designing and implementing a 1000 Terabyte (1 Petabyte) GlusterFS volume.

 
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Topic of Interest - Distributed Stream Processing

Topic of Interest - Distributed Stream Processing | EEDSP | Scoop.it
I've been interested in distributed stream processing since the Storm and S4 announcements. Storm was open sourced by Twitter and S4, now an Apache Incubator project, was open sourced by Yahoo! Now...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

MapReduce C++ Library | Craig Henderson

MapReduce C++ Library | Craig Henderson | EEDSP | Scoop.it

The MapReduce C++ Library implements a single-machine platform for programming using the the Google MapReduce idiom.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

NoSQL Database Adoption Trends

NoSQL Database Adoption Trends | EEDSP | Scoop.it
In this research survey on NoSQL database adoption trends, InfoQ would like to learn what NoSQL databases you are currently using or planning on using in your applications.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Facebook’s graph-processing engine

This is a good presentation about Facebook’s graph-processing engine, Giraph, from a big data event held at the company’s Menlo Park campus in early June.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Average Income per Programming Language

Average Income per Programming Language | EEDSP | Scoop.it
Update 8/21:  I've gotten a lot of feedback about issues with these rankings from comments, and have tried to address some of them here.  The data there has been updated to include confidence inter...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Building an R Hadoop System - RDataMining.com: R and Data Mining

 This page shows how to build an R Hadoop system, and presents the steps to set up my first R Hadoop system in single-node mode on Mac OS
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Simple Hive ‘Cheat Sheet’ for SQL Users

Simple Hive ‘Cheat Sheet’ for SQL Users | EEDSP | Scoop.it
If you’re already familiar with SQL then you may well be thinking about how to add Hadoop skills to your toolbelt as an option for data processing.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Hoya (HBase on YARN) : Application Architecture - Hortonworks

Hoya (HBase on YARN) : Application Architecture - Hortonworks | EEDSP | Scoop.it
HOYA - HBase on YARN - Application Architecture for Hadoop 2.0
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Text Mining the Complete Works of William Shakespeare

Text Mining the Complete Works of William Shakespeare | EEDSP | Scoop.it
I am starting a new project that will require some serious text mining. So, in the interests of bringing myself up to speed on the tm package, I thought I would apply it to the Complete Works of William Shakespeare and just see what falls out.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Twitter open sources Storm-Hadoop hybrid called Summingbird

Twitter open sources Storm-Hadoop hybrid called Summingbird | EEDSP | Scoop.it
Twitter has open sourced a “streaming MapReduce” system called Summingbird that makes Hadoop and Storm play nicer together so applications that require both batch and stream processing can do their jobs with as little complexity as possible.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

RabbitMQ - AMQP 0-9-1 Model Explained

RabbitMQ - AMQP 0-9-1 Model Explained | EEDSP | Scoop.it
RabbitMQ is a complete and highly reliable enterprise messaging system based on the emerging AMQP standard
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

OpenCPU 1.0 release!

After more than 3 years of development, we release the first official version of the OpenCPU system.
Based on feedback and experiences from the beta series, OpenCPU version 1.0 has been rewritten ...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

OpenMP®/Clang

The OpenMP (Open Multi-Processing) specification is a standard for a set of compiler directives, library routines, and environment variables that can be used to specify shared memory parallelism in Fortran and C/C++ programs.

This project implements OpenMP support in the Clang C language family front-end for the LLVM compiler. The current scope of the project is to support the OpenMP 3.1 specification.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Big Data Sets you can use with R

Big Data Sets you can use with R | EEDSP | Scoop.it

The world may indeed be awash with data, however, it is not always easy to find a suitable data set when you need one.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

The Hadoop Data Warehouse – A Wake-up Call for Traditional EDW | The Big Data Hub

The Hadoop Data Warehouse – A Wake-up Call for Traditional EDW | The Big Data Hub | EEDSP | Scoop.it
The change to augment the traditional EDW is happening gradually as big data technology matures and more solutions get added to fill the gaps that exist today
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Big Data, Analytics And The Future Of Marketing And Sales

Big Data, Analytics And The Future Of Marketing And Sales | EEDSP | Scoop.it
By Jonathan Gordon (@JW_Gordon), Jesko Perrey, and Dennis Spillecke (@dspillecke) Big Data is the biggest game-changing opportunity for marketing and sales since the Internet went mainstream almost 20 years ago.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Text Mining with R - Comparing Word Counts in two Text Documents

Here's what I came up with to compare word counts in two pieces of text. If you got any idea, I'd love to learn about alternatives!## a function that compares word counts in two textswordcount ...
more...
No comment yet.