EEDSP
Follow
Find
12.4K views | +14 today
EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU
Curated by Shiwon Cho
Your new post is loading...
Your new post is loading...
Scooped by Shiwon Cho
Scoop.it!

Pig Eye for the SQL Guy - Hortonworks

Cat Miller is an engineer at Mortar Data, a Hadoop-as-a-service provider, and creator of mortar, an open source framework for data.

Pig is similar enough to SQL to be familiar, but divergent enough to be disorienting to newcomers. The goal of this guide is to ease the friction in adding Pig to an existing SQL skillset.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

How to install sar, sadf, mpstat, iostat, pidstat and sa tools on CentOS / Fedora / RHEL

How to install sar, sadf, mpstat, iostat, pidstat and sa tools on CentOS / Fedora / RHEL | EEDSP | Scoop.it
The following command can be used to install sar, sadf, mpstat, iostat, pidstat and sa tools on RPM based systems like CentOS, Fedora, RHEL (Red Hat Enterprise Linux):
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

DDS Programming using Modern C++

Resurgence of C++ is spreading in many industries. International computer system standards that target C++ for application portability, are quickly adopting modern C++. At the Object Management Gro...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Slides from "Big Data Real Time Predictive Analytics"

At Tuesday's Data Driven Business Day at the Strata conference I gave my talk, Real-time Big Data Predictive Analytics: From Deployment to Production.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Storm and Hadoop: Convergence of Big-Data and Low-Latency Processing · YDN Blog

Storm and Hadoop: Convergence of Big-Data and Low-Latency Processing · YDN Blog | EEDSP | Scoop.it

At Yahoo!, Hadoop plays a central role in providing personalized experiences for our users and creating value for our advertisers. To serve Yahoo!’s emerging business needs, the Cloud Engineering Group is working on a next generation platform that enables the convergence of big-data and low-latency processing.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

C++ REST SDK (codename "Casablanca") - Home

This library is a Microsoft effort to support cloud-based client-server communication in native code using a modern asynchronous C++ API design.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

New ways to Hadoop with R

Today, there are two main ways to use Hadoop with R and big data: 1. Use the open-source rmr package to write map-reduce tasks in R (running within the Hadoop cluster - great for data distillation!) 2.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

RabbitMQ Clustering on the RaspberryPi | Javalobby

RabbitMQ Clustering on the RaspberryPi from Alvaro Videla on Vimeo.
A video showing how to cluster RabbitMQ using three RaspberryPis for hardware.

More...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Redis 201

Redis 201 Posted by Brian P O'Rourke on Feb 18th, 2013 This is a quick overview of some of the big-picture lessons we’ve helped our
customers with …
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Building recommendation platforms with Hadoop - Strata

Building recommendation platforms with Hadoop - Strata | EEDSP | Scoop.it
Recommendations are making their way into more and more products. Using larger datasets are significantly improving the recommendations. Hadoop is being increasingly used for building out the recommendation platforms....
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Scala 2.10 – Macros Hands-on With "Method Alias" | Javalobby

In this post we’ll look at how implementing a macro in Scala 2.10 looks like. We’ll just cover the 1st syntax mentioned here (1st image).

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Near Real-time Processing Over Hadoop and HBase | Cerner Engineering Health

Near Real-time Processing Over Hadoop and HBase | Cerner Engineering Health | EEDSP | Scoop.it

These significant differences mean different processing infrastructures. Nathan Marz described this well in his How to Beat the CAP Theorem post. The result is a system that uses complementary technologies: stream-based processing with Storm and batch processing with Hadoop.

Interestingly, HBase sits at a juncture between realtime and batch processing models. It offers aspects of batch processing; computation can be moved to the data via direct MapReduce support. It also supports realtime patterns with random access and fas

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

FDTD Algorithm Optimization on Intel® Xeon Phi™ coprocessor | Intel® Developer Zone

FDTD Algorithm Optimization on Intel® Xeon Phi™ coprocessor | Intel® Developer Zone | EEDSP | Scoop.it
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

PARALUTION - The Library for Iterative Sparse Methods on CPU and GPU

PARALUTION - The Library for Iterative Sparse Methods on CPU and GPU | EEDSP | Scoop.it

PARALUTION is a library for sparse iterative methods with special focus on multi-core and accelerator technology such as GPUs. In particular, it incorporates fine-grained parallel preconditioners designed to expolit modern multi-/many-core devices. Based on C++, it provides a generic and flexible design and interface which allow seamless integration with other scientific software packages. The library is open source and released under GPL.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

How to make a scientific result disappear

How to make a scientific result disappear | EEDSP | Scoop.it
Nathan Danneman (a co-author and one of my graduate students from Emory) recently sent me a New Yorker article from 2010 about the “decline effect,” the tendency for initially promising scientific results to get smaller upon replication.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Big Data at Torbit: Custom MapReduce-like System

Big Data at Torbit: Custom MapReduce-like System | EEDSP | Scoop.it
Tylor Arndt about Torbit’s “build-your-own-MapReduce”: The final system begins with a web-service against which client systems interface. To ensure resiliency, an instance of the web- service runs on each cluster host.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

LG경제연구원 걸음마 뗀 소셜 분석, 한계 아는 만큼 가치가 보인다.

소셜 미디어가 활용의 도구에서 분석의 대상으로 진화하고 있다. 소셜 데이터는 방대한 양뿐만 아니라 자발적으로 표현되고 실시간으로 확보가능한 정보라는 점 때문에 기존의 인위적인 실험 환경이나 구조화된 설문 방식을 보완할 새로운 연구대상으로 관심을 모으고 있다

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

How-to: Resample from a Large Data Set in Parallel (with R on Hadoop)

How-to: Resample from a Large Data Set in Parallel (with R on Hadoop) | EEDSP | Scoop.it
Cloudera offers enterprises a powerful new data platform built on the popular Apache Hadoop open-source software package.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Intel Unveils New Distribution For Apache Hadoop

Intel Unveils New Distribution For Apache Hadoop | EEDSP | Scoop.it
Third generation of Intel's Hadoop promises better performance and security and tighter integration with Intel's Xeon processors.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Team Blog | Vert.x on Raspberry Pi

Team Blog | Vert.x on Raspberry Pi | EEDSP | Scoop.it
Vert.x on Raspberry Pi | Lightweight JVM web server

Some people say that using Java on Raspberry Pi is a stupid idea. Sure, JVM uses a lot of system
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Free e-book on Data Science with R

Free e-book on Data Science with R | EEDSP | Scoop.it
A new book by Jeffrey Stanton from Syracuse Iniversity School of Information Studies, An Introduction to Data Science, is now available for free download.
more...
No comment yet.