EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU
Curated by Shiwon Cho
Scooped by Shiwon Cho

Yahoo! Open Sources Storm on Hadoop

Apache Hadoop is the de facto standard for Big Data storage and batch processing, while Twitter Storm is quickly becoming a standard for large-scale event processing. Unfortunately, until recently, Storm and Hadoop required two physically separate clusters. Last week Yahoo! announced that it is open-sourcing Storm running on a Hadoop cluster.


Top 100 R packages for 2013 (Jan-May)!

What are the top 100 (most downloaded) R packages in 2013? Thanks to RStudio's recent release of their “0-cloud” CRAN log files, we can now answer this question (at least for the months of January through May)!

Tips for Writing Functional Programming Tutorials

Shiwon Cho's insight:

With the growing interest in a functional programming style, there are more tutorials and blog entries on the subject, and that's wonderful. For anyone so inclined to write their own, let me pass along a few quick tips.


Acquiring Big Data Using Apache Flume

Data analysis is only half the battle; getting the data into a Hadoop cluster is the first step in any Big Data deployment. Apache Flume uses an elegant design to make data loading easy and efficient.

Quick and Simple D3 Network Graphs from R

Sometimes I just want to quickly make a simple D3 JavaScript directed network graph with data in R. Because D3 network graphs can be manipulated in the browser–i.e. nodes can be moved aroun...

R and MongoDB

MongoDB is a document-based NoSQL database. Unlike relational databases, which store data in tables with rigid schemas, MongoDB stores data in documents with dynamic schemas.
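
To illustrate what a dynamic schema means, here is a minimal sketch in plain Python. It does not use MongoDB or any driver; the `people` collection, its field names, and the `fields_used` and `find` helpers are all hypothetical. The point is only that two documents in the same collection can carry different fields, which a table with a rigid schema would not allow without a schema change.

```python
import json

# Hypothetical "collection": a list of documents modeled as plain dicts.
# This is not MongoDB's API, only an illustration of dynamic schemas.
people = [
    {"name": "Ada", "languages": ["R", "Python"]},
    {"name": "Grace", "email": "grace@example.com", "address": {"city": "Arlington"}},
]


def fields_used(collection):
    """Return the sorted union of top-level field names across all documents."""
    return sorted({key for doc in collection for key in doc})


def find(collection, **criteria):
    """Return documents whose top-level fields equal every given criterion."""
    return [
        doc for doc in collection
        if all(doc.get(key) == value for key, value in criteria.items())
    ]


if __name__ == "__main__":
    print(fields_used(people))
    print(json.dumps(find(people, name="Grace"), indent=2))
```

Neither document had to declare its fields in advance, and a query simply skips documents that lack the field being matched.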

Integrating MongoDB Text Search with a Python App

By Mike O’Brien, 10gen software engineer and maintainer of Mongo-Hadoop
With the release of MongoDB 2.4, it’s now pretty simple to take an existing application that already uses MongoDB and add new...
Rescooped by Shiwon Cho from BIG data, Data Mining, Predictive Modeling, Visualization

Real-Time Visualization with Kafka, Storm, Redis, node.js & d3.js

LivePerson Developers host Byron Ellis, Chief Data Scientist, LivePerson (@fdaapproved). In this meetup, Byron will demonstrate a realtime dashboard for strea...

Via AnalyticsInnovations

The Future Of Technology Isn’t Mobile, It’s Contextual

You’re walking home alone on a quiet street. You hear footsteps approaching quickly from behind. It’s nighttime. Your senses scramble to help your brain figure out what to do. You listen for signs of threat or glance backward.

Neo4j Blog: Neo4j 1.9 General Availability Announcement!

Shiwon Cho's insight:

The 1.9 release adds primarily three things:

• Auto-Clustering, which makes Neo4j Enterprise clustering more robust & easier to administer, with fewer moving parts
• Cypher language improvements make the language more functionally powerful and more performant, and
• New welcome pages make learning easier for new users

Image Processing with C++ AMP and the .NET Framework - Visual C++ Team Blog - Site Home - MSDN Blogs

Image processing is a computational task that lends itself very well to GPU compute scenarios. In many cases the most commonly used algorithms are inherently massively parallel, with each pixel in the image being processed independently from the others. As a result, image processing toolkits have been early adopters of the new GPGPU programming model.


Big News! “Practical Data Science with R” MEAP launched!

Nina Zumel and I (John Mount) have been working very hard on producing an exciting new book called “Practical Data Science with R.” The book has now entered Manning Early Access Progr...

World Of Technology: The history of computer data storage, in pictures

The history of computer data storage, in pictures

New version of solaR

I have updated my package solaR. This package provides methods for calculating solar radiation and the performance of photovoltaic systems from …

Pydoop: Writing Hadoop Programs in Python

Installed as a layer above Hadoop, the open-source Pydoop package enables Python scripts to do big data work easily.

R package development

Building R packages is not particularly hard, but it can be a bit of a daunting endeavour at the beginning, particularly if you are more of a statistician than a computer scientist or programmer. Som...

Apache CloudStack: Open Source Cloud Computing

Apache CloudStack is open source software designed to deploy and manage large networks of virtual machines, as a highly available, highly scalable Infrastructure as a Service (IaaS) cloud computing platform. CloudStack is used by a number of service providers to offer public cloud services, and by many companies to provide an on-premises (private) cloud offering, or as part of a hybrid cloud solution.

CloudStack is a turnkey solution that includes the entire "stack" of features most organizations want with an IaaS cloud: compute orchestration, Network-as-a-Service, user and account management, a full and open native API, resource accounting, and a first-class User Interface (UI).


A Big Data introduction

Since R holds its data in the computer's RAM, it can handle only relatively small data sets. Nevertheless, some packages allow it to handle larger volumes, and the best solution is to connect R to a Big Data environment.

Creating parallel reactive and streaming applications with the Intel® Threading Building Blocks (Intel® TBB) flow graph | Intel® Developer Zone

The flow graph feature available in Intel® Threading Building Blocks (Intel® TBB) allows users to easily create both dependence graphs and reactive, message-passing graphs that execute on top of Intel TBB tasks. Users programmatically create nodes and edges that express the computations performed by their application and the dependencies between these computations.
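
The idea of nodes and edges expressing computations and their dependencies can be sketched in a few lines of Python. This is not the Intel TBB API (which is C++); it is a simplified stand-in that uses the standard-library `graphlib` module and runs nodes sequentially in dependency order. The `run_dependence_graph` helper and the four-node example graph are hypothetical.

```python
from graphlib import TopologicalSorter


def run_dependence_graph(nodes, edges):
    """Run each node's function once, after all of its predecessors.

    nodes: dict mapping node name -> function taking a list of predecessor results
    edges: list of (predecessor, successor) name pairs
    """
    predecessors = {name: [] for name in nodes}
    for src, dst in edges:
        predecessors[dst].append(src)

    results = {}
    # static_order() yields every node after the nodes it depends on.
    for name in TopologicalSorter(predecessors).static_order():
        inputs = [results[p] for p in predecessors[name]]
        results[name] = nodes[name](inputs)
    return results


# Hypothetical four-node graph: one source, two independent stages, one join.
nodes = {
    "source": lambda inputs: 3,
    "square": lambda inputs: inputs[0] ** 2,
    "cube": lambda inputs: inputs[0] ** 3,
    "join": lambda inputs: sum(inputs),
}
edges = [
    ("source", "square"),
    ("source", "cube"),
    ("square", "join"),
    ("cube", "join"),
]

if __name__ == "__main__":
    print(run_dependence_graph(nodes, edges))
```

The `square` and `cube` nodes share no edge, so a parallel runtime would be free to execute them concurrently; this sketch simply runs them one after the other.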


The Engineer’s Engineer | Agile Zone

Lately I’ve seen quite a few requests for advice from younger programmers, asking questions either directly to me or in public forums about a career decision...

How to use MongoDB as a pure in-memory DB (Redis style)

The idea: There has been a growing interest in using MongoDB as an in-memory database, meaning that the data is not stored on disk at all. This can be super useful for applications like:
• a...

High Scalability - Strategy: Stop Using Linked-Lists

What data structure is more sacred than the linked list? If we get rid of it, what silly interview...

Light Table

Light Table is a new interactive IDE that lets you modify running programs and embed anything from websites to games. It provides the real-time feedback we need not only to answer questions about our code, but to understand how our programs really work.
