EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU
Curated by Shiwon Cho
Scooped by Shiwon Cho

Using Hadoop Pig with MongoDB

In this post, we'll see how to install MongoDB support for Pig and we'll illustrate it with an example where we join 2 MongoDB collections with Pig and store the result in a new collection. Require...

The Structure of Big Data

First things first, all data is more or less structured. That being said, there is... Structured Data, Semi-Structured Data, and Unstructured Data. I tend to think of it as: data, composite or simple, wit...

Hive Plots - Linear Layout for Network Visualization - Visually Interpreting Network Structure and Content Made Possible

This hive plot shows the connections between similar genes (nodes) in three related genomes (SL, BA and SN). You can create hive plots in R using the HiveR package.

Phoenix: A SQL layer over HBase

Phoenix is a SQL layer over HBase, delivered as a client-embedded JDBC driver, powering the HBase use cases at Salesforce.com. Phoenix targets low-latency queries (milliseconds), as opposed to batch operation via map/reduce. To see what's supported, go to our language reference guide, and read more on our wiki.

Arc Diagrams in R: Les Miserables

In this post we will talk about the R package “arcdiagram” for plotting pretty arc diagrams. An arc diagram is a graphical display to visualize graphs or networks in a one-dimensional layout.
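
To make the "one-dimensional layout" idea concrete, here is a minimal Python sketch of the geometry behind an arc diagram: nodes sit at evenly spaced points on a line, and each edge becomes a semicircle between its two endpoints. This is not the API of the “arcdiagram” R package; the function name `arc_layout`, the even spacing, and the sample edges are assumptions made purely for illustration.

```python
def arc_layout(nodes, edges):
    """Place nodes on a line and describe each edge as a semicircle.

    Hypothetical helper for illustration, not part of any package.
    Returns (position, arcs): position maps node -> x coordinate, and
    arcs is a list of (u, v, center_x, radius) tuples.
    """
    # Assumption: nodes are spaced one unit apart, in the order given.
    position = {node: float(i) for i, node in enumerate(nodes)}
    arcs = []
    for u, v in edges:
        center = (position[u] + position[v]) / 2
        radius = abs(position[u] - position[v]) / 2
        arcs.append((u, v, center, radius))
    return position, arcs


# Illustrative data only: the edges below are made up for the example.
nodes = ["Valjean", "Javert", "Cosette", "Marius"]
edges = [("Valjean", "Javert"), ("Valjean", "Cosette"), ("Cosette", "Marius")]
position, arcs = arc_layout(nodes, edges)
for u, v, center, radius in arcs:
    print(f"{u} - {v}: center x={center}, radius={radius}")
```

The node order is the main design choice in such a layout, since it determines how long the arcs are and how often they cross.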

modern.IE – A new set of tools to help test web site compatibility - IEBlog - Site Home - MSDN Blogs

modern.IE includes a wizard that scans a Web page URL for common interoperability problems and suggests some ideas for how to address those issues to improve the user experience across modern and older browsers.

Twitter Engineering: Introducing Flight: a web application framework

Flight is distinct from existing frameworks in that it doesn't prescribe or provide any particular approach to rendering or providing data to a web application. It's agnostic on how requests are routed, which templating language you use, or even if you render your HTML on the client or the server. While some web frameworks encourage developers to arrange their code around a prescribed model layer, Flight is organized around the existing DOM model with functionality mapped directly to DOM nodes.

Scaling Real-time Apps on Cloud Foundry Using Node.js and Redis

Common applications being built on Node.js like social networking and chat require real-time scaling capabilities across multiple instances. Developers need to deal with sticky sessions, scale-up, ...

A Brief Review of the Land Registry Linked Data

The Land Registry have today announced the publication of their Open Data -- including both Price Paid information and Transactions as Linked Data. This is great to see, as it means that there is a...

The Big Data Landscape

It’s far from being complete, but the infographic published on The Big Data Landscape site provides a good overview of the big data landscape including a wide range of companies, products, and...

Unit Testing in R | Architects Zone

R is a statistical programming language, with a strong focus on mathematical operations. When writing code that is math-heavy, unit testing becomes very...

Redis-Stat: Redis Monitoring With Netflix's Hystrix Flavor

A very early stage of a Redis monitoring tool using hiredis and express on Node.js, presenting a dashboard inspired by Netflix’s Hystrix.
The project is on GitHub so you can send some pull requests...

Framer: Modern Prototyping

Framer is a modern prototyping tool. It can help you to quickly build and test complex interactions and rich animations for both desktop and mobile.

MongoDB 2.4 Highlights

MongoDB 2.4 is just around the corner:
From Mike Friedman’s Roadmap slidedeck.

Learning RStudio for R Statistical Computing

Learning RStudio for R Statistical Computing will teach you how to quickly and efficiently create and manage statistical analysis projects, import data, develop R scripts, and generate reports and graphics.

Online C++ compilers : Standard C++

Many people don’t realize how many web pages offer access to try out C++ compilers, including many of the latest compilers with burgeoning C++11 language support. So we thought we’d publish a list.

Sorting data in parallel CPU vs GPU | Solarian Programmer

For many programmers, sorting data in parallel means implementing a state-of-the-art algorithm in their preferred programming language. However, most programming languages have a good serial sorting function in their standard library. It appears to me that the obvious thing to do is to first try to use what your language library provides. If this approach is not successful, you should try to find an existing library that is used, and consequently well debugged, by other programmers. Only as a last resort should you implement a new sorting algorithm from scratch.
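
As a rough illustration of that ordering of choices, the Python sketch below starts with the standard library's serial `sorted`, then shows what a hand-rolled fallback might look like: sort fixed-size chunks independently and merge them with `heapq.merge`. The function names, the chunk count, and the pluggable `map_fn` parameter are assumptions for this example and are not taken from the linked article.

```python
import heapq
import random


def library_sort(values):
    """First choice: the serial sort from the standard library."""
    return sorted(values)


def chunked_sort(values, n_chunks=4, map_fn=map):
    """Last-resort sketch: sort chunks independently, then k-way merge.

    map_fn defaults to the built-in (serial) map. Any map-like callable
    can be substituted to distribute the per-chunk sorts.
    """
    if not values:
        return []
    size = -(-len(values) // n_chunks)  # ceiling division
    chunks = [values[i:i + size] for i in range(0, len(values), size)]
    sorted_chunks = list(map_fn(sorted, chunks))
    # heapq.merge lazily merges already-sorted inputs into one sorted stream.
    return list(heapq.merge(*sorted_chunks))


random.seed(0)
data = [random.randint(0, 10_000) for _ in range(1_000)]
assert chunked_sort(data) == library_sort(data)
```

Passing the `map` method of a `concurrent.futures.ProcessPoolExecutor` as `map_fn` (inside an `if __name__ == "__main__":` guard) is one way to spread the chunk sorts across processes. Whether that beats the plain library sort depends on data size and inter-process copying costs, so it is worth measuring before adopting.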

B-Tree Containers from Google : Standard C++

Google has graciously gifted to the community a set of STL-like containers that use B-trees under the covers. The code has been released under the Apache 2 license.

"Ordered Information" is not a paint job

At Davos 2013, CEO Marissa Mayer unveiled her vision for Yahoo's rebirth, affirming it wants to be: a feed of information that is ordered, the Web is ordered for you and is also on your mobile phon...

Stack Up Hadoop to Find Its Place in Your Architecture

2013 promises to be a banner year for Apache Hadoop, platform providers, related technologies – and analysts who try to sort it out. I’ve been wrestling with ways to make sense of it for Gartner clients bewildered by a new set of choices, and for them and myself, I’ve built a stack diagram that describes possible functional layers of a Hadoop-based model.

Publishing SPARQL queries and documentation using github

Yesterday I released an early version of sparql-doc a SPARQL documentation generator. I've got plans for improving the functionality of the tool, but I wanted to briefly document how to use github ...

SPARQL with R in less than 5 minutes - ProgrammingR

In this article we'll get up and running on the Semantic Web in less than 5 minutes using SPARQL with R.

Parallel R: Quick Ways Model More (The Data Warehouse Insider)

Blogs.Oracle.Com - The Data Warehouse Insider

I am less and less often mistaken for a pirate when I mention the R language. While I miss the excuse to wear an eyepatch, I'm glad more people are beginning to explore a statistical language I've been touting for years. When it comes to plotting or running complex statistics in a single line of code, R is a great tool to have. That said, there are plenty of pitfalls for the casual or new user: syntax, learning to write vectorized code, or even just knowing which "apply" function you really should choose.

I want to explore a slightly less-often considered aspect of R development: parallelism. Out of the box, R can seem very limited to someone used to working on compute clusters or even a multicore server. However, there are a few tricks we can leverage to get the most out of R on everything from a personal workstation to a Hadoop cluster.

5 online tools in data visualization playground - hadoopsphere.com

hadoopsphere.com: 5 online tools in data visualization playground