EEDSP
17.5K views | +10 today
Follow
EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU, Distributed and Parallel Computing
Curated by Shiwon Cho
Your new post is loading...
Your new post is loading...
Scooped by Shiwon Cho
Scoop.it!

OptaPlanner - Can MapReduce solve planning problems?

OptaPlanner - Can MapReduce solve planning problems? | EEDSP | Scoop.it

To solve a planning or optimization problem, some solvers tend to scale out poorly: As the problem has more variables and more constraints, they use a lot more RAM memory and CPU power. They can hit hardware memory limits at a few thousand variables and few million constraint matches. One way their users typically work around such hardware limits, is to use MapReduce. Let’s see what happens if we MapReduce a planning problem, such as the Traveling Salesman Problem.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Coding for SSDs – Part 1: Introduction and Table of Contents | Code Capsule

Introduction I want to make solid-state drives (SSDs) the optimal storage solution for my key-value store project. For that reason, I had to make sure I
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

As MapReduce fades, Apache Spark is now a top-level project

As MapReduce fades, Apache Spark is now a top-level project | EEDSP | Scoop.it
Apache Spark, an in-memory data-processing framework, is now a top-level Apache project. That’s an important step for Spark’s stability as it increasingly replaces MapReduce in next-generation big data applications.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

A hackable text editor for the 21st Century

A hackable text editor for the 21st Century | EEDSP | Scoop.it
At GitHub, we’re building the text editor we’ve always wanted: hackable to the core, but approachable on the first day without ever touching a config file. We can’t wait to see what you build with it.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Apache Hadoop 2.3.0 Released! - Hortonworks

Apache Hadoop 2.3.0 Released! - Hortonworks | EEDSP | Scoop.it

Announcing the release of Hadoop 2.3.0 with 560 JIRAs fixed. 

 

Hadoop-2.3.0 is the first release for the year 2014, and brings a number of enhancements to the core platform, in particular to HDFS.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Embedded Analytics and Statistics for Big Data

Embedded Analytics and Statistics for Big Data | EEDSP | Scoop.it
This article provides an overview of tools and libraries available for embedded data analytics and statistics, both stand-alone software packages and programming languages with statistical capabilities. The authors also discuss how to combine and integrate these embedded analytics technologies to handle big data.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Top 18 Best Javascript Frameworks for Developers

Top 18 Best Javascript Frameworks for Developers | EEDSP | Scoop.it
JavaScript framework is the one of the best features of the JavaScript programming language, Basically JavaScript framework is the set of pre-written JavaScript code which helps to easier developme...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

5 online tools in data visualization playground - hadoopsphere.com

5 online tools in data visualization playground - hadoopsphere.com | EEDSP | Scoop.it

While building up an analytics dashboard, one of the major decision points is regarding the type of charts and graphs that would provide better insight into the data.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

MapReduce Patterns, Algorithms, and Use Cases

MapReduce Patterns, Algorithms, and Use Cases | EEDSP | Scoop.it
In this article I digested a number of MapReduce patterns and algorithms to give a systematic view of the different techniques that can be found on the web or scientific articles. Several practical...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Data&storage » Largest Redis Clusters Ever

Data&storage » Largest Redis Clusters Ever | EEDSP | Scoop.it

Tape is Dead,Disk is Tape,Flash is Disk,RAM Locality is King.       — Jim Gray

Redis不是比较成熟的Memcache或者Mysql的替代品,是对于大型互联网类应用在架构上很好的补充。现在有越来越多的应用也在纷纷基于Redis做架构的改造。

可以简单公布一下Redis平台实际情况

2200+亿 commands/day   5000亿Read/day   500亿Write/day

18TB+ Memory

500+ Servers in 6 IDC    2000+instances

应该是国内外比较大的Redis使用平台,今天主要从应用角度谈谈Redis服务平台。

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Redis 3.0.0 beta release

Redis 3.0.0 beta release | EEDSP | Scoop.it

This is the first beta of Redis 3.0.0. Redis 3.0 features support for Redis Cluster and important speed improvements under certain workloads.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Hadoop's 10 in Facebook's 10 - hadoopsphere.com

Hadoop's 10 in Facebook's 10 - hadoopsphere.com | EEDSP | Scoop.it

As the social media giant Facebook celebrated it's 10th anniversary, let's take a look at how the company has been impacting Hadoop ecosystem. Listed below are 10 ecosystem projects in which Facebook has done significant open source contributions. Also, towards the end, listed below are some of the other Hadoop ecosystem projects which are not yet open source but occupy a position of prominence inside the company's technical environment.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Using Apache Cassandra for Real-Time Analytics | Javalobby

Using Apache Cassandra for Real-Time Analytics | Javalobby | EEDSP | Scoop.it

If you're interested in using Cassandra for real-time analytics, you might find something useful in this talk from Stephane Legay, CTO at LoopLogic, on LoopLogic's use case.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

ZooKeeper Resilience at Pinterest

ZooKeeper Resilience at Pinterest | EEDSP | Scoop.it

ZooKeeper Resilience at Pinterest. Apache ZooKeeper is an open source distributed coordination service that’s popular for use cases like service discovery, dynamic configuration management and distributed locking. While it’s versatile and useful, it has failure modes that can be hard to prepare for and recover from, and if used for site critical functionality, can have a significant impact on site availability.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Future of MongoDB: Fireside chat with MongoDB CTO Eliot Horowitz

MongoDB CTO Eliot Horowitz discusses the future of MongoDB and the product roadmap at the new MongoDB Inc. office in Palo Alto.
more...
Benjamin Dean's curator insight, March 3, 2014 4:52 PM

Future of MongoDB

Scooped by Shiwon Cho
Scoop.it!

High Dimensional Biological Data Analysis and Visualization

High Dimensional Biological Data Analysis and Visualization | EEDSP | Scoop.it
High dimensional biological data shares many qualities with other forms of data. Typically it is wide (samples << variables), complicated by experiential design and made up of complex relationships driven by both biological and analytical sources of variance. Luckily the powerful combination of R, Cytoscape (< v3) and the R package RCytoscape can be used […]
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Data Science Kit - Get - O'Reilly Media

Data Science Kit - Get - O'Reilly Media | EEDSP | Scoop.it
O'Reilly Strata: The Data Science Starter Kit gives you the tools you need to get started with data. This kit includes everything you need from analysis, visualization, to management.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Beagle Supercomputer is a Genome Smasher - insideHPC

Beagle Supercomputer is a Genome Smasher - insideHPC | EEDSP | Scoop.it
The Beagle Supercomputer at the University of Chicago can analyze 240 whole genomes in two days.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Apache Tajo Enters the SQL-on-Hadoop Space

Apache Tajo Enters the SQL-on-Hadoop Space | EEDSP | Scoop.it

The number of SQL options for Hadoop expanded substantially over the last 18 months. Most get a large amount of attention when announced, but a few slip under the radar. One of these low-flying options is Apache Tajo. I learned about Tajo in November of 2013 at a Hadoop User Group meeting.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Intel® Xeon® Processor E7 V2 Family Technical Overview

Intel® Xeon® Processor E7 V2 Family Technical Overview | EEDSP | Scoop.it

The new Intel Xeon processor E7 v2 product family is designed to make data more valuable for your business through in-memory computing – one of the more recent advances in data management and analytic solutions, which stores the entire data set in main memory rather than traditional hard disk storage. In-memory database and analytics solutions enable significant performance gains in analyzing complex and diverse datasets. We’re talking about analysis in seconds or minutes rather than hours or days. This is how you get to real-time insight.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

NoSQL Data Modeling Techniques

NoSQL Data Modeling Techniques | EEDSP | Scoop.it
NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. This aspect of NoSQL is well-studied both in practice and theory because sp...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Plan 9

The University of California, Berkeley, has been authorised by Alcatel-Lucent to release all Plan 9 software previously governed by the Lucent Public License, Version 1.02 under the GNU General Public License, Version 2.Click here to edit the title

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

High Scalability - High Scalability - Paper: Network Stack Specialization for Performance 

High Scalability - High Scalability - Paper: Network Stack Specialization for Performance  | EEDSP | Scoop.it
In the scalability is specialization department here is an interesting paper presented at HotNe...
more...
No comment yet.