EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU, Distributed and Parallel Computing
Curated by Shiwon Cho

Beagle Supercomputer is a Genome Smasher - insideHPC

The Beagle Supercomputer at the University of Chicago can analyze 240 whole genomes in two days.

Apache Tajo Enters the SQL-on-Hadoop Space

The number of SQL options for Hadoop expanded substantially over the last 18 months. Most get a large amount of attention when announced, but a few slip under the radar. One of these low-flying options is Apache Tajo. I learned about Tajo in November of 2013 at a Hadoop User Group meeting.

Intel® Xeon® Processor E7 V2 Family Technical Overview

The new Intel Xeon processor E7 v2 product family is designed to make data more valuable for your business through in-memory computing – one of the more recent advances in data management and analytic solutions, which stores the entire data set in main memory rather than traditional hard disk storage. In-memory database and analytics solutions enable significant performance gains in analyzing complex and diverse datasets. We’re talking about analysis in seconds or minutes rather than hours or days. This is how you get to real-time insight.

NoSQL Data Modeling Techniques

NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. This aspect of NoSQL is well-studied both in practice and theory because sp...

Plan 9

The University of California, Berkeley, has been authorised by Alcatel-Lucent to release all Plan 9 software previously governed by the Lucent Public License, Version 1.02 under the GNU General Public License, Version 2.

High Scalability - Paper: Network Stack Specialization for Performance

In the scalability is specialization department here is an interesting paper presented at HotNe...

Yahoo's New Long Game: Contextual Search

Marissa Mayer is betting that Yahoo can beat Google at search - not on desktops but on mobile, where users need specialized results based on context.

High Scalability - Stuff The Internet Says On Scalability For February 7th, 2014

Hey, it's HighScalability time:


Google "Corkboard" Server, 1999

5 billion reques...

HDFS Explorer: Accessing HDFS from Windows Explorer

HDFS Explorer, by Red Gate Big Data:
“At Red Gate we have been working on some query tools for Hadoop for a while and while testing we found ourselves endlessly typing hadoop fs. Getting data sets...

Data Mining in R: Analyzing the word use in tweets with the hashtag #tltsym13 | Stephanie E. Vasko | Teaching and Learn...

This weekend I had the opportunity to attend Penn State’s “Teaching and Learning with Technology” Symposium. In addition to hearing some great talks about innovation, learning analytics, and the PSU strategy on MOOCs, I was energized to pick up with data/text mining in R. Learning analytics (LA) and the future of their use have fascinated me for quite some time, and I have been eager to combine my developing R skills with data mining techniques.

 

REEF | The Retainable Evaluator Execution Framework

REEF stands for the Retainable Evaluator Execution Framework, and it is our approach to simplify and unify the lower layers of big data systems on modern resource managers like Apache YARN, Apache Mesos, Google Omega, and Facebook Corona. On these resource managers, REEF provides a centralized control plane abstraction that can be used to build a decentralized data plane for supporting big data systems, like those mentioned below. Special consideration is given to graph computation and machine learning applications, which require data retention on allocated resources, as they execute multiple passes over the data.

InfluxDB

InfluxDB is a time series, events, and metrics database. It's written in Go and has no external dependencies. That means once you install it there's nothing else to manage (like Redis, HBase, or whatever). It's designed to be distributed and scale horizontally, but be useful even 

Top 18 Best Javascript Frameworks for Developers

A JavaScript framework is one of the best features of the JavaScript programming language. Basically, a JavaScript framework is a set of pre-written JavaScript code which helps to easier developme...

5 online tools in data visualization playground - hadoopsphere.com

While building up an analytics dashboard, one of the major decision points is regarding the type of charts and graphs that would provide better insight into the data.

MapReduce Patterns, Algorithms, and Use Cases

In this article I digested a number of MapReduce patterns and algorithms to give a systematic view of the different techniques that can be found on the web or scientific articles. Several practical...

Data&storage » Largest Redis Clusters Ever

Tape is Dead, Disk is Tape, Flash is Disk, RAM Locality is King. - Jim Gray

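Redis is not a replacement for the more mature Memcache or MySQL; it is a very good architectural complement for large-scale Internet applications. More and more applications are now reworking their architectures around Redis.

Here is a brief look at the actual state of our Redis platform:

220+ billion commands/day   500 billion reads/day   50 billion writes/day

18TB+ memory

500+ servers in 6 IDCs   2000+ instances

This is probably one of the larger Redis deployments in China or abroad. Today I will mainly talk about the Redis service platform from the application's point of view.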

Redis 3.0.0 beta release

This is the first beta of Redis 3.0.0. Redis 3.0 features support for Redis Cluster and important speed improvements under certain workloads.

Hadoop's 10 in Facebook's 10 - hadoopsphere.com

As the social media giant Facebook celebrates its 10th anniversary, let's take a look at how the company has been impacting the Hadoop ecosystem. Listed below are 10 ecosystem projects to which Facebook has made significant open source contributions. Also listed, towards the end, are some other Hadoop ecosystem projects which are not yet open source but occupy a position of prominence inside the company's technical environment.

Red Hat and Hortonworks fuse Hadoop and cloud in new partnership

Red Hat and Hortonworks are integrating a number of technologies to give joint customers a more seamless experience running their Hadoop workloads on private cloud or virtualized infrastructure. In an upstart market worth billions, it helps to have friends like Red Hat.

Top 11 Free Software for Text Analysis, Text Mining, Text Analytics - Predictive Analytics Today

Review of Top 11 Free Software for Text Analysis, Text Mining, Text Analytics: KH Coder, Carrot2, GATE, tm, Gensim, Natural Language Toolkit, RapidMiner, Unstructured Information Management Architecture, OpenNLP, KNIME, Orange-Textable and LPU are some of the key vendors who provide text analytics software.

The Art and Science of Data Visualization Part 1 – Introduction to Data Visualization | BEyond

In part 1 of this Data Visualization video series, Dr. Dakshinamurthy Kolluru explains why data visualization is important for engineers.

Sigma.js 1.0, the next-gen graph drawing lib for the Web, is out! - Linkurious. See Graph Databases Easily

At Linkurious our mission is to deliver enterprise-level applications for graph visualization and exploration. We naturally integrate world-class open source technologies in our products, like Neo4j, ElasticSearch and Node.js on the backend. On our Web frontend, graphs are rendered using our fork of Sigma.js, the most efficient graph visualization library on the market. Today the …

Neo4j Blog: Neo4j 2.0 GA - Graphs for Everyone

A dozen years ago, we created a graph database because we needed it. We focused on performance, reliability and scalability, cementing a foundation for graph databases with the 0.x series, then expanding the features with the 1.x series. Today, we announce the first of the 2.x series of Neo4j and a commitment to take graph databases further to the mainstream.
