EEDSP
Follow
Find
10.3K views | +4 today
 
EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU
Curated by Shiwon Cho
Your new post is loading...
Your new post is loading...
Scooped by Shiwon Cho
Scoop.it!

C++14 - Wikipedia, the free encyclopedia

C++14

C++14 is the informal name for the most recent revision of the C++ ISO/IEC standard, formally " International Standard ISO/IEC 14882:2014(E) Programming Language C++". C++14 is intended to be a small extension over C++11, featuring mainly bug fixes and small improvements.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Why a deep-learning genius left Google & joined Chinese tech shop Baidu (interview)

Why a deep-learning genius left Google & joined Chinese tech shop Baidu (interview) | EEDSP | Scoop.it

Chinese tech company Baidu has yet to make its popular search engine and other web services available in English. But consider yourself warned: Baidu could someday wind up becoming a favorite among consumers.

 
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

How-to: Use IPython Notebook with Apache Spark

How-to: Use IPython Notebook with Apache Spark | EEDSP | Scoop.it

The developers of Apache Spark have given thoughtful consideration to Python as a language of choice for data analysis. They have developed the PySpark API for working with RDDs in Python, and further support using the powerful IPythonshell instead of the builtin Python REPL.

 
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Unified Memory: Now for CUDA Fortran Programmers

Unified Memory: Now for CUDA Fortran Programmers | EEDSP | Scoop.it
Unified Memory is a CUDA feature that we've talked a lot about on Parallel Forall. CUDA 6 introduced Unified Memory, which dramatically simplifies GPU programming by giving programmers a single poi...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

HPCC Systems from LexisNexis Celebrates Third Open-Source Anniversary, Releases 5.0 Version - insideBIGDATA

HPCC Systems from LexisNexis Celebrates Third Open-Source Anniversary, Releases 5.0 Version - insideBIGDATA | EEDSP | Scoop.it
HPCC Systems from LexisNexis Celebrates Third Open-Source Anniversary, Releases 5.0 Version
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Wyvern: a Language for Engineering Mobile and Web Applications

Wyvern: a Language for Engineering Mobile and Web Applications | EEDSP | Scoop.it

This site describes the rationale for the Wyvern programming language targeted at potential users of the language. It will grow to include a specification for Wyvern as well. The Wyvern programming language is a new language designed to smooth the use of internal DSLs in the form of type-specific languages.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

JavaScript and V8 TurboFan

JavaScript and V8 TurboFan | EEDSP | Scoop.it
Recently, Google engineers landed a new optimizing JavaScript compiler for V8, codenamed TurboFan. As the name implies, this is supposed to further improve JavaScript execution speed, likely to be ...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Collect & visualize your logs with Logstash, Elasticsearch & Redis

Collect & visualize your logs with Logstash, Elasticsearch & Redis | EEDSP | Scoop.it
Update of December 6th : although Logstash does the job as a log shipper, you might consider replacing it with Lumberjack / Logstash Forwarder,...
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

IBM researchers make a chip full of artificial neurons

IBM researchers make a chip full of artificial neurons | EEDSP | Scoop.it

Coprocessors and a neural network supercomputer may follow. 

A team of scientists at Cornell University and IBM Research have gotten together to design a chip that's fundamentally different: an asynchronous collection of thousands of small processing cores, each capable of the erratic spikes of activity and complicated connections that are typical of neural behavior. When hosting a neural network, the chip is remarkably power efficient. And the researchers say their architecture can scale arbitrarily large, raising the prospect of a neural network supercomputer.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Google shows off Mesa, a super-fast data warehouse that runs across data centers

Google shows off Mesa, a super-fast data warehouse that runs across data centers | EEDSP | Scoop.it
Google has published a paper about its latest big data system, a globally distributed data warehouse called Mesa that can ingest millions of rows in minutes and even survive a data center failure.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Accelerate R Applications with CUDA

Accelerate R Applications with CUDA | EEDSP | Scoop.it

In this article, I will introduce the computation model of R with GPU acceleration, focusing on three topics:

accelerating R computations using CUDA libraries;calling your own parallel algorithms written in CUDA C/C++ or CUDA Fortran from R; andprofiling GPU-accelerated R applications using the CUDA Profiler.

 

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

A first step toward more global email

A first step toward more global email | EEDSP | Scoop.it

 In 2012, an organization called the Internet Engineering Task Force (IETF) created a new email standard that supports addresses with non-Latin and accented Latin characters (e.g. 武@メール.グーグル). In order for this standard to become a reality, every email provider and every website that asks you for your email address must adopt it. That’s obviously a tough hill to climb. The technology is there, but someone has to take the first step.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Tumblr: Hashing Your Way to Handling 23,000 Blog Requests per Second - High Scalability -

Tumblr: Hashing Your Way to Handling 23,000 Blog Requests per Second - High Scalability - | EEDSP | Scoop.it

At Tumblr, blogs (or Tumblelog) are one of our most highly trafficked faces on the internet.  One of the most convenient aspects of tumblelogs is their highly cacheable nature, which is fantastic because of the high views/post ratio the Tumblr network offers our users.  That said, it's not entirely trivial to scale out the perimeter proxy tier, let alone the caching tier, necessary for serving all of those requests.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

38 Seminal Articles Every Data Scientist Should Read

38 Seminal Articles Every Data Scientist Should Read | EEDSP | Scoop.it
Here is selection containing both external and internal papers, focusing on various technical aspects of data science and big data. Feel free to add your favor…
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

The Easy Way of Building a Growing Startup Architecture Using HAProxy, PHP, Redis and MySQL to Handle 1 Billion Requests a Week - High Scalability -

The Easy Way of Building a Growing Startup Architecture Using HAProxy, PHP, Redis and MySQL to Handle 1 Billion Requests a Week - High Scalability - | EEDSP | Scoop.it

In the post I'll show you the way we developed quite simple architecture based on HAProxy, PHP, Redis and MySQL that seamlessly handles approx 1 billion requests every week. There’ll be also a note of the possible ways of further scaling it out and pointed uncommon patterns, that are specific for this project.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

AMPLab - UC Berkeley

AMPLab - UC Berkeley | EEDSP | Scoop.it

Algorithms, Machines and People Lab.

AMPLab Overview

We make sense of the world around us by turning data into information. For years, research in fields such as machine learning (ML), data mining, databases, information retrieval, natural language processing, and speech recognition have steadily improved their techniques for revealing the information lying within otherwise opaque datasets. But computer science is now on the verge of a new era in data analysis because of several recent developments, including: the rise of the warehouse-scale computer (WSC), the massive explosion in online data, the increasing diversity and time-sensitivity of queries, and the advent of crowdsourcing. Together these trends — often referred to collectively as Big Data — have the potential for ushering in a new era in data analysis, but to realize this opportunity requires us to confront several significant scientific challenges:

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Stream Processing with a Spreadsheet | Lambda the Ultimate

Continuous data streams are ubiquitous and represent such a high volume of data that they cannot be stored to disk, yet it is often crucial for them to be analyzed in real-time. Stream processing is a programming paradigm that processes these immediately, and enables continuous analytics. Our objective is to make it easier for analysts, with little programming experience, to develop continuous analytics applications directly. We propose enhancing a spreadsheet, a pervasive tool, to obtain a programming platform for stream processing. We present the design and implementation of an enhanced spreadsheet that enables visualizing live streams, live programming to compute new streams, and exporting computations to be run on a server where they can be shared with other users, and persisted beyond the life of the spreadsheet. We formalize our core language, and present case studies that cover a range of stream processing applications.

 
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Khronos Announces OpenCL SPIR 2.0

Khronos Announces OpenCL SPIR 2.0 | EEDSP | Scoop.it

Khronos released OpenCL SPIR 1.2 as a provisional specification, keeping it there over a protracted period to solicit feedback over the first version of the standard. Since that provisional release, Khronos finalized OpenCL 1.2 SPIR in early 2014 and has been working on building up their developer and user bases for SPIR.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Using Big Data To Understand Migrations

Using Big Data To Understand Migrations | EEDSP | Scoop.it
Monitoring migrations is not an easy task. While in today's economy, survey data about economic confidence or public opinion are collected on a daily basis, that is not the case for migration statistics, which come from Censuses, population registers, and, occasionally, ad-hoc surveys--and are often outdated and inconsistent across countries. [...]
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

35 Free eBooks On Control System

35 Free eBooks On Control System | EEDSP | Scoop.it
Here's bringing 35 absolutely free ebooks to help you with all you ever wanted to learn about various control systems. Have fun!
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Atomic Scala – Free E-Books | Typesafe

Atomic Scala – Free E-Books | Typesafe | EEDSP | Scoop.it
Get FREE eBooks. Learn best practices for building reactive applications.
Atomic Scala
(Sample Chapters)by Bruce Eckel and Dianne Marsh

This should be your first Scala book, not your last. We show you enough to become familiar and comfortable with the language – competent, but not expert. You’ll write useful Scala code, but you won’t necessarily be able to read all the Scala code you encounter.

 
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

App Indexing for Google Search — Google Developers

App Indexing for Google Search — Google Developers | EEDSP | Scoop.it

App Indexing for Google Search.

App Indexing helps you drive usage of your app through Google. Deep links to your app appear in Google Search results on Android so users can get to your native mobile experience quickly and easily.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing

Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google's Internet advertising business. Mesa is designed to satisfy a complex and challenging set of user and systems requirements, including near real-time data ingestion and queryability, as well as high availability, reliability, fault tolerance, and scalability for large data and query volumes. Specifically, Mesa handles petabytes of data, processes millions of row updates per second, and serves billions of queries that fetch trillions of rows per day. Mesa is geo-replicated across multiple datacenters and provides consistent and repeatable query answers at low latency, even when an entire datacenter fails. This paper presents the Mesa system and reports the performance and scale that it achieves.

more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

Hadoop YARN Installation: The definitive guide

Hadoop YARN Installation: The definitive guide | EEDSP | Scoop.it
This article guides you in the installation of the new generation Hadoop based on YARN. It is based on the most recent version of Hadoop at the time of this writing (2.2.0) and includes HDFS, YARN and MapReduce configurations for both single-node and cluster environments.
more...
No comment yet.
Scooped by Shiwon Cho
Scoop.it!

The lab that created Spark wants to speed up everything, including cures for cancer

The lab that created Spark wants to speed up everything, including cures for cancer | EEDSP | Scoop.it

AMPLab, the University of California, Berkeley, research group responsible for making Spark a household name in big data, has a lot more tricks up its sleeve. They range from databases to machine learning, and even include tools that could help treat cancer.

more...
No comment yet.