EEDSP
Digital Signal Processing, Data Analytics, Big Data, HPC, Deep Learning, GPGPU, Distributed and Parallel Computing
Curated by Shiwon Cho

NVIDIA Deep Learning Software Platform Updated with DIGITS, cuDNN, GIE | NVIDIA Blog

New updates to the NVIDIA Deep Learning SDK — DIGITS 4, cuDNN and GIE — help data scientists and developers make the most of the vast opportunities in deep learning.

Introduction to Neural Machine Translation with GPUs (part 3)

Note: This is the final part of a detailed three-part series on machine translation with neural networks by Kyunghyun Cho. You may enjoy part 1 and part 2. In the previous post in this series, I in...

Welcome to Mega-KV Homepage!

In-memory key-value stores play a critical role in data processing by providing high-throughput, low-latency data access. They have several unique properties: (1) data-intensive operations that demand high memory bandwidth for fast data access, (2) high data parallelism and simple computing operations that demand many slim parallel computing units, and (3) a large working set. As data volume continues to increase, our experiments show that conventional, general-purpose multicore systems are increasingly mismatched to these properties: they do not provide massive data parallelism and high memory bandwidth; their powerful but limited number of computing cores does not satisfy the demands of this data processing task; and the cache hierarchy may not benefit the large working set well. In this paper, we make a strong case for GPUs to serve as special-purpose devices that greatly accelerate the operations of in-memory key-value stores. Specifically, we present the design and implementation of Mega-KV, a GPU-based in-memory key-value store system that achieves high performance and high throughput. By effectively utilizing the high memory bandwidth and latency-hiding capability of GPUs, Mega-KV provides fast data access and significantly boosts overall performance. Running on a commodity PC equipped with two CPUs and two GPUs, Mega-KV can process up to 160+ million key-value operations per second, which is 1.4-2.8 times as fast as the state-of-the-art key-value store system on a conventional CPU-based platform.

The Power of C++11 in CUDA 7

Today I'm excited to announce the official release of CUDA 7, the latest release of the popular CUDA Toolkit. Download the CUDA Toolkit version 7 now from CUDA Zone! CUDA 7 has a huge number of imp...

DIGITS: Deep Learning GPU Training System

The hottest area in machine learning today is Deep Learning, which uses Deep Neural Networks (DNNs) to teach computers to detect recognizable concepts in data. Researchers and industry practitioner...

10 Ways CUDA 6.5 Improves Performance and Productivity

Today we're excited to announce the release of the CUDA Toolkit version 6.5. CUDA 6.5 adds a number of features and improvements to the CUDA platform, including support for CUDA Fortran in develope...

Accelerate R Applications with CUDA

In this article, I will introduce the computation model of R with GPU acceleration, focusing on three topics:

accelerating R computations using CUDA libraries; calling your own parallel algorithms written in CUDA C/C++ or CUDA Fortran from R; and profiling GPU-accelerated R applications using the CUDA Profiler.

NVIDIA GPUs Deliver a Shot in the ARM for HPC Industry

Among the most interesting announcements at this week’s ISC’14 is the emergence of a new class of system – one that marries the many advantages of ARM processors with the massively parallel processing power of NVIDIA Tesla GPU accelerators. This is great news for the industry. Initially designed for micro-servers and web servers, ARM64 server processors…

CUDA Spotlight: GPU-Accelerated Deep Learning

Our Spotlight is on Dr. Ren Wu, a distinguished scientist at Baidu's Institute of Deep Learning (IDL). He is known for his pioneering research in using GPUs to accelerate big data analytics and his...

The CUDA Thrust API Now Supports Streams and Concurrent Tasks

The CUDA Thrust API now supports streams and concurrent kernels through the use of a new API called Bulk created by Jared Hoberock at NVIDIA. The design of Bulk is intended to extend the parallel e...

CUDA Spotlight: GPU-Accelerated Speech Recognition

This week's Spotlight is on Dr. Ian Lane of Carnegie Mellon University. Ian is an Assistant Research Professor and leads a speech and language processing research group based in Silicon Valley. He ...
Topiary eDiscovery LLC's curator insight, March 12, 2014 10:30 AM

Carnegie Mellon... the Incubator.

How New Features in CUDA 6 Make GPU Acceleration Easier - insideHPC

In this video from the Nvidia booth at SC13, Mark Harris from Nvidia presents: New Features in CUDA 6 Make GPU Acceleration Easier.

gpucc: an open-source GPGPU compiler

Authors: Jingyue Wu, Artem Belevich, Eli Bendersky, Mark Heffernan, Chris Leary, Jacques Pienaar, Bjarke Roune, Rob Springer, Xuetian Weng, Robert Hundt
Tags: Code generation, Compilers, Computer science, CUDA, LLVM, nVidia, Presentation, Tesla K40

Introduction to Neural Machine Translation with GPUs (Part 2)

Note: This is part two of a detailed three-part series on machine translation with neural networks by Kyunghyun Cho. You may enjoy part 1 and part 3. In my previous post, I introduced statistical m...

cuDNN v2: Higher Performance for Deep Learning on GPUs

The cuDNN library team is excited to announce the second version of cuDNN, NVIDIA’s library of GPU-accelerated primitives for deep neural networks (DNNs). We are proud that the cuDNN library has se...

NVIDIA’s Next-Gen Pascal GPU Architecture to Provide 10X Speedup for Deep Learning Apps | The Official NVIDIA Blog

NVIDIA’s Pascal GPU architecture, set to debut next year, will accelerate deep learning applications 10X beyond the speed of its current-generation Maxwell processors. NVIDIA CEO and co-founder Jen-Hsun Huang revealed details of Pascal and the company’s updated processor roadmap in front of a crowd of 4,000 during his keynote address at the GPU Technology Conference,…

Deep Learning on the NVIDIA Developer Zone

Developer resources for deep learning, deep neural networks, and more. Find research, presentations, articles, educational links, events, and more.

Unified Memory: Now for CUDA Fortran Programmers

Unified Memory is a CUDA feature that we've talked a lot about on Parallel Forall. CUDA 6 introduced Unified Memory, which dramatically simplifies GPU programming by giving programmers a single poi...

CUDA Pro Tip: Profiling MPI Applications

When I profile an MPI+CUDA application, sometimes performance issues occur only for certain MPI ranks. To fix these, it's necessary to identify the MPI rank where the performance issue occurs. Up to ...

Deep learning with COTS HPC systems

Scaling up deep learning algorithms has been shown to lead to increased performance in benchmark tasks and to enable discovery of complex high-level features. Recent efforts to train extremely larg...

CUDA Toolkit | CUDA 6 Production Release

The NVIDIA® CUDA® Toolkit provides a comprehensive development environment for C and C++ developers building GPU-accelerated applications. The CUDA Toolkit includes a compiler for NVIDIA GPUs, math libraries, and tools for debugging and optimizing the performance of your applications.

CUDA 6, Available as Free Download, Makes Parallel Programming Easier, Faster

Available now to all developers on the CUDA website, the CUDA 6 Release Candidate is packed with several new features that are sure to please developers.
