opencl, opengl, webcl, webgl
Scooped by Mikael Bourges-Sevenier

Overhauling SC atomics in C11 and OpenCL

Despite the conceptual simplicity of sequential consistency (SC), the semantics of SC atomic operations and fences in the C11 and OpenCL memory models is subtle, leading to convoluted prose descrip...

Introducing the NVIDIA OpenACC Toolkit

Programmability is crucial to accelerated computing, and NVIDIA's CUDA Toolkit has been critical to the success of GPU computing. Over 3 million CUDA Toolkits have been downloaded since its first l...

A Study of Data Partitioning on OpenCL-based FPGAs

A lot of research efforts have been devoted to accelerating relational database applications on FPGAs, due to their high energy efficiency and high throughput. Most of the existing studies are base...

Optimization, Specification and Verification of the Prefix Sum Program in an OpenCL Environment

The Prefix Sum is an algorithm used as a building block for various other algorithms, for example radix sort, quicksort and lexically comparing strings. Implementing the Prefix Sum algorithm on the...

Getting Started with OpenACC

This week NVIDIA has released the NVIDIA OpenACC Toolkit, a starting point for anyone interested in using OpenACC. OpenACC gives scientists and researchers a simple and powerful way to accelerate s...

Towards Good Practices for Very Deep Two-Stream ConvNets

Deep convolutional networks have achieved great success for object recognition in still images. However, for action recognition in videos, the improvement of deep convolutional networks is not so e...

Characterizing and Optimizing Irregular Applications on Graphics Processing Units

In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput computing devices. Applications from various domains achieve significant speedups using GPGPUs. How...

Many-Core Compiler Fuzzing

We address the compiler correctness problem for many-core systems through novel applications of fuzz testing to OpenCL compilers. Focusing on two methods from prior work, random differential testin...

Sorting and Permuting without Bank Conflicts on GPUs

In this paper, we look at the complexity of designing algorithms without any bank conflicts in the shared memory of Graphical Processing Units (GPUs). Given input of size $n$, $w$ processors and $w...

Autotuning OpenACC Work Distribution via Direct Search

OpenACC provides a high-productivity API for programming GPUs and similar accelerator devices. One of the last steps in tuning OpenACC programs is selecting values for the num_gangs and vector leng...

An Interesting Interview About The Vulkan API - Phoronix

Phoronix is the leading technology website for Linux hardware reviews, open-source news, Linux benchmarks, open-source benchmarks, and computer hardware tests.

LTTng CLUST: A system-wide unified CPU and GPU tracing tool for OpenCL applications

As computation schemes evolve and many new tools become available to programmers to enhance the performance of their applications, many programmers started to look towards highly parallel platforms...

Mega-KV: A Case for GPUs to Maximize the Throughput of In-Memory Key-Value Stores

In-memory key-value stores play a critical role in data processing to provide high throughput and low latency data accesses. In-memory key-value stores have several unique properties that include (...

10 Fastest Machines Look Familiar on New TOP500 List

The latest TOP500 list of the world's fastest supercomputers was released this morning at the ISC 2015 conference in Frankfurt, Germany.

OpenGL ES 3.0 Cookbook: Parminder Singh: 9781849695527: Amazon.com: Books

OpenGL ES 3.0 Cookbook, by Parminder Singh, on Amazon.com.

Evaluating the capabilities of the Xeon Phi platform in the context of software-only, thread-level speculation

Intel Xeon Phi accelerators are one of the newest devices used in the field of parallel computing. However, there are comparatively few studies concerning their performance when using most of the e...

PLB-HeC: A Profile-based Load-Balancing algorithm for Heterogeneous CPU-GPU Clusters

The use of GPU clusters for scientific applications in areas such as physics, chemistry and bioinformatics is becoming more widespread. These clusters frequently have different types of processing ...

AMD Delivers World’s First Server GPU with Industry-Leading 32GB Memory for High Performance Compute

AMD today announced the new AMD FirePro™ S9170 server GPU, the world’s first and fastest 32GB single-GPU server card for DGEMM-heavy double-precision workloads, with support for OpenCL™ 2.0.

Multiple String Matching on a GPU using CUDAs

Multiple pattern matching algorithms are used to locate the occurrences of patterns from a finite pattern set in a large input string. Aho-Corasick, Set Horspool, Set Backward Oracle Matching, Wu-M...

Contributions to Music Semantic Analysis and Its Acceleration Techniques

Digitalized music production has exploded in the past decade. The huge amount of data drives the development of effective and efficient methods for automatic music analysis and retrieval. This thesis focus...

Experiments on Parallel Training of Deep Neural Network using Model Averaging

In this work we apply model averaging to parallel training of deep neural network (DNN). Parallelization is done in a model averaging manner. Data is partitioned and distributed to different nodes ...

Learning Better Encoding for Approximate Nearest Neighbor Search with Dictionary Annealing

We introduce a novel dictionary optimization method for high-dimensional vector quantization employed in approximate nearest neighbor (ANN) search. Vector quantization methods first seek a series o...

Nvidia Speeds Up Deep Learning Software

Today Nvidia updated its GPU-accelerated deep learning software to accelerate deep learning training performance.

Hetero-DB: Next Generation High-Performance Database Systems by Best Utilizing Heterogeneous Computing and Storage Resources

With recent advancement on hardware technologies, new general-purpose high-performance devices have been widely adopted, such as the graphics processing unit (GPU) and solid state drive (SSD). GPU ...