opencl, opengl, w...
Follow
Find
16.2K views | +0 today
 
Scooped by Mikael Bourges-Sevenier
onto opencl, opengl, webcl, webgl
Scoop.it!

Interactive Refactoring for GPU Parallelization of Affine Loops | hgpu.org

Interactive Refactoring for GPU Parallelization of Affine Loops | Code generation, Computer science, CUDA, Heterogeneous systems, nVidia, nVidia Quadro FX 2000
more...
No comment yet.

From around the web

Your new post is loading...
Your new post is loading...
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Improved Integral Histogram Algorithm for Big Sized Images in CUDA Environment

Improved Integral Histogram Algorithm for Big Sized Images in CUDA Environment | opencl, opengl, webcl, webgl | Scoop.it
Although integral histogram enables histogram computation of a sub-area within constant time, construction of the integral histogram requires O(nm) steps for n x m sized image. Such construction ti...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Gaussian Process Models with Parallelization and GPU acceleration

Gaussian Process Models with Parallelization and GPU acceleration | opencl, opengl, webcl, webgl | Scoop.it
In this work, we present an extension of Gaussian process (GP) models with sophisticated parallelization and GPU acceleration. The parallelization scheme arises naturally from the modular computati...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

cufftShift: High Performance CUDA-accelerated FFT-shift Library

cufftShift: High Performance CUDA-accelerated FFT-shift Library | opencl, opengl, webcl, webgl | Scoop.it
For embarrassingly parallel algorithms, a Graphics Processing Unit (GPU) outperforms a traditional CPU on price-per-flop and price-per-watt by at least one order of magnitude. This had led to the m...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Optimization Techniques for Mapping Algorithms and Applications onto CUDA GPU Platforms and CPU-GPU Heterogeneous Platforms

Optimization Techniques for Mapping Algorithms and Applications onto CUDA GPU Platforms and CPU-GPU Heterogeneous Platforms | opencl, opengl, webcl, webgl | Scoop.it
An emerging trend in processor architecture seems to indicate the doubling of the number of cores per chip every two years with same or decreased clock speed. Of particular interest to this thesis ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Metal Kernel Functions / Compute Shaders in Swift | Javalobby

Metal Kernel Functions / Compute Shaders in Swift | Javalobby | opencl, opengl, webcl, webgl | Scoop.it
As part of a project to create a GPU based reaction diffusion simulation, I stated to look at using Metal in Swift this weekend.I've...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Heterogeneous computing with an algorithmic skeleton framework

Heterogeneous computing with an algorithmic skeleton framework | opencl, opengl, webcl, webgl | Scoop.it
The Graphics Processing Unit (GPU) is present in almost every modern day personal computer. Despite its specific purpose design, they have been increasingly used for general computations with very ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Constructing Long Short-Term Memory based Deep Recurrent Neural Networks for Large Vocabulary Speech Recognition

Constructing Long Short-Term Memory based Deep Recurrent Neural Networks for Large Vocabulary Speech Recognition | opencl, opengl, webcl, webgl | Scoop.it
Long short-term memory (LSTM) based acoustic modeling methods have recently been shown to give state-of-the-art performance on some speech recognition tasks. To achieve a further performance improv...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Review of CUDA, MapReduce, and Pthreads Parallel Computing Models

A Review of CUDA, MapReduce, and Pthreads Parallel Computing Models | opencl, opengl, webcl, webgl | Scoop.it
The advent of high performance computing (HPC) and graphics processing units (GPU), present an enormous computation resource for Large data transactions (big data) that require parallel processing ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPUdb: A Distributed Database for Many-Core Devices - insideHPC

GPUdb: A Distributed Database for Many-Core Devices - insideHPC | opencl, opengl, webcl, webgl | Scoop.it
GPUdb is a scalable, distributed database with SQL-style query capability, capable of storing Big Data.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Parallel Programming and Compressed Material Data for an Eulerian Code

Parallel Programming and Compressed Material Data for an Eulerian Code | opencl, opengl, webcl, webgl | Scoop.it
We describe the problem of iterating over mesh zones and iterating over material data within a zone, in the context of relatively new compute architectures. We present an example for how this can b...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

The Distribution of OpenCL Kernel Execution Across Multiple Devices

The Distribution of OpenCL Kernel Execution Across Multiple Devices | opencl, opengl, webcl, webgl | Scoop.it
Many computer systems now include both CPUs and programmable GPUs. OpenCL, a new programming framework, can program individual CPUs or GPUs; however, distributing a problem across multiple devices ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Parallel Algorithms for the Summed Area Table on the Asynchronous Hierarchical Memory Machine, with GPU implementations

Parallel Algorithms for the Summed Area Table on the Asynchronous Hierarchical Memory Machine, with GPU implementations | opencl, opengl, webcl, webgl | Scoop.it
The Hierarchical Memory Machine (HMM) is a theoretical parallel computing model that captures the essence of computing on CUDA-enabled GPUs. The summed area table (SAT) of a matrix is a data struct...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Synthetic Aperture Radar imaging on a CUDA-enabled mobile platform

Synthetic Aperture Radar imaging on a CUDA-enabled mobile platform | opencl, opengl, webcl, webgl | Scoop.it
This paper presents the details of a Synthetic Aperture Radar (SAR) imaging on the smallest CUDA-capable platform available, the Jetson TK1. The results indicate that GPU accelerated embedded platf...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Monitoring Large-scale Microblog on GPUs

Monitoring Large-scale Microblog on GPUs | opencl, opengl, webcl, webgl | Scoop.it
To monitor bad information spreading in microblog system, large-scale data from microblog must be processed in real time. This needs high cost-effective parallel schemes. A parallel processing meth...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Query Optimization in Heterogeneous CPU/GPU Environment for Time Series Databases

Query Optimization in Heterogeneous CPU/GPU Environment for Time Series Databases | opencl, opengl, webcl, webgl | Scoop.it
In recent years, processing and exploration of time series has experienced a noticeable interest. Growing volumes of data and needs of efficient processing pushed the research in new directions, in...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast Parallel Algorithm for Enumerating All Chordless Cycles in Graphs

Fast Parallel Algorithm for Enumerating All Chordless Cycles in Graphs | opencl, opengl, webcl, webgl | Scoop.it
Finding chordless cycles is an important theoretical problem in the Graph Theory area. It also can be applied to practical problems such as discover which predators compete for the same food in eco...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Introducing CURRENNT - the Munich open-source CUDA RecurREnt Neural Network Toolkit

Introducing CURRENNT - the Munich open-source CUDA RecurREnt Neural Network Toolkit | opencl, opengl, webcl, webgl | Scoop.it
In this article, we introduce CURRENNT, an open-source parallel implementation of deep recurrent neural networks (RNNs) supporting graphics processing units (GPUs) through NVIDIA's Computed Unified...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast-Fourier-Transform-Based Electrical Noise Measurements

Fast-Fourier-Transform-Based Electrical Noise Measurements | opencl, opengl, webcl, webgl | Scoop.it
We have shown how the Fourier spectrum and the power spectral density can be estimated in concrete measurements. Moreover, we have derived spectral leakage, which is a systematic error in spectrum ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Performance Comparison of Sort and Scan Libraries for GPUs

A Performance Comparison of Sort and Scan Libraries for GPUs | opencl, opengl, webcl, webgl | Scoop.it
Sorting and scanning are two fundamental primitives for constructing highly parallel algorithms. A number of libraries now provide implementations of these primitives for GPUs, but there is relativ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Hybrid CPU-GPU Implementation of Tracking-Learning-Detection Algorithm

Hybrid CPU-GPU Implementation of Tracking-Learning-Detection Algorithm | opencl, opengl, webcl, webgl | Scoop.it
Tracking objects in a video stream is an important problem in robot learning (learning an object's visual features from different perspectives as it moves, rotates, scales, and is subjected to some...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Corel(R) AfterShot(TM) Pro 2.1 Brings HDR to Mac and Linux, Adds Support for 17 Cameras

Corel(R) AfterShot(TM) Pro 2.1 Brings HDR to Mac and Linux, Adds Support for 17 Cameras | opencl, opengl, webcl, webgl | Scoop.it
OTTAWA, ONTARIO--(Marketwired - Oct. 16, 2014) - Editors Note: There are three photos associated with this press release.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units

Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units | opencl, opengl, webcl, webgl | Scoop.it
We revisit the implementation of iterative solvers on discrete graphics processing units and demonstrate the benefit of implementations using extensive kernel fusion for pipelined formulations over...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

OpenCL Implementation of Montgomery Multiplication on FPGA

OpenCL Implementation of Montgomery Multiplication on FPGA | opencl, opengl, webcl, webgl | Scoop.it
Galois Field arithmetic has been used very frequently in popular security and error-correction applications. Montgomery multiplication is among the suitable methods used for accelerating modular mu...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Preparing for Manycore

Preparing for Manycore | opencl, opengl, webcl, webgl | Scoop.it
One of several insightful presentations to come out of the DOE Computational Science Graduate Fellowship was delivered by Katie Antypas, Services Departmen
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Random Address Permute-Shift Technique for the Shared Memory on GPUs

Random Address Permute-Shift Technique for the Shared Memory on GPUs | opencl, opengl, webcl, webgl | Scoop.it
The Discrete Memory Machine (DMM) is a theoretical parallel computing model that captures the essence of memory access to the shared memory of a streaming multiprocessor on CUDA-enabled GPUs. The D...
more...
No comment yet.