opencl, opengl, w...
Follow
Find
16.3K views | +7 today
 
Scooped by Mikael Bourges-Sevenier
onto opencl, opengl, webcl, webgl
Scoop.it!

Kernelet: High-Throughput GPU Kernel Executions with Dynamic Slicing and Scheduling | hgpu.org

Kernelet: High-Throughput GPU Kernel Executions with Dynamic Slicing and Scheduling | Computer science, CUDA, nVidia, nVidia GeForce GTX 680, PTX, Task scheduling, Tesla C2050
more...
No comment yet.

From around the web

Your new post is loading...
Your new post is loading...
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging

Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging | opencl, opengl, webcl, webgl | Scoop.it
We describe the neural-network training framework used in the Kaldi speech recognition toolkit, which is geared towards training DNNs with large amounts of training data using multiple GPU-equipped...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Efficient Particle-Mesh Spreading on GPUs

Efficient Particle-Mesh Spreading on GPUs | opencl, opengl, webcl, webgl | Scoop.it
The particle-mesh spreading operation maps a value at an arbitrary particle position to contributions at regular positions on a mesh. This operation is often used when a calculation involving irreg...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Implementing Level-3 BLAS Routines in OpenCL on Different Processing Units

Implementing Level-3 BLAS Routines in OpenCL on Different Processing Units | opencl, opengl, webcl, webgl | Scoop.it
This paper presents an implementation of different matrix-matrix multiplication routines in OpenCL. We utilize the high-performance GEMM (GEneral Matrix-Matrix Multiply) implementation from our pre...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Finding Longest Common Subsequences by GPU-Based Parallel Ant Colony Optimization

Finding Longest Common Subsequences by GPU-Based Parallel Ant Colony Optimization | opencl, opengl, webcl, webgl | Scoop.it
The longest common subsequence (LCS) problem is one of the classic problems in string processing. It is commonly used in file comparison, pattern recognition, and computational biology as a measure...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Computer Architecture and Structured Parallel Programming - insideHPC

Computer Architecture and Structured Parallel Programming - insideHPC | opencl, opengl, webcl, webgl | Scoop.it
In this video, James Reinders presents: Computer Architecture and Structured Parallel Programming.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Medical imaging using CUDA

Medical imaging using CUDA | opencl, opengl, webcl, webgl | Scoop.it
As multiple sclerosis is known to cause atrophy and deformation in the brain, it also influences the shape and size of the corpus callosum. Longitudinal studies try to quantify these changes using ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPGPU Acceleration for Skeletal Animation-comparing OpenCL with CUDA and GLSL

GPGPU Acceleration for Skeletal Animation-comparing OpenCL with CUDA and GLSL | opencl, opengl, webcl, webgl | Scoop.it
The existing matrix palette algorithms for skeletal animation are accelerated by the technique GPGPU based on GLSL or CUDA. Because GLSL is extended from graphics library OpenGL, it couples the ren...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Monitoring Large-scale Microblog on GPUs

Monitoring Large-scale Microblog on GPUs | opencl, opengl, webcl, webgl | Scoop.it
To monitor bad information spreading in microblog system, large-scale data from microblog must be processed in real time. This needs high cost-effective parallel schemes. A parallel processing meth...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Query Optimization in Heterogeneous CPU/GPU Environment for Time Series Databases

Query Optimization in Heterogeneous CPU/GPU Environment for Time Series Databases | opencl, opengl, webcl, webgl | Scoop.it
In recent years, processing and exploration of time series has experienced a noticeable interest. Growing volumes of data and needs of efficient processing pushed the research in new directions, in...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast Parallel Algorithm for Enumerating All Chordless Cycles in Graphs

Fast Parallel Algorithm for Enumerating All Chordless Cycles in Graphs | opencl, opengl, webcl, webgl | Scoop.it
Finding chordless cycles is an important theoretical problem in the Graph Theory area. It also can be applied to practical problems such as discover which predators compete for the same food in eco...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Introducing CURRENNT - the Munich open-source CUDA RecurREnt Neural Network Toolkit

Introducing CURRENNT - the Munich open-source CUDA RecurREnt Neural Network Toolkit | opencl, opengl, webcl, webgl | Scoop.it
In this article, we introduce CURRENNT, an open-source parallel implementation of deep recurrent neural networks (RNNs) supporting graphics processing units (GPUs) through NVIDIA's Computed Unified...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast-Fourier-Transform-Based Electrical Noise Measurements

Fast-Fourier-Transform-Based Electrical Noise Measurements | opencl, opengl, webcl, webgl | Scoop.it
We have shown how the Fourier spectrum and the power spectral density can be estimated in concrete measurements. Moreover, we have derived spectral leakage, which is a systematic error in spectrum ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

TraceParts enables support for WebGL 3D rendering of millions of CAD models on TracePartsOnline.net

TraceParts enables support for WebGL 3D rendering of millions of CAD models on TracePartsOnline.net | opencl, opengl, webcl, webgl | Scoop.it
HTML5 WebGL technology allows detailed 3D part models to be rendered directly in the browser, plugin-free, across any devices Saint Romain, France – October 29, 2014 – TraceParts, a world-leading digital engineering 3D content company, has launched...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Performance Modeling, Optimization, and Characterization on Heterogeneous Architectures

Performance Modeling, Optimization, and Characterization on Heterogeneous Architectures | opencl, opengl, webcl, webgl | Scoop.it
Today, heterogeneous computing has truly reshaped the way scientists think and approach high-performance computing (HPC). Hardware accelerators such as general-purpose graphics processing units (GP...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Sparse Recovery on GPUs: Accelerating the Iterative Soft-Thresholding Algorithm

Sparse Recovery on GPUs: Accelerating the Iterative Soft-Thresholding Algorithm | opencl, opengl, webcl, webgl | Scoop.it
Solving linear inverse problems where the solution is known to be sparse is of interest to both signal processing and machine learning research. The standard algorithms for solving such problems ar...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Contract-Based General-Purpose GPU Programming

Contract-Based General-Purpose GPU Programming | opencl, opengl, webcl, webgl | Scoop.it
Using GPUs as general-purpose processors has revolutionized parallel computing by offering, for a large and growing set of algorithms, massive data-parallelization on desktop machines. As an obstac...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Testing and Exposing Weak Graphics Processing Unit Memory Models

Testing and Exposing Weak Graphics Processing Unit Memory Models | opencl, opengl, webcl, webgl | Scoop.it
Graphics Processing Units (GPUs) are highly parallel shared memory microprocessors, and as such, they are prone to the same concurrency considerations as their traditional multicore CPU counterpart...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

CUVLE: Variable-Length Encoding on CUDA

CUVLE: Variable-Length Encoding on CUDA | opencl, opengl, webcl, webgl | Scoop.it
Data compression is the process of representing information in a compact form, in order to reduce the storage requirements and, hence, communication bandwidth. It has been one of the critical enabl...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Evacuation Route Modeling and Planning with General Purpose GPU Computing

Evacuation Route Modeling and Planning with General Purpose GPU Computing | opencl, opengl, webcl, webgl | Scoop.it
This work introduces a bilevel, stochastic optimization problem aimed at robust, regional evacuation network design and shelter location under uncertain hazards. A regional planner, acting as a Sta...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Improved Integral Histogram Algorithm for Big Sized Images in CUDA Environment

Improved Integral Histogram Algorithm for Big Sized Images in CUDA Environment | opencl, opengl, webcl, webgl | Scoop.it
Although integral histogram enables histogram computation of a sub-area within constant time, construction of the integral histogram requires O(nm) steps for n x m sized image. Such construction ti...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Gaussian Process Models with Parallelization and GPU acceleration

Gaussian Process Models with Parallelization and GPU acceleration | opencl, opengl, webcl, webgl | Scoop.it
In this work, we present an extension of Gaussian process (GP) models with sophisticated parallelization and GPU acceleration. The parallelization scheme arises naturally from the modular computati...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

cufftShift: High Performance CUDA-accelerated FFT-shift Library

cufftShift: High Performance CUDA-accelerated FFT-shift Library | opencl, opengl, webcl, webgl | Scoop.it
For embarrassingly parallel algorithms, a Graphics Processing Unit (GPU) outperforms a traditional CPU on price-per-flop and price-per-watt by at least one order of magnitude. This had led to the m...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Optimization Techniques for Mapping Algorithms and Applications onto CUDA GPU Platforms and CPU-GPU Heterogeneous Platforms

Optimization Techniques for Mapping Algorithms and Applications onto CUDA GPU Platforms and CPU-GPU Heterogeneous Platforms | opencl, opengl, webcl, webgl | Scoop.it
An emerging trend in processor architecture seems to indicate the doubling of the number of cores per chip every two years with same or decreased clock speed. Of particular interest to this thesis ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Metal Kernel Functions / Compute Shaders in Swift | Javalobby

Metal Kernel Functions / Compute Shaders in Swift | Javalobby | opencl, opengl, webcl, webgl | Scoop.it
As part of a project to create a GPU based reaction diffusion simulation, I stated to look at using Metal in Swift this weekend.I've...
more...
No comment yet.