opencl, opengl, webcl, webgl
Scooped by Mikael Bourges-Sevenier

Compiler-Level Explicit Cache for a GPGPU Programming Framework

GPUs are widely used for high-performance computing. However, standard programming frameworks such as CUDA and OpenCL require low-level specifications, thus programming is difficult and the performan...

Real-Time Grasp Detection Using Convolutional Neural Networks

We present an accurate, real-time approach to robotic grasp detection based on convolutional neural networks. Our network performs single-stage regression to graspable bounding boxes without using ...

Portable OpenCL Out-of-Order Execution Framework for Heterogeneous Platforms

Heterogeneous computing has become a viable option in seeking computing performance, to the side of conventional homogeneous multi-/single-processor approaches. The advantage of heterogeneity is th...

Theano-based Large-Scale Visual Recognition with Multiple GPUs

In this report, we describe a Theano-based AlexNet (Krizhevsky et al., 2012) implementation and its naive data parallelism on multiple GPUs. Our performance on 2 GPUs is comparable with the state-o...

Neural Networks through Shared Maps in Mobile Devices

We introduce a hybrid system composed of a convolutional neural network and a discrete graphical model for image recognition. This system improves upon traditional sliding window techniques for ana...

Massively Parallel A* Search on a GPU

A* search is a fundamental topic in artificial intelligence. Recently, general-purpose computation on graphics processing units (GPGPU) has been widely used to accelerate numerous computational...

IPMACC: Open Source OpenACC to CUDA/OpenCL Translator

In this paper we introduce IPMACC, a framework for translating OpenACC applications to CUDA or OpenCL. IPMACC is composed of a set of translators translating OpenACC for C applications to CUDA or Ope...

OpenCL™ Device Fission for CPU Performance - CodeProject

The Intel SDK for OpenCL Applications provides a rich mix of OpenCL extensions and optional features that are designed for developers who want to utilize all resources available on Intel CPUs.

GPU accelerated feature algorithms for mobile devices

Mobile devices offer many new avenues for computer vision and in particular mobile augmented reality applications that have not been feasible with desktop computers. The motivation for this researc...

An Approach for Maximizing Performance on Heterogeneous Clusters of CPU and GPU

Over the past years there has been significant enthusiasm for development of parallel computing on Graphics Processing Units (GPU) which have now become powerful and affordable hardware equipping d...

Real-Time Hair Rendering

An approach is presented to render hair in real-time by using a small number of guide strands to generate interpolated hairs on the graphics processing unit (GPU). Hair interpolation methods are ...

Implementation of k-Means Clustering Algorithm in CUDA

Big Data poses a very great computational challenge for programmers as well as machines, as a lot of number crunching has to be done. Due to recent development in the shared memory inexpensive archite...

cuLGT: Lattice Gauge Fixing on GPUs

We adopt CUDA-capable Graphic Processing Units (GPUs) for Landau, Coulomb and maximally Abelian gauge fixing in 3+1 dimensional SU(3) and SU(2) lattice gauge field theories. A combination of simula...

ArrayFire: A Portable Open-Source Accelerated Computing Library

The ArrayFire library is a high-performance software library with a focus on portability and productivity. It supports highly tuned, GPU-accelerated algorithms using an easy-to-use API. ArrayFire w...

XIII International Conference on Parallel Processing, ICPP 2015

The ICPP 2015 : XIII International Conference on Parallel Processing is the premier interdisciplinary forum for the presentation of new advances and research results in the fields of Parallel Proce...

Big Integer Multiplication with CUDA FFT (cuFFT) Library

It is well recognized in the computer algebra theory and systems communities that the Fast Fourier Transform (FFT) can be used for multiplying polynomials. Theory predicts that it is fast for "...

CL3VER joins Amazon Web Services Partner Network

CL3VER joining the APN demonstrates that interactive 3D visualization for professional architects and manufacturers can be provided via the cloud.

CUBPT: Lock-free bulk insertions to B+ tree on GPU architecture

The B+-tree is one of the most widely used index structures. To improve the insertion process, several batch algorithms have been proposed, which all use one thread to complete one node insertion and cannot make...

Scalability and Optimization Strategies for GPU Enhanced Neural Networks (GeNN)

Simulation of spiking neural networks has been traditionally done on high-performance supercomputers or large-scale clusters. Utilizing the parallel nature of neural network computation algorithms,...

SiftCU: An Accelerated Cuda Based Implementation of SIFT

Scale Invariant Feature Transform (SIFT) is a popular image feature extraction algorithm. SIFT's features are invariant to many image related variables including scale and change in viewpoint. Desp...

24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs

We have simulated, for the first time, the long term evolution of the Milky Way Galaxy using 51 billion particles on the Swiss Piz Daint supercomputer with our N-body gravitational tree-code Bons...