opencl, opengl, w...
Follow
Find
17.3K views | +3 today
 
Scooped by Mikael Bourges-Sevenier
onto opencl, opengl, webcl, webgl
Scoop.it!

Data Structures and Algorithms for Counting Problems on Graphs using GPU | hgpu.org

Data Structures and Algorithms for Counting Problems on Graphs using GPU | Algorithms, Computer science, CUDA, Data Structures and Algorithms, nVidia, Tesla C1060, Tesla C2050, Tesla C2070
more...
No comment yet.
Your new post is loading...
Your new post is loading...
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast Convolutional Nets With fbfft: A GPU Performance Evaluation

Fast Convolutional Nets With fbfft: A GPU Performance Evaluation | opencl, opengl, webcl, webgl | Scoop.it
We examine the performance profile of Convolutional Neural Network training on the current generation of NVIDIA Graphics Processing Units. We introduce two new Fast Fourier Transform convolution im...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs | opencl, opengl, webcl, webgl | Scoop.it
Deep Convolutional Neural Networks (DCNNs) have recently shown state of the art performance in high level vision tasks, such as image classification and object detection. This work brings together ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Automatic Tuning of Local Memory Use on GPGPUs

Automatic Tuning of Local Memory Use on GPGPUs | opencl, opengl, webcl, webgl | Scoop.it
The use of local memory is important to improve the performance of OpenCL programs. However, its use may not always benefit performance, depending on various application characteristics, and there ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Purine: A bi-graph based deep learning framework

Purine: A bi-graph based deep learning framework | opencl, opengl, webcl, webgl | Scoop.it
In this paper, we introduce a novel deep learning framework, termed Purine. In Purine, a deep network is expressed as a bipartite graph (bi-graph), which is composed of interconnected operators and...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPU Pro 5: Advanced Rendering Techniques

GPU Pro 5: Advanced Rendering Techniques | opencl, opengl, webcl, webgl | Scoop.it
In GPU Pro5: Advanced Rendering Techniques, section editors Wolfgang Engel, Christopher Oat, Carsten Dachsbacher, Michal Valient, Wessam Bahnassi, and Marius Bjorge have once again assembled a high...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Parallel Recursive Approach for Solving All Pairs Shortest Path Problem on GPU using OpenCL

A Parallel Recursive Approach for Solving All Pairs Shortest Path Problem on GPU using OpenCL | opencl, opengl, webcl, webgl | Scoop.it
All-pairs shortest path problem(APSP) finds a large number of practical applications in real world. We owe to present a highly parallel and recursive solution for solving APSP problem based on Klee...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Efficient Workload Balancing on Heterogeneous GPUs using Mixed-Integer Non-Linear Programming

Efficient Workload Balancing on Heterogeneous GPUs using Mixed-Integer Non-Linear Programming | opencl, opengl, webcl, webgl | Scoop.it
Recently, heterogeneous system architectures are becoming mainstream for achieving high performance and power efficiency. In particular, many-core graphics processing units (GPUs) now play an impor...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Efficient GPU Implementation for Single Block Orthogonal Dictionary Learning

Efficient GPU Implementation for Single Block Orthogonal Dictionary Learning | opencl, opengl, webcl, webgl | Scoop.it
Dictionary training for sparse representations involves dealing with large chunks of data and complex algorithms that determine time consuming implementations. SBO is an iterative dictionary learni...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Performance Comparison of GPUs with a Genetic Algorithm based on CUDA

Performance Comparison of GPUs with a Genetic Algorithm based on CUDA | opencl, opengl, webcl, webgl | Scoop.it
Generally genetic algorithm (GA) has disadvantage of taking a lot of computation time, and it is worth reducing the execution time while keeping good quality and result. Comparative experiments are...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

MatConvNet - Convolutional Neural Networks for MATLAB

MatConvNet - Convolutional Neural Networks for MATLAB | opencl, opengl, webcl, webgl | Scoop.it
MatConvNet is an implementation of Convolutional Neural Networks (CNNs) for MATLAB. The toolbox is designed with an emphasis on simplicity and flexibility. It exposes the building blocks of CNNs as...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Multi-Centroid PSO Classification Learning on the GPU

Multi-Centroid PSO Classification Learning on the GPU | opencl, opengl, webcl, webgl | Scoop.it
Training classifiers can be seen as an optimization problem. With this view, we have developed a method to train a type of nearest centroid classifier with PSO. Results showed an improvement on mos...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Computationally Efficient Implementation of a Hamming Code Decoder using a Graphics Processing Unit

Computationally Efficient Implementation of a Hamming Code Decoder using a Graphics Processing Unit | opencl, opengl, webcl, webgl | Scoop.it
This paper presents a computationally efficient implementation of a Hamming code decoder on a graphics processing unit (GPU) to support real-time software-defined radio (SDR), which is a software a...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Accelerating Correlation Power Analysis Using Graphics Processing Units

Accelerating Correlation Power Analysis Using Graphics Processing Units | opencl, opengl, webcl, webgl | Scoop.it
Correlation Power Analysis (CPA) is a type of power analysis based side channel attack that can be used to derive the secret key of encryption algorithms including DES (Data Encryption Standard) an...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

IPMACC - An Open Source OpenACC to CUDA/OpenCL Translator - TechEnablement

IPMACC - An Open Source OpenACC to CUDA/OpenCL Translator - TechEnablement | opencl, opengl, webcl, webgl | Scoop.it
IPMACC is a research-grade open-source framework for translating OpenACC source code to CUDA or OpenCL. Binary executables can then be created with OpenCL or CUDA compilers.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Legion: Programming Distributed Heterogeneous Architectures with Logical Regions

Legion: Programming Distributed Heterogeneous Architectures with Logical Regions | opencl, opengl, webcl, webgl | Scoop.it
This thesis covers the design and implementation of Legion, a new programming model and runtime system for targeting distributed heterogeneous machine architectures. Legion introduces logical regio...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

SignalPU: A programming model for DSP applications on parallel and heterogeneous clusters

SignalPU: A programming model for DSP applications on parallel and heterogeneous clusters | opencl, opengl, webcl, webgl | Scoop.it
The biomedical imagery, the numeric communications, the acoustic signal processing and many others digital signal processing applications (DSP) are present more and more everyday in the numeric wor...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Review on Parallelization of Node based Game Tree Search Algorithms on GPU

A Review on Parallelization of Node based Game Tree Search Algorithms on GPU | opencl, opengl, webcl, webgl | Scoop.it
Game tree search is a classical problem in the field of game theory and artificial intelligence. Focus of the system is on how to leverage massive parallelism capabilities of GPUs to accelerate the...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

DeepSpeech: Scaling up end-to-end speech recognition

DeepSpeech: Scaling up end-to-end speech recognition | opencl, opengl, webcl, webgl | Scoop.it
We present a state-of-the-art speech recognition system developed using end-to-end deep learning. Our architecture is significantly simpler than traditional speech systems, which rely on laboriousl...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Multicore Scheduling of Parallel Real-Time Tasks with Multiple Parallelization Options

Multicore Scheduling of Parallel Real-Time Tasks with Multiple Parallelization Options | opencl, opengl, webcl, webgl | Scoop.it
Past researches on multicore scheduling assume that a computational unit has already been parallelized into a prefixed number of threads. However, with recent technologies such as OpenCL, a computa...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Intel Xeon Phi Used in Australia's DUG Oil and Gas Supercomputer - TechEnablement

Intel Xeon Phi Used in Australia's DUG Oil and Gas Supercomputer - TechEnablement | opencl, opengl, webcl, webgl | Scoop.it
Equivalent to the 8th largest system in the world, Australia's DownUnder GeoSolutions has purchased a Xeon Phi equipped supercomputer from SGI for Oil and Gas.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Easy-to-Use On-the-Fly Binary Program Acceleration on Many-Cores

Easy-to-Use On-the-Fly Binary Program Acceleration on Many-Cores | opencl, opengl, webcl, webgl | Scoop.it
This paper introduces Binary Acceleration At Runtime (BAAR), an easy-to-use on-the-fly binary acceleration mechanism which aims to tackle the problem of enabling existent software to automatically ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Our PowerVR Graphics SDK is now on GitHub - Imagination Blog

Our PowerVR Graphics SDK is now on GitHub - Imagination Blog | opencl, opengl, webcl, webgl | Scoop.it
We have started using GitHub as an alternative distribution channel for our PowerVR Graphics SDK.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

An Optimized GPU Memory Hierarchy Design for an OpenCL Kernel

An Optimized GPU Memory Hierarchy Design for an OpenCL Kernel | opencl, opengl, webcl, webgl | Scoop.it
With the advent of multi and many-core processors, communication has replaced computation as the performance bottleneck. Most current approaches to the problem try to tolerate memory access latency...
more...
No comment yet.