opencl, opengl, webcl, webgl
Scooped by Mikael Bourges-Sevenier

Performance comparison of Lattice Boltzmann fluid flow simulation using OpenCL and CUDA frameworks

This paper presents a performance comparison of a lid-driven cavity flow simulation using the Lattice Boltzmann method between the CUDA and OpenCL parallel programming frameworks. CUDA is parall...

Real-Time Incompressible Fluid Simulation on the GPU

We present a parallel framework for simulating incompressible fluids with predictive-corrective incompressible Smoothed Particle Hydrodynamics (PCISPH) on the GPU in real time. To this end, we prop...

ORNL Introductory Tutorials On Concurrent Kernels - TechEnablement

Oak Ridge has published two very introductory tutorials that teach how to use concurrent kernels on their systems. See TechEnablement for more tutorials.

To Use or Not to Use: Graphics Processing Units for Pattern Matching Algorithms

String matching is an important part of today's computer applications, and the Aho-Corasick algorithm is one of the main string matching algorithms used to accomplish it. This paper discusses that whe...
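
For readers unfamiliar with Aho-Corasick, the following is a minimal CPU-side sketch in Python of the general technique: a trie with failure links that finds all occurrences of several patterns in one pass over the text. It is not the paper's GPU implementation, which the excerpt does not describe. The function names `build_automaton` and `search` are hypothetical, and patterns are assumed to be non-empty strings.

```python
from collections import deque


def build_automaton(patterns):
    """Build trie transitions, failure links and output lists (illustrative names)."""
    goto = [{}]   # goto[state][char] -> next state
    fail = [0]    # fail[state] -> state for the longest proper suffix in the trie
    out = [[]]    # out[state] -> patterns that end at this state
    for p in patterns:
        s = 0
        for ch in p:
            if ch not in goto[s]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[s][ch] = len(goto) - 1
            s = goto[s][ch]
        out[s].append(p)
    # Breadth-first traversal so that shallower states are finished first.
    queue = deque(goto[0].values())
    while queue:
        s = queue.popleft()
        for ch, t in goto[s].items():
            queue.append(t)
            f = fail[s]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[t] = goto[f].get(ch, 0)
            # Inherit matches that end at the failure target.
            out[t] = out[t] + out[fail[t]]
    return goto, fail, out


def search(text, patterns):
    """Return (start_index, pattern) for every occurrence of any pattern in text."""
    goto, fail, out = build_automaton(patterns)
    s = 0
    hits = []
    for i, ch in enumerate(text):
        while s and ch not in goto[s]:
            s = fail[s]
        s = goto[s].get(ch, 0)
        for p in out[s]:
            hits.append((i - len(p) + 1, p))
    return hits


if __name__ == "__main__":
    print(search("ushers", ["he", "she", "his", "hers"]))
```

The text is scanned once regardless of how many patterns there are, which is the property that makes the algorithm a candidate for parallel hardware in the first place.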

Spectral classification using convolutional neural networks

There is a great need for accurate and autonomous spectral classification methods in astrophysics. This thesis is about training a convolutional neural network (ConvNet) to recognize an object clas...

Extending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications

CUDA and OpenCL are the most widely used programming models to exploit hardware accelerators. Both programming models provide a C-based programming language to write accelerator kernels and a host ...

Speed Python Numerical Applications by 2x - 120x With HOPE - TechEnablement

HOPE is an open source specialized method-at-a-time JIT compiler that translates Python source code into C++ and achieves 2x - 120x speedup.

Fast Convolutional Nets With fbfft: A GPU Performance Evaluation

We examine the performance profile of Convolutional Neural Network training on the current generation of NVIDIA Graphics Processing Units. We introduce two new Fast Fourier Transform convolution im...

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

Deep Convolutional Neural Networks (DCNNs) have recently shown state of the art performance in high level vision tasks, such as image classification and object detection. This work brings together ...

Automatic Tuning of Local Memory Use on GPGPUs

The use of local memory is important to improve the performance of OpenCL programs. However, its use may not always benefit performance, depending on various application characteristics, and there ...

Purine: A bi-graph based deep learning framework

In this paper, we introduce a novel deep learning framework, termed Purine. In Purine, a deep network is expressed as a bipartite graph (bi-graph), which is composed of interconnected operators and...

GPU Pro 5: Advanced Rendering Techniques

In GPU Pro 5: Advanced Rendering Techniques, section editors Wolfgang Engel, Christopher Oat, Carsten Dachsbacher, Michal Valient, Wessam Bahnassi, and Marius Bjorge have once again assembled a high...

A Parallel Recursive Approach for Solving All Pairs Shortest Path Problem on GPU using OpenCL

The all-pairs shortest path (APSP) problem has a large number of practical applications in the real world. We present a highly parallel and recursive solution to the APSP problem based on Klee...

Customization of OpenCL Applications for Efficient Task Mapping under Heterogeneous Platform Constraints

When targeting an OpenCL application to platforms with multiple heterogeneous accelerators, task tuning and mapping have to cope with device-specific constraints. To address this problem, we presen...

Facebook Open Source GPU FFT 1.5x Faster Than NVIDIA CUFFT - TechEnablement

Facebook has written a Fast Fourier Transform (fbfft) that is 1.5x faster than the NVIDIA CUFFT implementation at sizes 8-64.

Video: A Short Introduction to High Performance Computing - insideHPC

In this video, Dr. Andrew Turner from EPCC in the UK presents: A Short Introduction to High Performance Computing.

How to Correctly Deal With Pseudorandom Numbers in Manycore Environments - Application to GPU programming with Shoverand

Stochastic simulations are often sensitive to the source of randomness that characterizes the statistical quality of their results. Consequently, we need highly reliable Random Number Generators (R...

A Tool for Automatic Suggestions for Irregular GPU Kernel Optimization

Future computing systems, from handhelds all the way to supercomputers, will be more parallel and more heterogeneous than today's systems to provide more performance without an increase in power co...

Characterization of OpenCL on a Scalable FPGA Architecture

The recent release of Altera's SDK for OpenCL has greatly eased the development of FPGA-based systems. Research has shown performance improvements brought by OpenCL using a single FPGA device. How...

Machine Learning: What Computational Researchers Need to Know

Nvidia GPUs are powering a revolution in machine learning.

Computationally Efficient Implementation of a Hamming Code Decoder using a Graphics Processing Unit

This paper presents a computationally efficient implementation of a Hamming code decoder on a graphics processing unit (GPU) to support real-time software-defined radio (SDR), which is a software a...
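
As a rough illustration of what a Hamming decoder computes, here is a minimal CPU sketch of Hamming(7,4) syndrome decoding in Python. It is not the paper's GPU implementation, which the excerpt does not describe, and the paper may use a different code length. The function names and the bit layout `[p1, p2, d1, p3, d2, d3, d4]` are assumptions chosen for this example.

```python
def hamming74_encode(data):
    """Encode 4 data bits into a 7-bit codeword laid out as [p1, p2, d1, p3, d2, d3, d4]."""
    d1, d2, d3, d4 = data
    p1 = d1 ^ d2 ^ d4   # parity over codeword positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4   # parity over codeword positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4   # parity over codeword positions 4, 5, 6, 7
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_decode(codeword):
    """Correct at most one flipped bit.

    Returns (data_bits, error_position), where error_position is 1-based
    and 0 means the syndrome indicated no error.
    """
    c = list(codeword)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    position = s1 + 2 * s2 + 4 * s3   # the syndrome read as a binary number
    if position:
        c[position - 1] ^= 1          # flip the bit the syndrome points at
    return [c[2], c[4], c[5], c[6]], position


if __name__ == "__main__":
    word = hamming74_encode([1, 0, 1, 1])
    word[4] ^= 1                      # simulate a single-bit channel error
    print(hamming74_decode(word))
```

Each codeword is decoded independently of every other one, which is why block decoders of this kind map naturally onto one-thread-per-codeword parallelism.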

Accelerating Correlation Power Analysis Using Graphics Processing Units

Correlation Power Analysis (CPA) is a type of power analysis based side channel attack that can be used to derive the secret key of encryption algorithms including DES (Data Encryption Standard) an...

IPMACC - An Open Source OpenACC to CUDA/OpenCL Translator - TechEnablement

IPMACC is a research-grade open-source framework for translating OpenACC source code to CUDA or OpenCL. Binary executables can then be created with OpenCL or CUDA compilers.

Legion: Programming Distributed Heterogeneous Architectures with Logical Regions

This thesis covers the design and implementation of Legion, a new programming model and runtime system for targeting distributed heterogeneous machine architectures. Legion introduces logical regio...

SignalPU: A programming model for DSP applications on parallel and heterogeneous clusters

Biomedical imaging, digital communications, acoustic signal processing, and many other digital signal processing (DSP) applications are present more and more every day in the numeric wor...