opencl, opengl, w...
Follow
Find
19.2K views | +7 today
Your new post is loading...
Your new post is loading...
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Parallel Spectral Graph Partitioning on CUDA

Parallel Spectral Graph Partitioning on CUDA | opencl, opengl, webcl, webgl | Scoop.it
Parallelization of scientific problems is a challenging task which has a wide application area both on distributed programming, cloud computing and recently on GPGPU. Spectral graph partitioning is...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Multi-Elimination ILU Preconditioners on GPUs

Multi-Elimination ILU Preconditioners on GPUs | opencl, opengl, webcl, webgl | Scoop.it
Iterative solvers for sparse linear systems often benefit from using preconditioners. While there are implementations for many iterative methods that leverage the computing power of accelerators, p...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Real Time Face Detection on GPU Using OpenCL

Real Time Face Detection on GPU Using OpenCL | opencl, opengl, webcl, webgl | Scoop.it
This paper presents a novel approach for real time face detection using heterogeneous computing. The algorithm uses local binary pattern (LBP) as feature vector for face detection. OpenCL is used t...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast Boolean Calculations Using the GPU

Fast Boolean Calculations Using the GPU | opencl, opengl, webcl, webgl | Scoop.it
The growing number of Boolean variables requires very efficient approaches to solve the given tasks. We explore the utilization of the GPU for fast parallel Boolean calculations in this paper. Hund...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPU Programming with CUDA: A brief overview

GPU Programming with CUDA: A brief overview | opencl, opengl, webcl, webgl | Scoop.it
In this paper we describe the architecture of a NVIDIA GPU, as well as the CUDA programming model. The basic statements are explained. We also provide an example of CUDA code, explaining its execut...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Similarity-Based Analysis Tool for Scientific Application Porting

A Similarity-Based Analysis Tool for Scientific Application Porting | opencl, opengl, webcl, webgl | Scoop.it
Porting applications to a new system is a nontrivial job in the HPC field. It is a very time-consuming, labor-intensive process, and the quality of the results will depend critically on the experie...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast American Basket Option Pricing on a multi-GPU Cluster

Fast American Basket Option Pricing on a multi-GPU Cluster | opencl, opengl, webcl, webgl | Scoop.it
This article presents a multi-GPU adaptation of a specific Monte Carlo and classification based method for pricing American basket options, due to Picazo. The first part relates how to combine fine...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Optimizing Performance of Stencil Code with SPL Conqueror

Optimizing Performance of Stencil Code with SPL Conqueror | opencl, opengl, webcl, webgl | Scoop.it
A standard technique to numerically solve elliptic partial differential equations on structured grids is to discretize them via finite differences and then to apply an efficient geometric multi-gri...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Application of the Characteristic Basis Function Method using CUDA

Application of the Characteristic Basis Function Method using CUDA | opencl, opengl, webcl, webgl | Scoop.it
The Characteristic Basis Function Method (CBFM) is a popular technique for efficiently solving the Method of Moments (MoM) matrix equations. In this work, we address the adaptation of this method t...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Accelerator Aware MPI Micro-benchmarking using CUDA, OpenACC and OpenCL

Accelerator Aware MPI Micro-benchmarking using CUDA, OpenACC and OpenCL | opencl, opengl, webcl, webgl | Scoop.it
Recently MPI implementations have been extended to support accelerator devices, Intel Many Integrated Core (MIC) and nVidia GPU. This has been accomplished by changes to different levels of the sof...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

CRVI/OpenCLIPP

CRVI/OpenCLIPP | opencl, opengl, webcl, webgl | Scoop.it
OpenCLIPP - OpenCL Integrated Performance Primitives - A library of optimized OpenCL image processing functions
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Multi-Kepler GPU vs. Multi-Intel MIC for spin systems simulations

Multi-Kepler GPU vs. Multi-Intel MIC for spin systems simulations | opencl, opengl, webcl, webgl | Scoop.it
We present and compare the performances of two many-core architectures: the Nvidia Kepler and the Intel MIC both in a single system and in cluster configuration for the simulation of spin systems. ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Effective Multi-Modal Retrieval based on Stacked Auto-Encoders

Effective Multi-Modal Retrieval based on Stacked Auto-Encoders | opencl, opengl, webcl, webgl | Scoop.it
Multi-modal retrieval is emerging as a new search paradigm that enables seamless information retrieval from various types of media. For example, users can simply snap a movie poster to search relev...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

WPA/WPA2 Password Security Testing using Graphics Processing Units

WPA/WPA2 Password Security Testing using Graphics Processing Units | opencl, opengl, webcl, webgl | Scoop.it
This thesis focuses on the testing of WPA/WPA 2 password strength. Recently, due to progress in calculation power and technology, new factors must be taken into account when choosing a WPA/WPA2 sec...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Accelerating Content-Based Image Retrieval via GPU-adaptive Index Structure

Accelerating Content-Based Image Retrieval via GPU-adaptive Index Structure | opencl, opengl, webcl, webgl | Scoop.it
A tremendous amount of work has been conducted in content-based image retrieval (CBIR) on designing efficient index structure to accelerate the retrieval process. Most of them improve the retrieval...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Performance Impact of Data Layout on the GPU-accelerated IDW Interpolation

Performance Impact of Data Layout on the GPU-accelerated IDW Interpolation | opencl, opengl, webcl, webgl | Scoop.it
This paper focuses on evaluating the performance impact of different data layouts on the GPU-accelerated IDW interpolation. First, we redesign and improve our previous GPU implementation that was p...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

The battle of the giants: a case study of GPU vs FPGA optimisation for real-time image processing

The battle of the giants: a case study of GPU vs FPGA optimisation for real-time image processing | opencl, opengl, webcl, webgl | Scoop.it
This paper focuses on a thorough comparison of the two main hardware targets for real-time optimization of a computer vision algorithm: GPU and FPGA. Based on a complex case study algorithm for thr...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Towards a Performance-Portable FFT Library for Heterogeneous Computing

Towards a Performance-Portable FFT Library for Heterogeneous Computing | opencl, opengl, webcl, webgl | Scoop.it
The fast Fourier transform (FFT), a spectral method that computes the discrete Fourier transform and its inverse, pervades many applications in digital signal processing, such as imaging, tomograph...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Efficient pseudo-random number generation for monte-carlo simulations using graphic processors

Efficient pseudo-random number generation for monte-carlo simulations using graphic processors | opencl, opengl, webcl, webgl | Scoop.it
A hybrid approach based on the combination of three Tausworthe generators and one linear congruential generator for pseudo random number generation for GPU programing as suggested in NVIDIA-CUDA li...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

LDetector: A Low Overhead Race Detector For GPU Programs

LDetector: A Low Overhead Race Detector For GPU Programs | opencl, opengl, webcl, webgl | Scoop.it
Data race detection is an important problem in GPU programming. The paper presents a novel solution. It uses the compiler support to privatize shared data and then at run time parallelizes the race...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

High-Performance Graphics 2014

High-Performance Graphics 2014 | opencl, opengl, webcl, webgl | Scoop.it
High-Performance Graphics is the leading international forum for performance-oriented graphics and imaging systems research including innovative algorithms, efficient implementations, languages, pa...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths

Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths | opencl, opengl, webcl, webgl | Scoop.it
Finding the shortest paths from a single source to all other vertices is a fundamental method used in a variety of higher-level graph algorithms. We present three parallel-friendly and work-efficie...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Pixi.js - 2D webGL renderer with canvas fallback

Pixi.js - 2D webGL renderer with canvas fallback | opencl, opengl, webcl, webgl | Scoop.it
Pixi.js is a devoted rendering engine. There are a host of other engines covering game, sound and physics etc. and they all work beautifully with Pixi.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Porting FEASTFLOW to the Intel Xeon Phi: Lessons Learned

Porting FEASTFLOW to the Intel Xeon Phi: Lessons Learned | opencl, opengl, webcl, webgl | Scoop.it
In this paper we report our experiences in porting the FEASTFLOW software infrastructure to the Intel Xeon Phi coprocessor. Our efforts involved both the evaluation of programming models including ...
more...
No comment yet.