opencl, opengl, webcl, webgl
Scooped by Mikael Bourges-Sevenier

Mapping dynamic programming algorithms on graphics processing units

Alignment is the fundamental operation used to compare biological sequences. It also serves to identify regions of similarity that may be consequences of structural, functional, or evolutio...

Parallel implementation of linear repetitive processes identification using subspace algorithms

This paper presents a new parallel approach to identification of linear repetitive processes based on subspace algorithms. Parallel realizations of these algorithms are tested on various graphic ca...

A new ray-tracing scheme for 3D diffuse radiation transfer on highly parallel architectures

We present a new numerical scheme to solve the transfer of diffuse radiation on three-dimensional mesh grids which is efficient on processors with highly parallel architecture such as recently popu...

KBLAS: An Optimized Library for Dense Matrix-Vector Multiplication on GPU Accelerators

KBLAS is a new open source high performance library that provides optimized kernels for a subset of Level 2 BLAS functionalities on CUDA-enabled GPUs. Since performance of dense matrix-vector multi...

cuDNN: Efficient Primitives for Deep Learning

We present a library that provides optimized implementations for deep learning primitives. Deep learning workloads are computationally intensive, and optimizing the kernels of deep learning workloa...

Embedding GPU Computations in Hadoop

As the size of high performance applications increases, four major challenges including heterogeneity, programmability, fault resilience, and energy efficiency have arisen in the underlying distrib...

Real-time Multi-view Depth Generation Using CUDA Multi-GPU

In this paper, we propose a real-time multi-view depth generation method using compute unified device architecture (CUDA) multi-graphics processing units (GPU). The objective is to generate multi-v...

Parallware: Automatic Parallelization of Sequential Codes - insideHPC

Manuel Arenaz, CEO at Appentra and Professor at the University of A Coruña, Spain presents: Parallware: Automatic Parallelization of Sequential Codes.

GPU Accelerated Radio Wave Propagation Modeling Using Ray Tracing

Radar manufacturers, most of which are in the defense industry, need a radar environment simulator to test their products during development. Such a simulator helps them avoid costly fi...

The Unabridged Chapter 1 Introduction To High Performance Parallelism Pearls

Following is the full, unabridged text of the Chapter 1 introduction (written by James Reinders) to High Performance Parallelism Pearls. Thanks to Morgan Kaufmann, James Reinders, and Jim Jeffers ...

Fast Estimation of Gaussian Mixture Model Parameters on GPU using CUDA

Gaussian Mixture Models (GMMs) are widely used among scientists, e.g. in statistics toolkits and data mining procedures. In order to estimate the parameters of a GMM, the Maximum Likelihood (ML) training...

Microarchitectural Performance Characterization of Irregular GPU Kernels

GPUs are increasingly being used to accelerate general-purpose applications, including applications with data-dependent, irregular memory access patterns and control flow. However, relatively littl...

CUDA Pro Tip: Optimized Filtering with Warp-Aggregated Atomics

In this post, I’ll introduce warp-aggregated atomics, a useful technique to improve performance when many threads atomically add to a single counter. In warp aggregation, the threads of a warp firs...

FDTD on Distributed Heterogeneous Multi-GPU Systems

Finite-Difference Time-Domain (FDTD) is a popular technique for modeling computational electrodynamics, and is used within many research areas, such as the development of antennas, ultrasound imagi...

Code Refinement of Stencil Codes

A straightforward implementation of an algorithm in a general-purpose programming language usually does not deliver peak performance: compilers often fail to automatically tune the code for certain...

A Framework for the Volumetric Integration of Depth Images

Volumetric models have become a popular representation for 3D scenes in recent years. One of the breakthroughs leading to their popularity was KinectFusion, where the focus is on 3D reconstruction ...

Movement Tracking in Terrain Conditions Accelerated with CUDA

The paper presents a solution to the problem of movement tracking in images acquired from video cameras monitoring outside terrain. The solution is resistant to such adverse factors as: leaves flut...

Load Balancing in Data Warehouse - Evolution and Perspectives

Load balancing is one of the crucial problems in distributed data warehouse systems. This article presents original load balancing algorithms. The Adaptive Load Balancing Alg...

Using Graphics Processing Unit to Accelerate Database Query Execution

One of the major problems in database management systems is handling large amounts of data while providing short response times. The problem is not only the proper manner of storing records but also efficie...

Parallel Shortest Path Algorithm for Voronoi Diagrams with Generalized Distance Functions

Voronoi diagrams are fundamental data structures in computational geometry with applications in different areas. Recent soft object simulation algorithms for real-time physics engines require the c...

Deep Dynamic Neural Networks for Gesture Segmentation and Recognition

The purpose of this paper is to describe a novel method called Deep Dynamic Neural Networks (DDNN) for Track 3 of the Chalearn Looking at People 2014 challenge [1]. A generalised semi-supervised...

Increasing the Throughput of your GPU-enabled Cluster with rCUDA

In this video from the HPC Advisory Council Spain Conference, Federico Silla from the Technical University of Valencia presents: Increasing the throughput of your GPU-enabled cluster with rCUDA.

A stencil-based implementation of Parareal in the C++ domain specific embedded language STELLA

In view of the rapid rise of the number of cores in modern supercomputers, time-parallel methods that introduce concurrency along the temporal axis are becoming increasingly popular. For the soluti...

Fast Automatic Heuristic Construction Using Active Learning

Building effective optimization heuristics is a challenging task which often takes developers several months if not years to complete. Predictive modelling has recently emerged as a promising solut...