opencl, opengl, webcl, webgl
Scooped by Mikael Bourges-Sevenier

Parallel multi-agent path planning in dynamic environments for real-time applications

Current path planning algorithms are not efficient enough to provide optimal path planning in dynamic environments for a large number of agents in real time. Furthermore, there are no real-time algor...

Pipeline strategies to accelerate range query processing on a multi-GPU environment

Nowadays, similarity search is becoming a field of increasing interest because these kinds of methods can be applied to different areas in computer science and engineering, such as voice and image ...

Improving Resource Utilization in Heterogeneous CPU-GPU Systems

Graphics processing units (GPUs) have attracted enormous interest over the past decade due to substantial increases in both performance and programmability. Programmers can potentially leverage GPU...

Processing MPI Derived Datatypes on Noncontiguous GPU-Resident Data

Driven by the goals of efficient and generic communication of noncontiguous data layouts in GPU memory, for which solutions do not currently exist, we present a parallel, noncontiguous data-process...

Fast Endmember Extraction for Massive Hyperspectral Sensor Data on GPUs

Hyperspectral imaging sensors are becoming increasingly important in multi-sensor collaborative observation. The spectral mixture problem seriously influences the efficiency of hyperspectral data exploit...

Accelerating Habanero-Java Programs with OpenCL Generation

The initial wave of programming models for general-purpose computing on GPUs, led by CUDA and OpenCL, has provided experts with low-level constructs to obtain significant performance and energy imp...

A GPU Implementation of Parallel Constraint-based Local Search

In this paper we study the performance of constraint-based local search solvers on a GPU. The massively parallel architecture of the GPU makes it possible to explore parallelism at two different le...

A streaming model for nested data parallelism

Efficient parallel algorithms are often written with embedded knowledge of the back-end that they are meant to be executed on, and if they are not, the translation to the target language often produces...

ClusterWatch: Flexible, Lightweight Monitoring for High-end GPGPU Clusters

The ClusterWatch middleware provides runtime flexibility in what system-level metrics are monitored, how frequently such monitoring is done, and how metrics are combined to obtain reliable informat...

Preconditioned conjugate gradient solver for structural problems

Matrix solvers play a crucial role in solving real-world physics problems. In engineering practice, transition analysis is most often used, which requires a series of similar matrices to be solved. ...

Exponential Integrators on Graphics Processing Units

In this paper we revisit stencil methods on GPUs in the context of exponential integrators. We further discuss boundary conditions, in the same context, and show that simple boundary conditions (fo...

Realtime Deformation of Constrained Meshes Using GPU

Constrained meshes play an important role in freeform architectural design, as they can represent panel layouts on freeform surfaces. It is challenging to perform realtime manipulation on such mesh...

BenchFriend: Correlating the Performance of GPU Benchmarks

Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly diff...

Performance of OpenCL

OpenCL is a relatively new standard that supports computation on a variety of parallel architectures. The author was unable to find reliable information about performance of OpenCL programs on CPU'...

Paralleling Variable Block Size Motion Estimation of HEVC on Multi-Core CPU Plus GPU Platform

Motion estimation with variable block sizes (VBSME) is one of the most complex models in the HEVC encoder. The HEVC standard supports up to 12 variable block sizes ranging from 4x8/8x4 to 64x64 for...

Investigating the Performance of Motion Estimation Block-Matching Algorithms on GPU Cards

In the field of video compression, motion estimation (ME) is a process that leads to high computational complexity. Implementation of ME block-matching (BM) algorithms on general purpose Central Pr...

GPU Accelerated Parameter Estimation by Global Optimization using Interval Analysis

This master's thesis treats the topic of non-linear parameter estimation using global optimization methods based on interval analysis (IA), accelerated by parallel implementation on a Graphics Proces...

Performing DCT8x8 Computation on GPU Using NVIDIA CUDA Technology

In this paper, we propose sequential and parallel Discrete Cosine Transform (DCT) implementations using Compute Unified Device Architecture (CUDA) libraries. The introduction of the programmable pipeline in the gra...

Optimization solutions for the segmented sum algorithmic function

This paper presents optimization solutions for the segmented sum algorithmic function, developed using the Compute Unified Device Architecture (CUDA), a powerful and efficient solutio...

gNek: A GPU Accelerated Incompressible Navier Stokes Solver

This thesis presents a GPU accelerated implementation of a high order splitting scheme with a spectral element discretization for the incompressible Navier Stokes (INS) equations. While others have...

Can GPUs Sort Strings Efficiently?

String sorting, or variable-length key sorting, has lagged in performance on the GPU even as fixed-length key sorting has improved dramatically. Radix sort is the fastest on GPUs. In this ...

Workload Analysis and Efficient OpenCL-based Implementation of SIFT Algorithm on a Smartphone

Feature detection and extraction are essential in computer vision applications such as image matching and object recognition. The Scale-Invariant Feature Transform (SIFT) algorithm is one of the mo...