opencl, opengl, w...
Scooped by Mikael Bourges-Sevenier

GPU-Qin: A Methodology for Evaluating the Error Resilience of GPGPU Applications

While graphics processing units (GPUs) have gained wide adoption as accelerators for general-purpose applications (GPGPU), the end-to-end reliability implications of their use have not been quantif...

A Performance Criteria for parallel Computation on basis of block size using CUDA Architecture

A GPU based on the CUDA architecture developed by NVIDIA is a high-performance computing device. Multiplication of matrices of large order can be computed in a few seconds using a GPU based on the CUDA Architect...

Consolidating Applications for Energy Efficiency in Heterogeneous Computing Systems

By scheduling multiple applications with complementary resource requirements on a smaller number of compute nodes, we aim to improve performance, resource utilization, energy consumption, and energ...

A QUDA-branch to compute disconnected diagrams in GPUs

Although QUDA allows for an efficient computation of many QCD quantities, it is surprisingly lacking tools to evaluate disconnected diagrams, for which GPUs are especially well suited. We aim to fi...

GPU-Accelerated BWT Construction for Large Collection of Short Reads

Advances in DNA sequencing technology have stimulated the development of algorithms and tools for processing very large collections of short strings (reads). Short-read alignment and assembly are a...

A Novel Graphical Processing Unit Method for Power Systems Security Analysis

There is an increasing need for computational power to drive software tools used in power systems planning and operations, since the emergence of modern energy markets and recent renewable generati...

Platform-Specific Optimization and Mapping of Stencil Codes through Refinement

A straightforward implementation of an algorithm in a general-purpose programming language usually does not deliver peak performance: compilers often fail to automatically tune the code for certain...

gem5-gpu: A Heterogeneous CPU-GPU Simulator

gem5-gpu is a new simulator that models tightly integrated CPU-GPU systems. It builds on gem5, a modular full-system CPU simulator, and GPGPU-Sim, a detailed GPGPU simulator. gem5-gpu routes most me...

Hybrid strategy for stencil computations on the APU

Stencil computations are very regular and well adapted to GPU execution. However, the PCI-E bus that connects a discrete GPU to the system memory has a relatively low bandwidth when compared to the...

Improvement of the fused CUDA kernels performance prediction

In this thesis, a tool for improving the performance prediction of a source-to-source compiler of mapped functions developed at the Faculty of Informatics is presented. This tool integrates the modi...

On the Portability of the OpenCL Dwarfs on Fixed and Reconfigurable Parallel Platforms

The proliferation of heterogeneous computing systems presents the parallel computing community with the challenge of porting legacy and emerging applications to multiple processors with diverse pro...

clpeak - peak performance of your opencl device

clpeak is a benchmarking tool intended for developers who want to fine-tune OpenCL kernels for a particular device or class of device. It calculates bandwidth and compute performance for different vector-...

SPIR Protects OpenCL C Code

CMSoft: Solutions in Computational Mathematics Software

Automatic Resource-Constrained Static Task Parallelization

This thesis intends to show how to efficiently exploit the parallelism present in applications in order to enjoy the performance benefits that multiprocessors can provide, using a new automatic tas...

On the Programmability and Performance of Heterogeneous Platforms

General-purpose computing on an ever-broadening array of parallel devices has led to an increasingly complex and multi-dimensional landscape with respect to programmability and performance optimiza...

A Detailed GPU Cache Model Based on Reuse Distance Theory

As modern GPUs rely partly on their on-chip memories to counter the imminent off-chip memory wall, the efficient use of their caches has become important for performance and energy. However, optimi...

A GPU accelerated algorithm for 3D Delaunay triangulation

We propose the first algorithm to compute the 3D Delaunay triangulation (DT) on the GPU. Our algorithm uses massively parallel point insertion followed by bilateral flipping, a powerful local opera...

Comparing the Performance of Different x86 SIMD Instruction Sets for a Medical Imaging Application on Modern Multi- and Manycore Chips

Single Instruction, Multiple Data (SIMD) vectorization is a major driver of performance in current architectures, and is mandatory for achieving good performance with codes that are limited by inst...

Low-latency Image Recognition with GPU-accelerated Convolutional Networks for Web-based Services

In this work, we describe an application of convolutional networks to object classification and detection in images. The task of image based object recognition is surveyed in the first chapter. Its...

A Dynamic Offload Scheduler for spatial multitasking on Intel Xeon Phi Coprocessor

The Intel Xeon Phi coprocessor fully supports multitasking, but this does not automatically ensure high performance. A conventional task level resource allocation scheduler co...

Efficient Parallel Implementation of Active Appearance Model Fitting Algorithm on GPU

The Active Appearance Model (AAM) is one of the most powerful model-based object detecting and tracking methods that has been widely used in various situations. However, the high-dimensional textur...

Optimizing Stencil Computations for NVIDIA Kepler GPUs

We present a series of optimization techniques for stencil computations on NVIDIA Kepler GPUs. Stencil computations with regular grids had been ported to the older generations of NVIDIA GPUs with s...

A High-productivity Framework for Multi-GPU computation of Mesh-based applications

The paper proposes a high-productivity framework for multi-GPU computation of mesh-based applications. In order to achieve high performance on these applications, we have to introduce complicated o...

Build 3D applications with the WebGL-based MontageJS 3D component


OpenSSL acceleration using Graphics Processing Units

Cryptography: The study of techniques focused on security. Typically, an implementation of cryptography is computationally heavy, leading to performance issues on general purpose systems. Adding th...