opencl, opengl, webcl, webgl
23.0K views | +3 today
Follow
Your new post is loading...
Your new post is loading...
Scooped by Mikael Bourges-Sevenier
Scoop.it!

High performance methods for frequent pattern mining

High performance methods for frequent pattern mining | opencl, opengl, webcl, webgl | Scoop.it
Current Big Data era is generating tremendous amount of data in most fields such as business, social media, engineering, and medicine. The demand to process and handle the resulting "big data&...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

NMF-mGPU: non-negative matrix factorization on multi-GPU systems

NMF-mGPU: non-negative matrix factorization on multi-GPU systems | opencl, opengl, webcl, webgl | Scoop.it
BACKGROUND: In the last few years, the Non-negative Matrix Factorization (NMF) technique has gained a great interest among the Bioinformatics community, since it is able to extract interpretable pa...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Reproducible Triangular Solvers for High-Performance Computing

Reproducible Triangular Solvers for High-Performance Computing | opencl, opengl, webcl, webgl | Scoop.it
On modern parallel architectures, floating-point computations may become non-deterministic and, therefore, non-reproducible mainly due to non-associativity of floating-point operations. We propose ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Locally-Oriented Programming: A Simple Programming Model for Stencil-Based Computations on Multi-Level Distributed Memory Architectures

Locally-Oriented Programming: A Simple Programming Model for Stencil-Based Computations on Multi-Level Distributed Memory Architectures | opencl, opengl, webcl, webgl | Scoop.it
Emerging hybrid accelerator architectures for high performance computing are often suited for the use of a data-parallel programming model. Unfortunately, programmers of these architectures face a ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPU Pro Tip: Fast Dynamic Indexing of Private Arrays in CUDA

GPU Pro Tip: Fast Dynamic Indexing of Private Arrays in CUDA | opencl, opengl, webcl, webgl | Scoop.it
Sometimes you need to use small per-thread arrays in your GPU kernels. The performance of accessing elements in these arrays can vary depending on a number of factors. In this post I'll cover sever...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

How OpenCL Could Open the Gates for FPGAs

How OpenCL Could Open the Gates for FPGAs | opencl, opengl, webcl, webgl | Scoop.it
"The silver bullet in HLS is the ability to take a sequential description that has been written in C and then find this parallelism, the concurrencies, without the user having to think. That was a ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Tutorial on the OpenCL 2.0 Generic Address Space - TechEnablement

Tutorial on the OpenCL 2.0 Generic Address Space - TechEnablement | opencl, opengl, webcl, webgl | Scoop.it
The OpenCL 2.0 generic address space makes writing OCL apps easier by removing the requirement of decorating all pointers with a points to address space.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Analysis and Modeling of the Timing Behavior of GPU Architectures

Analysis and Modeling of the Timing Behavior of GPU Architectures | opencl, opengl, webcl, webgl | Scoop.it
Graphics processing units (GPUs) offer massive parallelism. Since a couple of years GPUs can also be used for more general purpose applications; a wide variety of applications can be accelerated ef...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast Subgraph Matching on Large Graphs using Graphics Processors

Fast Subgraph Matching on Large Graphs using Graphics Processors | opencl, opengl, webcl, webgl | Scoop.it
Subgraph matching is the task of finding all matches of a query graph in a large data graph, which is known as an NP-complete problem. Many algorithms are proposed to solve this problem using CPUs....
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Comparison of OpenCL performance on different platforms using VexCL and Blaze

Comparison of OpenCL performance on different platforms using VexCL and Blaze | opencl, opengl, webcl, webgl | Scoop.it
This technical report provides performance numbers for several benchmark problems running on several different hardware platforms. The goal of this report is twofold. First, it helps us better unde...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Stochastic Gradient Descent on GPUs

Stochastic Gradient Descent on GPUs | opencl, opengl, webcl, webgl | Scoop.it
Irregular algorithms such as Stochastic Gradient Descent (SGD) can benefit from the massive parallelism available on GPUs. However, unlike in data-parallel algorithms, synchronization patterns in S...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Memory-efficient Adaptive Subdivision for Software Rendering on the GPU

Memory-efficient Adaptive Subdivision for Software Rendering on the GPU | opencl, opengl, webcl, webgl | Scoop.it
The adaptive subdivision step for surface tessellation is a key component of the Reyes rendering pipeline. While this operation has been successfully parallelized for execution on the GPU using a b...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Auto-tuning Shallow water simulations on GPUs

Auto-tuning Shallow water simulations on GPUs | opencl, opengl, webcl, webgl | Scoop.it
Graphic processing units (GPUs) have gained popularity in scientific computing the recent years. This is because of the massive computing power they can provide for parallel tasks, and while GPUs a...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Primal Dual Affine Scaling on GPUs

Primal Dual Affine Scaling on GPUs | opencl, opengl, webcl, webgl | Scoop.it
Here we present an implementation of Primal-Dual Affine scaling method to solve linear optimization problem on GPU based systems. Strategies to convert the system generated by complementary slackne...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

NUPAR: A Benchmark Suite for Modern GPU Architectures

NUPAR: A Benchmark Suite for Modern GPU Architectures | opencl, opengl, webcl, webgl | Scoop.it
Heterogeneous systems consisting of multi-core CPUs, Graphics Processing Units (GPUs) and many-core accelerators have gained widespread use by application developers and data-center platform develo...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Real-time GPU Implementation of the SIFT Algorithm for Large-Scale Video Analysis Tasks

A Real-time GPU Implementation of the SIFT Algorithm for Large-Scale Video Analysis Tasks | opencl, opengl, webcl, webgl | Scoop.it
The SIFT algorithm is one of the most popular feature extraction methods and therefore widely used in all sort of video analysis tasks like instance search and duplicate/near-duplicate detection. W...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Accelerating Bioinformatics with NVBIO

Accelerating Bioinformatics with NVBIO | opencl, opengl, webcl, webgl | Scoop.it
NVBIO is an open-source C++ template library of high performance parallel algorithms and containers designed by NVIDIA to accelerate sequence analysis and bioinformatics applications. NVBIO has a t...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Patterns and Rewrite Rules for Systematic Code Generation (From High-Level Functional Patterns to High-Performance OpenCL Code)

Patterns and Rewrite Rules for Systematic Code Generation (From High-Level Functional Patterns to High-Performance OpenCL Code) | opencl, opengl, webcl, webgl | Scoop.it
Computing systems have become increasingly complex with the emergence of heterogeneous hardware combining multicore CPUs and GPUs. These parallel systems exhibit tremendous computational power at t...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPU-accelerated HMM for Speech Recognition

GPU-accelerated HMM for Speech Recognition | opencl, opengl, webcl, webgl | Scoop.it
Speech recognition is used in a wide range of applications and devices such as mobile phones, in-car entertainment systems and web-based services. Hidden Markov Models (HMMs) is one of the most pop...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Speech Recognition on Modern Graphic Processing Units

Speech Recognition on Modern Graphic Processing Units | opencl, opengl, webcl, webgl | Scoop.it
Speech Recognition run on Graphic Processing Units (GPUs) has shown some promising performance improvements ranging 2-10x speedups when compare to execution on CPUs. GPU has continued to introduce ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Intel Posts OpenCL 2.0 QuickSort Tutorial (Compare to TE CUDA Version) - TechEnablement

Intel Posts OpenCL 2.0 QuickSort Tutorial (Compare to TE CUDA Version) - TechEnablement | opencl, opengl, webcl, webgl | Scoop.it
Intel Engineer Robert Ioffe has posted an OpenCL QuickSort tutorial that utilizes nested parallelism and Workgroup-scan functions. In particular, the tutorial shows how to use the OpenCL™ 2.0 enqueue_kernel functions that queue kernels from the device...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Video: How to Build a Cheap Supercomputer

Video: How to Build a Cheap Supercomputer | opencl, opengl, webcl, webgl | Scoop.it
In this video, Rasim Muratovic shows you how to to build a cheap super computer using Raspberry Pi devices. In related news, the $35 Raspberry Pi 2 is out with a faster processor and twice the mem...
more...
No comment yet.