opencl, opengl, w...
Follow
Find
18.5K views | +0 today
 
Scooped by Mikael Bourges-Sevenier
onto opencl, opengl, webcl, webgl
Scoop.it!

Kokkos: Enabling performance portability across manycore architectures

The manycore revolution in computational hardware can be characterized by increasing thread counts, decreasing memory per thread, and architecture specific performance constraints for memory access...
more...
No comment yet.
Your new post is loading...
Your new post is loading...
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Video: Increasing Cluster Throughput for GPU Workloads - insideHPC

Video: Increasing Cluster Throughput for GPU Workloads - insideHPC | opencl, opengl, webcl, webgl | Scoop.it
Federico Silla from the Universitat Politècnica de València presents: Increasing Cluster Throughput while Reducing Energy Consumption for GPU Workloads.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Optimizing Navier-Stokes Equations

Optimizing Navier-Stokes Equations | opencl, opengl, webcl, webgl | Scoop.it
Solving Navier-Sokes equations are popular because they describe the physics of in a number of areas of interest to scientists and engineers.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

The Feasibility of Using OpenCL Instead of OpenMP for Parallel CPU Programming

The Feasibility of Using OpenCL Instead of OpenMP for Parallel CPU Programming | opencl, opengl, webcl, webgl | Scoop.it
OpenCL, along with CUDA, is one of the main tools used to program GPGPUs. However, it allows running the same code on multi-core CPUs too, making it a rival for the long-established OpenMP. In this...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Energy-efficient Computing on Distributed GPUs using Dynamic Parallelism and GPU-controlled Communication

Energy-efficient Computing on Distributed GPUs using Dynamic Parallelism and GPU-controlled Communication | opencl, opengl, webcl, webgl | Scoop.it
GPUs are widely used in high performance computing, due to their high computational power and high performance per Watt. Still, one of the main bottlenecks of GPU-accelerated cluster computing is t...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

insideHPC Guide to Open Computing - insideHPC

insideHPC Guide to Open Computing - insideHPC | opencl, opengl, webcl, webgl | Scoop.it
This guide to Open Computing is design to help organizations optimize their HPC environment to achieve higher performance at a lower operating cost.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Financial Benchmark for GPGPU Compilation

A Financial Benchmark for GPGPU Compilation | opencl, opengl, webcl, webgl | Scoop.it
Commodity many-core hardware is now mainstream, driven in particular by the evolution of general purpose graphics programming units (GPGPUs), but parallel programming models are lagging behind in e...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPU Kernels for High-Speed 4-Bit Astrophysical Data Processing

GPU Kernels for High-Speed 4-Bit Astrophysical Data Processing | opencl, opengl, webcl, webgl | Scoop.it
Interferometric radio telescopes often rely on computationally expensive O(N^2) correlation calculations; fortunately these computations map well to massively parallel accelerators such as low-cost...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Single stream parallelization of generalized LSTM-like RNNs on a GPU

Single stream parallelization of generalized LSTM-like RNNs on a GPU | opencl, opengl, webcl, webgl | Scoop.it
Recurrent neural networks (RNNs) have shown outstanding performance on processing sequence data. However, they suffer from long training time, which demands parallel implementations of the training...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Speeding Up Computer Vision Applications on Mobile Computing Platforms

Speeding Up Computer Vision Applications on Mobile Computing Platforms | opencl, opengl, webcl, webgl | Scoop.it
Computer vision (CV) is widely expected to be the next "Big Thing" in mobile computing. For example, Google has recently announced their project "Tango", a 5-inch Android phone ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Implementation of a Practical Distributed Calculation System with Browsers and JavaScript, and Application to Distributed Deep Learning

Implementation of a Practical Distributed Calculation System with Browsers and JavaScript, and Application to Distributed Deep Learning | opencl, opengl, webcl, webgl | Scoop.it
Deep learning can achieve outstanding results in various fields. However, it requires so significant computational power that graphics processing units (GPUs) and/or numerous computers are often re...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication

CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication | opencl, opengl, webcl, webgl | Scoop.it
Sparse matrix-vector multiplication (SpMV) is a fundamental building block for numerous applications. In this paper, we propose CSR5 (Compressed Sparse Row 5), a new storage format, which offers hi...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

DIGITS: Deep Learning GPU Training System

DIGITS: Deep Learning GPU Training System | opencl, opengl, webcl, webgl | Scoop.it
The hottest area in machine learning today is Deep Learning, which uses Deep Neural Networks (DNNs) to teach computers to detect recognizable concepts in data. Researchers and industry practitioner...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

The Vulkan & SPIR-V Presentations From NVIDIA GTC 2015 - Phoronix

The Vulkan & SPIR-V Presentations From NVIDIA GTC 2015 - Phoronix | opencl, opengl, webcl, webgl | Scoop.it
Phoronix is the leading technology website for Linux hardware reviews, open-source news, Linux benchmarks, open-source benchmarks, distribution screenshots, interviews, and computer hardware tests.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Data driven scheduling approach for the multi-node multi-GPU Cholesky decomposition

Data driven scheduling approach for the multi-node multi-GPU Cholesky decomposition | opencl, opengl, webcl, webgl | Scoop.it
Recently large scale scientific computation on heterogeneous supercomputers equipped with accelerators is receiving attraction. However, traditional static job execution methods and memory manageme...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Analysis of illumination conditions at the lunar south pole using parallel computing techniques

Analysis of illumination conditions at the lunar south pole using parallel computing techniques | opencl, opengl, webcl, webgl | Scoop.it
In this Master Thesis an analysis of illumination conditions at the lunar south pole using parallel computing techniques is presented. Due to the small inclination (1.54o) of the lunar rotational a...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Pseudorandom Numbers Generation for Monte Carlo Simulations on GPUs: OpenCL Approach

Pseudorandom Numbers Generation for Monte Carlo Simulations on GPUs: OpenCL Approach | opencl, opengl, webcl, webgl | Scoop.it
General principles of pseudorandom numbers production for Monte Carlo simulations on GPUs are discussed by creating an OpenCL open-source library of pseudorandom number generators PRNGCL. The libra...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Massively Parallel Construction of the Cell Graph

Massively Parallel Construction of the Cell Graph | opencl, opengl, webcl, webgl | Scoop.it
Motion planning is an important and well-studied field of robotics. A typical approach to finding a route is to construct a cell graph representing a scene and then to find a path in such a graph. ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Curracurrong: a stream processing system for distributed environments

Curracurrong: a stream processing system for distributed environments | opencl, opengl, webcl, webgl | Scoop.it
Advances in technology have given rise to applications that are deployed on wireless sensor networks (WSNs), the cloud, and the Internet of things. There are many emerging applications, some of whi...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Raising the Bar for Using GPUs in Software Packet Processing

Raising the Bar for Using GPUs in Software Packet Processing | opencl, opengl, webcl, webgl | Scoop.it
Numerous recent research efforts have explored the use of Graphics Processing Units (GPUs) as accelerators for software-based routing and packet handling applications, typically demonstrating throu...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

PTX2Kernel: Converting PTX Code into Compilable Kernels

PTX2Kernel: Converting PTX Code into Compilable Kernels | opencl, opengl, webcl, webgl | Scoop.it
GPUs are now widely used as high performance general purpose computing devices. More and more applications have achieved large speedups with one or more GPUs, and the number of GPU programs is grow...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

The More We Share, The More We Have: Improving GPU performance through Register Sharing

The More We Share, The More We Have: Improving GPU performance through Register Sharing | opencl, opengl, webcl, webgl | Scoop.it
Graphics Processing Units (GPUs) consisting of Streaming Multiprocessors (SMs) achieve high throughput by running a large number of threads and context switching among them to hide execution latenc...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Sharing Surfaces between OpenCL™ and DirectX 11 on Intel® Processor Graphics - CodeProject

Sharing Surfaces between OpenCL™ and DirectX 11 on Intel® Processor Graphics - CodeProject | opencl, opengl, webcl, webgl | Scoop.it
This tutorial demonstrates how to share surfaces between OpenCL™ and DirectX 11 with Intel ® Processor Graphics on Microsoft Windows, using the surface sharing extension in OpenCL.; Author: Intel Corporation; Updated: 18 Mar 2015; Section: Product...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPU Pro Tip: Fast Histograms Using Shared Atomics on Maxwell

GPU Pro Tip: Fast Histograms Using Shared Atomics on Maxwell | opencl, opengl, webcl, webgl | Scoop.it
Histograms are an important data representation with many applications in computer vision, data analytics and medical imaging. A histogram is a graphical representation of the data distribution acr...
more...
No comment yet.