opencl, opengl, webcl, webgl
24.6K views | +0 today
Follow
Your new post is loading...
Your new post is loading...
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Methods and Metrics for Fair Server Assessment under Real-Time Financial Workloads

Methods and Metrics for Fair Server Assessment under Real-Time Financial Workloads | opencl, opengl, webcl, webgl | Scoop.it
Energy efficiency has been a daunting challenge for datacenters. The financial industry operates some of the largest datacenters in the world. With increasing energy costs and the financial service...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A New Sparse Matrix Vector Multiplication GPU Algorithm Designed for Finite Element Problems

A New Sparse Matrix Vector Multiplication GPU Algorithm Designed for Finite Element Problems | opencl, opengl, webcl, webgl | Scoop.it
Recently, graphics processors (GPUs) have been increasingly leveraged in a variety of scientific computing applications. However, architectural differences between CPUs and GPUs necessitate the dev...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Performance comparison of Lattice Boltzmann fluid flow simulation using OpenCL and CUDA frameworks

Performance comparison of Lattice Boltzmann fluid flow simulation using OpenCL and CUDA frameworks | opencl, opengl, webcl, webgl | Scoop.it
This paper presents performance comparison, of the lid-driven cavity flow simulation, with Lattice Boltzmann method, example, between CUDA and OpenCL parallel programming frameworks. CUDA is parall...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Real-Time Incompressible Fluid Simulation on the GPU

Real-Time Incompressible Fluid Simulation on the GPU | opencl, opengl, webcl, webgl | Scoop.it
We present a parallel framework for simulating incompressible fluids with predictive-corrective incompressible Smoothed Particle Hydrodynamics (PCISPH) on the GPU in real time. To this end, we prop...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

ORNL Introductory Tutorials On Concurrent Kernels - TechEnablement

ORNL Introductory Tutorials On Concurrent Kernels - TechEnablement | opencl, opengl, webcl, webgl | Scoop.it
Oakridge has published two very introductory tutorials to teach how to utilize concurrent kernels on their systems. See TechEnablement for more tutorials.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

To Use or Not to Use: Graphics Processing Units for Pattern Matching Algorithms

To Use or Not to Use: Graphics Processing Units for Pattern Matching Algorithms | opencl, opengl, webcl, webgl | Scoop.it
String matching is an important part in today's computer applications and Aho-Corasick algorithm is one of the main string matching algorithms used to accomplish this. This paper discusses that whe...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Spectral classification using convolutional neural networks

Spectral classification using convolutional neural networks | opencl, opengl, webcl, webgl | Scoop.it
There is a great need for accurate and autonomous spectral classification methods in astrophysics. This thesis is about training a convolutional neural network (ConvNet) to recognize an object clas...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Extending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications

Extending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications | opencl, opengl, webcl, webgl | Scoop.it
CUDA and OpenCL are the most widely used programming models to exploit hardware accelerators. Both programming models provide a C-based programming language to write accelerator kernels and a host ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Speed Python Numerical Applications by 2x - 120x With HOPE - TechEnablement

Speed Python Numerical Applications by 2x - 120x With HOPE - TechEnablement | opencl, opengl, webcl, webgl | Scoop.it
HOPE is an open source specialized method-at-a-time JIT compiler that translates Python source code into C++ and achieves 2x - 120x speedup.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast Convolutional Nets With fbfft: A GPU Performance Evaluation

Fast Convolutional Nets With fbfft: A GPU Performance Evaluation | opencl, opengl, webcl, webgl | Scoop.it
We examine the performance profile of Convolutional Neural Network training on the current generation of NVIDIA Graphics Processing Units. We introduce two new Fast Fourier Transform convolution im...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs | opencl, opengl, webcl, webgl | Scoop.it
Deep Convolutional Neural Networks (DCNNs) have recently shown state of the art performance in high level vision tasks, such as image classification and object detection. This work brings together ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Automatic Tuning of Local Memory Use on GPGPUs

Automatic Tuning of Local Memory Use on GPGPUs | opencl, opengl, webcl, webgl | Scoop.it
The use of local memory is important to improve the performance of OpenCL programs. However, its use may not always benefit performance, depending on various application characteristics, and there ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

GPU: Power vs Performance

GPU: Power vs Performance | opencl, opengl, webcl, webgl | Scoop.it
GPUs are widely being used to meet the ever increasing demands of High performance computing. High-end GPUs are one of the highest consumers of power in a computer. Power dissipation has always bee...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Subdivision Surface Evaluation as Sparse Matrix-Vector Multiplication

Subdivision Surface Evaluation as Sparse Matrix-Vector Multiplication | opencl, opengl, webcl, webgl | Scoop.it
We present an interpretation of subdivision surface evaluation in the language of linear algebra. Specifically, the vector of surface points can be computed by left-multiplying the vector of contro...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Disjunctive Normal Networks

Disjunctive Normal Networks | opencl, opengl, webcl, webgl | Scoop.it
Artificial neural networks are powerful pattern classifiers; however, they have been surpassed in accuracy by methods such as support vector machines and random forests that are also easier to use ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Customization of OpenCL Applications for Efficient Task Mapping under Heterogeneous Platform Constraints

Customization of OpenCL Applications for Efficient Task Mapping under Heterogeneous Platform Constraints | opencl, opengl, webcl, webgl | Scoop.it
When targeting an OpenCL application to platforms with multiple heterogeneous accelerators, task tuning and mapping have to cope with device-specific constraints. To address this problem, we presen...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Facebook Open Source GPU FFT 1.5x Faster Than NVIDIA CUFFT - TechEnablement

Facebook Open Source GPU FFT 1.5x Faster Than NVIDIA CUFFT - TechEnablement | opencl, opengl, webcl, webgl | Scoop.it
Facebook has written a Fast Fourier Transform (fbfft) that is 1.5x faster than the NVIDIA CUFFT implementation at sizes 8-64.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Video: A Short Introduction to High Performance Computing - insideHPC

Video: A Short Introduction to High Performance Computing - insideHPC | opencl, opengl, webcl, webgl | Scoop.it
In this video, Dr. Andrew Turner from EPCC in the UK presents: A Short Introduction to High Performance Computing.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

How to Correctly Deal With Pseudorandom Numbers in Manycore Environments - Application to GPU programming with Shoverand

How to Correctly Deal With Pseudorandom Numbers in Manycore Environments - Application to GPU programming with Shoverand | opencl, opengl, webcl, webgl | Scoop.it
Stochastic simulations are often sensitive to the source of randomness that characterizes the statistical quality of their results. Consequently, we need highly reliable Random Number Generators (R...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Tool for Automatic Suggestions for Irregular GPU Kernel Optimization

A Tool for Automatic Suggestions for Irregular GPU Kernel Optimization | opencl, opengl, webcl, webgl | Scoop.it
Future computing systems, from handhelds all the way to supercomputers, will be more parallel and more heterogeneous than today's systems to provide more performance without an increase in power co...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Characterization of OpenCL on a Scalable FPGA Architecture

Characterization of OpenCL on a Scalable FPGA Architecture | opencl, opengl, webcl, webgl | Scoop.it
The recent release of Altera's SDK for OpenCL has greatly eased the development of FPGA-based systems. Research have shown performance improvements brought by OpenCL using a single FPGA device. How...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Machine Learning: What Computational Researchers Need to Know

Machine Learning: What Computational Researchers Need to Know | opencl, opengl, webcl, webgl | Scoop.it
Nvidia GPUs are powering a revolution in machine learning.
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Computationally Efficient Implementation of a Hamming Code Decoder using a Graphics Processing Unit

Computationally Efficient Implementation of a Hamming Code Decoder using a Graphics Processing Unit | opencl, opengl, webcl, webgl | Scoop.it
This paper presents a computationally efficient implementation of a Hamming code decoder on a graphics processing unit (GPU) to support real-time software-defined radio (SDR), which is a software a...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Accelerating Correlation Power Analysis Using Graphics Processing Units

Accelerating Correlation Power Analysis Using Graphics Processing Units | opencl, opengl, webcl, webgl | Scoop.it
Correlation Power Analysis (CPA) is a type of power analysis based side channel attack that can be used to derive the secret key of encryption algorithms including DES (Data Encryption Standard) an...
more...
No comment yet.