opencl, opengl, w...
Follow
Find
17.3K views | +16 today
Your new post is loading...
Your new post is loading...
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Analysis-Driven Design of Parallel Floating-Point Matrix Multiplication for Implementation in Reconfigurable Logic

The objective of this research is to design an efficient and flexible implementation of parallel matrix multiplication for FPGA devices by analyzing the computation and studying its design space. I...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Performance Optimization of Vision Apps on Mobile Application Processor

Optimizing performance of compute-intensive vision apps running on mobile application processor (AP) is critical to satisfactory experience for smartphone and tablet users. Most existing vision alg...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Estimation of Skin Optical Parameters for Real-Time Hyperspectral Imaging Applications using GPGPU Parallel Computing

Hyperspectral imaging with a high spatial and spectral resolution can be used to analyze materials using spectroscopic methods. This can be applied on skin as a general purpose real-time diagnostic...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A memory access model for highly-threaded many-core architectures

A number of highly-threaded, many-core architectures hide memory-access latency by low-overhead context switching among a large number of threads. The speedup of a program on these machines depends...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Algorithms for Compression on GPUs

This project seeks to produce an algorithm for fast lossless compression of data. This is attempted by utilisation of the highly parallel graphic processor units (GPU), which has been made easier t...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Performance Drawbacks for Matrix Multiplication using Set Associative Cache in GPU devices

Performance of shared memory processors show negative performance impulses (drawbacks) in certain regions for execution of the basic matrix multiplication algorithm. In this paper we continue with ...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Synthesis of Custom Networks of Heterogeneous Processing Elements for Complex Physical System Emulation

Physical system models that consist of thousands of ordinary differential equations can be synthesized to field-programmable gate arrays (FPGAs) for highly-parallelized, real-time physical system e...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Fast and Flexible: Parallel Packet Processing with GPUs and Click | hgpu.org

Fast and Flexible: Parallel Packet Processing with GPUs and Click | Computer science, CUDA, nVidia, nVidia GeForce GTX 480, Package, Software router, String matching
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Towards a Distributed GPU-Accelerated Matrix Inversion | hgpu.org

Towards a Distributed GPU-Accelerated Matrix Inversion | Algorithms, Computer science, CUDA, Factorization, Matrix inversion, nVidia, Tesla C2050, Tesla M2050
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Towards Path Tracing in Games | hgpu.org

Towards Path Tracing in Games | 3D Graphics and Realism, Algorithms, Computer science, CUDA, nVidia, nVidia GeForce GTX 470, Raytracing, Rendering
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Accelerating Random Forests on CPUs and GPUs for Object-Class Image Segmentation | hgpu.org

Accelerating Random Forests on CPUs and GPUs for Object-Class Image Segmentation | Computer science, Computer vision, CUDA, Machine learning, nVidia, nVidia GeForce GTX 480, nVidia GeForce GTX 690, nVidia GeForce GTX Titan, Package, Tesla K20, Thesis...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Programming Dense Linear Algebra Kernels on Vectorized Architectures | hgpu.org

Programming Dense Linear Algebra Kernels on Vectorized Architectures | Algorithms, Computer science, Intel Phi, Linear Algebra, Thesis
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

OpenCL Programming Guide for Mac

OpenCL (Open Computing Language) is an open standard for cross-platform, programming of modern highly-parallel processor architectures. Introduced withOS X v10.6,OpenCL consists of a C99-based prog...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

SOCL: An OpenCL Implementation with Automatic Multi-Device Adaptation Support

To fully tap into the potential of today's heterogeneous machines, offloading parts of an application on accelerators is not sufficient. The real challenge is to build systems where the application...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Unified Framework for Multi-Sensor HDR Video Reconstruction

One of the most successful approaches to modern high quality HDR-video capture is to use camera setups with multiple sensors imaging the scene through a common optical system. However, such systems...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Transfer Time Reduction of Data Transfers between CPU and GPU

In real-time video processing data transfer between CPU and GPU is a time critical action; time spent transferring data is processing time lost. Several variants of standard transfer methods were d...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Domain-Specific Language and Compiler for Stencil Computations on Short-Vector SIMD and GPU Architectures

Stencil computations are an integral part of applications in a number of scientific computing domains, such as image processing and partial differential equations. We describe a domain-specific lan...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Median Based Parallel Steering Kernel Regression for Image Reconstruction

Image reconstruction is a process of obtaining the original image from corrupted data. Applications of image reconstruction include Computer Tomography, radar imaging, weather forecasting etc. Rece...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

A Shader Library for OpenGL 4 and GLSL 4.3 Learning and Development

In the past decades, besides experiencing a huge development in terms of computation speed, we have also experienced the emergence of the Programmable GPU, giving birth to languages like GLSL and C...
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Encrypting video streams using OpenCL code on-demand | hgpu.org

Encrypting video streams using OpenCL code on-demand | Algorithms, Image processing, nVidia, nVidia GeForce GTX 550, OpenCL, Security, Video encoding
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Formal specification and verification of OpenCL Kernel optimization | hgpu.org

Formal specification and verification of OpenCL Kernel optimization | Computer science, nVidia, OpenCL, Optimization, Tesla S2050, Thesis
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Lossless LZW Data Compression Algorithm on CUDA | hgpu.org

Lossless LZW Data Compression Algorithm on CUDA | Algorithms, Compression, Computer science, CUDA, nVidia, nVidia GeForce GTX 580, nVidia GeForce GTX 680
more...
No comment yet.
Scooped by Mikael Bourges-Sevenier
Scoop.it!

Detecting Data Races on OpenCL Kernels with Symbolic Execution | hgpu.org

Detecting Data Races on OpenCL Kernels with Symbolic Execution | Computer science, OpenCL, Programming Languages
more...
No comment yet.