Exploiting Multimedia Extensions in C++: A Portable Approach
Found in: Computing in Science & Engineering
By Pierre Esterie, Mathias Gaunard, Joel Falcou, Jean-Thierry Lapresté
Issue Date:September 2012
pp. 72-77
Single instruction, multiple data (SIMD) extensions have been a feature of choice for processor manufacturers for a couple of decades. Designed to provide significant accelerations, they require expertise, the use of potentially fragile compiler support, o...
 
Parallel Biological Sequence Comparison on Heterogeneous High Performance Computing Platforms with BSP++
Found in: Computer Architecture and High Performance Computing, Symposium on
By Khaled Hamidouche, Fernando M. Mendonca, Joel Falcou, Daniel Etiemble
Issue Date:October 2011
pp. 136-143
Biological Sequence Comparison is an important operation in Bioinformatics that is often used to relate organisms. Smith and Waterman proposed an exact algorithm (SW) that compares two sequences in quadratic time and space. Due to high computing and memory...
 
Three High Performance Architectures in the Parallel APMC Boat
Found in: Parallel and Distributed Methods in Verification, 2010 Ninth International Workshop on, and High Performance Computational Systems Biology, Second International Workshop on
By Khaled Hamidouche, Alexandre Borghi, Pierre Esterie, Joel Falcou, Sylvain Peyronnet
Issue Date:October 2010
pp. 20-27
Approximate probabilistic model checking, and more generally sampling based model checking methods, proceed by drawing independent executions of a given model and by checking a temporal formula on these executions. In theory, these methods can be easily ma...
 
Algorithmic Skeletons within an Embedded Domain Specific Language for the CELL Processor
Found in: Parallel Architectures and Compilation Techniques, International Conference on
By Tarik Saidani, Joel Falcou, Claude Tadonki, Lionel Lacassagne, Daniel Etiemble
Issue Date:September 2009
pp. 67-76
Efficiently using the hardware capabilities of the Cell processor, a heterogeneous chip multiprocessor that uses several levels of parallelism to deliver high performance, and being able to reuse legacy code are real challenges for application developers. ...
 
Parallel Programming with Skeletons
Found in: Computing in Science and Engineering
By Joel Falcou
Issue Date:May 2009
pp. 58-63
Parallel programming is bound to become the main concern of software developers in the coming decades. Various models aim to solve this tension, trading efficiency for abstraction or vice versa, but how about getting both?
 
Boost.SIMD: generic programming for portable SIMDization
Found in: Proceedings of the 2014 Workshop on Programming models for SIMD/Vector processing (WPMVP '14)
By Jean-Thierry Lapresté, Joel Falcou, Mathias Gaunard, Pierre Estérie
Issue Date:February 2014
pp. 1-8
SIMD extensions have been a feature of choice for processor manufacturers for a couple of decades. Designed to exploit data parallelism in applications at the instruction level, these extensions still require a high level of expertise or the use of potenti...
     
Boost.SIMD: generic programming for portable SIMDization
Found in: Proceedings of the 21st international conference on Parallel architectures and compilation techniques (PACT '12)
By Brigitte Rozoy, Jean-Thierry Lapresté, Joel Falcou, Mathias Gaunard, Pierre Estérie
Issue Date:September 2012
pp. 431-432
SIMD extensions have been a feature of choice for processor manufacturers for a couple of decades. Designed to exploit data parallelism in applications at the instruction level and provide significant accelerations, these extensions still require a high le...
     
Hybrid bulk synchronous parallelism library for clustered SMP architectures
Found in: Proceedings of the fourth international workshop on High-level parallel programming and applications (HLPP '10)
By Daniel Etiemble, Joel Falcou, Khaled Hamidouche
Issue Date:September 2010
pp. 55-62
This paper presents the design and implementation of BSP++, a C++ parallel programming library based on the Bulk Synchronous Parallelism model to perform high performance computing on both SMP and SPMD architectures using OpenMP and MPI. We show how C++ s...
     