Sciweavers

Optimizing for parallelism and data locality
CLUSTER
2008
IEEE
16 years 1 month ago
OpenMP-centric performance analysis of hybrid applications
Several performance analysis tools support hybrid applications. Most originated as MPI profiling or tracing tools and OpenMP capabilities were added to extend the performance a...
Karl Fürlinger, Shirley Moore
EDOC
2004
IEEE
15 years 10 months ago
Write Once, Deploy N: A Performance Oriented MDA Case Study
To focus the comparison of languages for model checking and transformation on criteria that matter in practical development, there is an urgent need for more, and more realistic, ...
Pieter Van Gorp, Dirk Janssens, Tracy Gardner
ENTCS
2002
15 years 6 months ago
Verified Code Generation for Embedded Systems
Digital signal processors provide specialized SIMD (single instruction multiple data) operations designed to dramatically increase performance in embedded systems. While these ope...
Sabine Glesner, Rubino Geiß, Boris Boesler
IEEEPACT
2006
IEEE
16 years 20 days ago
Compiling for stream processing
This paper describes a compiler for stream programs that efficiently schedules computational kernels and stream memory operations, and allocates on-chip storage. Our compiler uses...
Abhishek Das, William J. Dally, Peter R. Mattson
CASES
2007
ACM
15 years 10 months ago
Stack size reduction of recursive programs
For memory constrained environments like embedded systems, optimization for program size is often as important, if not more important, as optimization for execution speed. Commonl...
Stefan Schäckeler, Weijia Shang