Sciweavers

2226 search results - page 346 / 446
» Fault-Tolerant Parallel Applications with Dynamic Parallel S...
ASPLOS
2011
ACM
14 years 10 months ago
On-the-fly elimination of dynamic irregularities for GPU computing
The power-efficient massively parallel Graphics Processing Units (GPUs) have become increasingly influential for scientific computing over the past few years. However, their ef...
Eddy Z. Zhang, Yunlian Jiang, Ziyu Guo, Kai Tian, ...
ICPP
2009
IEEE
16 years 28 days ago
Group Operation Assembly Language - A Flexible Way to Express Collective Communication
The implementation and optimization of collective communication operations is an important field of active research. Such operations directly influence application performance...
Torsten Hoefler, Christian Siebert, Andrew Lumsdai...
IWOMP
2009
Springer
16 years 25 days ago
A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures
OpenMP has evolved recently towards expressing unstructured parallelism, targeting the parallelization of a broader range of applications in the current multicore era. Ho...
Eduard Ayguadé, Rosa M. Badia, Daniel Cabre...
ICDCS
2008
IEEE
16 years 22 days ago
Mobility-Assisted Spatiotemporal Detection in Wireless Sensor Networks
Wireless sensor networks (WSNs) deployed for mission-critical applications face the fundamental challenge of meeting stringent spatiotemporal performance requirements using nodes w...
Guoliang Xing, Jianping Wang, Ke Shen, Qingfeng Hu...
ISCAS
2005
IEEE
15 years 12 months ago
Hyperblock formation: a power/energy perspective for high performance VLIW architectures
Architectures based on Very Long Instruction Word (VLIW) processors are an optimal choice in the attempt to obtain high performance levels in mobile devices. The effectiveness ...
Giuseppe Ascia, Vincenzo Catania, Maurizio Palesi,...