Sciweavers

8315 search results - page 1325 / 1663
» Performance Evaluation for Global Computation
Sort
View
HPCA
1997
IEEE
15 years 11 months ago
Architectural Support for Compiler-Synthesized Dynamic Branch Prediction Strategies: Rationale and Initial Results
This paper introduces a new architectural approach that supports compiler-synthesized dynamic branch predication. In compiler-synthesized dynamic branch prediction, the compiler g...
David I. August, Daniel A. Connors, John C. Gyllen...
HPCA
1997
IEEE
15 years 11 months ago
Datapath Design for a VLIW Video Signal Processor
This paper represents a design study of the datapath for a very long instruction word (VLIW) video signal processor (VSP). VLIW architectures provide high parallelism and excellen...
Andrew Wolfe, Jason Fritts, Santanu Dutta, Edil S....
166
Voted
ICPP
1997
IEEE
15 years 11 months ago
Communication in Parallel Applications: Characterization and Sensitivity Analysis
Communication characterization of parallel applications is essential to understand the interplay between architectures and applications in determining the maximum achievable perfo...
Dale Seed, Anand Sivasubramaniam, Chita R. Das
IPPS
1997
IEEE
15 years 11 months ago
Enhancing Software DSM for Compiler-Parallelized Applications
Current parallelizing compilers for message-passing machines only support a limited class of data-parallel applications. One method for eliminating this restriction is to combine ...
Peter J. Keleher, Chau-Wen Tseng
ICS
1997
Tsinghua U.
15 years 11 months ago
Eliminating Cache Conflict Misses through XOR-Based Placement Functions
This paper makes the case for the use of XOR-based placement functions for cache memories. It shows that these XOR-mapping schemes can eliminate many conflict misses for direct-ma...
Antonio González, Mateo Valero, Nigel P. To...
« Prev « First page 1325 / 1663 Last » Next »