Sciweavers

2048 search results - page 211 / 410
» Optimizing pipelines for power and performance
Sort
View
PLDI
2005
ACM
16 years 3 days ago
Code placement for improving dynamic branch prediction accuracy
Code placement techniques have traditionally improved instruction fetch bandwidth by increasing instruction locality and decreasing the number of taken branches. However, traditio...
Daniel A. Jiménez
HPCA
2000
IEEE
15 years 11 months ago
Design of a Parallel Vector Access Unit for SDRAM Memory Systems
We are attacking the memory bottleneck by building a “smart” memory controller that improves effective memory bandwidth, bus utilization, and cache efficiency by letting appl...
Binu K. Mathew, Sally A. McKee, John B. Carter, Al...
MICRO
2000
IEEE
72views Hardware» more  MICRO 2000»
15 years 6 months ago
PipeRench implementation of the instruction path coprocessor
This paper demonstrates how an Instruction Path Coprocessor (I-COP) can be efficiently implemented using the PipeRench reconfigurable architecture. An I-COP is a programmable on-c...
Yuan C. Chou, Pazhani Pillai, Herman Schmit, John ...
CVPR
2012
IEEE
13 years 9 months ago
Beyond spatial pyramids: Receptive field learning for pooled image features
In this paper we examine the effect of receptive field designs on classification accuracy in the commonly adopted pipeline of image classification. While existing algorithms us...
Yangqing Jia, Chang Huang, Trevor Darrell
ICCD
2007
IEEE
746views Hardware» more  ICCD 2007»
16 years 3 months ago
Hardware design of a Binary Integer Decimal-based floating-point adder
Because of the growing importance of decimal floating-point (DFP) arithmetic, specifications for it are included in the IEEE Draft Standard for Floating-point Arithmetic (IEEE P75...
Charles Tsen, Sonia Gonzalez-Navarro, Michael J. S...