Run-time parallelization is often the only way to execute the code in parallel when data dependence information is incomplete at compile time. This situation is common in many imp...
Abstract. We introduce a collection of high-performance kernels for basic linear algebra. The kernels encapsulate small fixed-size computations in order to provide building blocks fo...
Designer productivity has become the key factor in the development of electronic systems. Increased reuse of design data is widely recognized as a promising...
This paper introduces an analysis technique, commutativity analysis, for automatically parallelizing computations that manipulate dynamic, pointer-based data structures. Commutati...
Compression with Reversible Embedded Wavelets (CREW) is a unified lossless and lossy continuous-tone still image compression system. It is wavelet-based using a "reversible" a...
A. Zandi, James D. Allen, Edward L. Schwartz, Mart...