Get Free Shipping on orders over $79
Stream Processor Architecture : KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE - Scott Rixner

Stream Processor Architecture

By: Scott Rixner

Hardcover | 31 October 2001

At a Glance

Hardcover


$169.75

or 4 interest-free payments of $42.44 with

 or 

Ships in 5 to 7 business days

Media processing applications, such as three-dimensional graphics, video compression, and image processing, currently demand 10-100 billion operations per second of sustained computation. Fortunately, hundreds of arithmetic units can easily fit on a modestly sized 1cm2 chip in modern VLSI. The challenge is to provide these arithmetic units with enough data to enable them to meet the computation demands of media processing applications. Conventional storage hierarchies, which frequently include caches, are unable to bridge the data bandwidth gap between modern DRAM and tens to hundreds of arithmetic units. A data bandwidth hierarchy, however, can bridge this gap by scaling the provided bandwidth across the levels of the storage hierarchy.
The stream programming model enables media processing applications to exploit a data bandwidth hierarchy effectively. Media processing applications can naturally be expressed as a sequence of computation kernels that operate on data streams. This programming model exposes the locality and concurrency inherent in these applications and enables them to be mapped efficiently to the data bandwidth hierarchy. Stream programs are able to utilize inexperience local data bandwidth when possible and consume expensive global data bandwidth only when necessary.
Stream Processor Architecture presents the architecture of the Imagine streaming media processor, which delivers a peak performance of 20 billion floating-point operations per second. Imagine efficiently supports 48 arithmetic units with a three-tiered data bandwidth hierarchy. At the base of the hierarchy, the streaming memory system employs memory access scheduling to maximize the sustained bandwidth of external DRAM. At the center of the hierarchy, the global stream register file enables streams of data to be recirculated directly from one computation kernel to the next without returning data to memory. Finally, local distributed register files that directly feed the arithmetic units enable temporary data to be stored locally so that it does not need to consume costly global register bandwidth. The bandwidth hierarchy enables Imagine to achieve up to 96% of the performance of a stream processor with infinite bandwidth from memory and the global register file.
Industry Reviews
`I can recommend Stream Processor Architecture to every engineer and researcher interested in this subject. The book should be interesting to anyone working on processing of data, where latency and accuracy are less important than speed.'
IEEE Communications December 2003

More in Microprocessors

The Scaling Era : An Oral History of AI, 2019-2025 - Dwarkesh Patel
Digital Design and Computer Architecture : ARM Edition - Sarah Harris
Digital Design and Computer Architecture : 2nd Edition - Sarah Harris
Programming 32-bit Microcontrollers in C : Exploring the PIC32 - Lucio Di Jasio
Node.js for Embedded Systems - Kelsey Breseman

RRP $57.00

$30.75

46%
OFF
Making Things Smart - Gordon F. Williams

RRP $66.75

$34.99

48%
OFF
Robot Magic : Beginner Robotics for the Maker and Magician - Mario Marchese
Microprocessor Technology - J S Anderson

RRP $103.00

$91.75

11%
OFF
Microcontroller Theory and Applications : HC12 and  S12 - Daniel Pack