| High Performance Computing Trends and Self Adapting Numerical Software | p. 1 |
| Kilo-instruction Processors | p. 10 |
| CARE: Overview of an Adaptive Multithreaded Architecture | p. 26 |
| Numerical Simulator III - A Terascale SMP-Cluster System for Aerospace Science and Engineering: Its Design and the Performance Issue | p. 39 |
| Code and Data Transformations for Improving Shared Cache Performance on SMT Processors | p. 54 |
| Improving Memory Latency Aware Fetch Policies for SMT Processors | p. 70 |
| Tolerating Branch Predictor Latency on SMT | p. 86 |
| A Simple Low-Energy Instruction Wakeup Mechanism | p. 99 |
| Power Performance Trade-Offs in Wide and Clustered VLIW Cores for Numerical Codes | p. 113 |
| Field Array Compression in Data Caches for Dynamically Allocated Recursive Data Structures | p. 127 |
| FIBER: A Generalized Framework for Auto-tuning Software | p. 146 |
| Evaluating Heuristic Scheduling Algorithms for High Performance Parallel Processing | p. 160 |
| Pursuing Laziness for Efficient Implementation of Modern Multithreaded Languages | p. 174 |
| SPEC HPG Benchmarks for Large Systems | p. 189 |
| Distribution-Insensitive Parallel External Sorting on PC Clusters | p. 202 |
| Distributed Genetic Algorithm for Inference of Biological Scale-Free Network Structure | p. 214 |
| Is Cook's Theorem Correct for DNA-Based Computing? | p. 222 |
| LES of Unstable Combustion in a Gas Turbine Combustor | p. 234 |
| Grid Computing Supporting System on ITBL Project | p. 245 |
| A Visual Resource Integration Environment for Distributed Applications on the ITBL System | p. 258 |
| Development of Remote Visualization and Collaborative Visualization System in ITBL Grid Environment | p. 269 |
| Performance of Network Intrusion Detection Cluster System | p. 278 |
| Constructing a Virtual Laboratory on the Internet: The ITBL Portal | p. 288 |
| Evaluation of High-Speed VPN Using CFD Benchmark | p. 298 |
| The Development of the UPACS CFD Environment | p. 307 |
| Virtual Experiment Platform for Materials Design | p. 320 |
| Ab Initio Study of Hydrogen Hydrate Clathrates for Hydrogen Storage within the ITBL Environment | p. 330 |
| RI2N - Interconnection Network System for Clusters with Wide-Bandwidth and Fault-Tolerancy Based on Multiple Links | p. 342 |
| A Bypass-Sensitive Blocking-Preventing Scheduling Technique for Mesh-Connected Multicomputers | p. 352 |
| Broadcast in a MANET Based on the Beneficial Area | p. 360 |
| An Optimal Method for Coordinated En-route Web Object Caching | p. 368 |
| An Improved Algorithm of Multicast Topology Inference from End-to-End Measurements | p. 376 |
| Chordal Topologies for Interconnection Networks | p. 385 |
| Distributed Location of Shared Resources and Its Application to the Load Sharing Problem in Heterogeneous Distributed Systems | p. 393 |
| Design and Implementation of a Parallel Programming Environment Based on Distributed Shared Arrays | p. 402 |
| Design and Implementation of Parallel Modified PrefixSpan Method | p. 412 |
| Parallel LU-decomposition on Pentium Streaming SIMD Extensions | p. 423 |
| Parallel Matrix Multiplication and LU Factorization on Ethernet-Based Clusters | p. 431 |
| Online Remote Trace Analysis of Parallel Applications on High-Performance Clusters | p. 440 |
| Performance Study of a Whole Genome Comparison Tool on a Hyper-Threading Multiprocessor | p. 450 |
| The GSN Library and FORTRAN Level I/O Benchmarks on the NS-III HPC System | p. 458 |
| Large Scale Structures of Turbulent Shear Flow via DNS | p. 468 |
| Molecular Dynamics Simulation of Prion Protein by Large Scale Cluster Computing | p. 476 |
| OpenMP/MPI Hybrid vs. Flat MPI on the Earth Simulator: Parallel Iterative Solvers for Finite Element Methods | p. 486 |
| Performance Evaluation of Low Level Multithreaded BLAS Kernels on Intel Processor Based cc-NUMA Systems | p. 500 |
| Support of Multidimensional Parallelism in the OpenMP Programming Model | p. 511 |
| On the Implementation of OpenMP 2.0 Extensions in the Fujitsu PRIMEPOWER Compiler | p. 523 |
| Improve OpenMP Performance by Extending BARRIER and REDUCTION Constructs | p. 529 |
| OpenMP for Adaptive Master-Slave Message Passing Applications | p. 540 |
| OpenGR: A Directive-Based Grid Programming Environment | p. 552 |
| Author Index | p. 565 |
| Table of Contents provided by Blackwell. All Rights Reserved. |