Frontiers96 Banner

The Frontiers'96 Final Program

NASA LogoIEEE Logo
October 27 - 31, 1996
Lowes Annapolis Hotel, Annapolis, Maryland, USA
The Sixth Symposium on the Frontiers of Massively Parallel Computation

Sponsored by IEEE Computer Society. In cooperation with NASA Goddard Space Flight Center and USRA/CESDIS

Table of Contents

Introduction

Frontiers '96 features two workshops in areas of rapidly growing interest to the high-performance computing community. The topics of the workshops are 1) The Petaflops Frontier, and 2) Domain Specific Systems. These day and a half workshops are organized to provide a forum for presenting the most recent advances across a broad range of related topics within these interdisciplinary fields. Mixed with the presentations are open discussions on key topics from emerging technologies that may impact future directions to the policies establishing those directions. For more information contact the Program Chair at frontiers96@cesdis.gsfc.nasa.gov.
Frontiers '96 is the sixth in a series of meetings on massively parallel computation, focusing on research related to systems scalable to many hundreds of processors. The conference will include 34 original research papers surrounding the central theme of research related to the exploitation of massive parallelism, and any aspects of the design, analysis, development, and/or use of massively parallel computers. The realm of computing considered includes general purpose, domain specific, and special purpose systems and techniques. Other highlights include two panels, invited speakers, two pre-conference workshops, and conference banquet. Of special note are two sessions discussing interim results of eight Point Design Studies of high performance computing environments - awardees of the proposals solicited by the New Technologies Program in ASC (NSF), the Microelectronics Systems Architecture Program in MIPS (NSF), the Computer Systems Program in CCR (NSF), and in collaboration with DARPA and NASA.

Return to the Table of Contents

Technical Program Session and Panel Chairs

Tuesday, October 29

10:30 - 12:00 Concurrent Sessions

Session 2A: Scheduling 1, Gyungho Lee, University of Texas

Session 2B: Routing, Walter B. Ligon, III, Clemson University

1:30 -3:30 pm Concurrent Sessions

Session 3A: Applications & Algorithms, Phyllis E. Crandall, University of Connecticut

Session 3B: Petaflops Computing / Point Design Studies, Paul H. Smith, Department of Energy

4:00 -5:30 pm Panel Session

Panel Session How Do We Break the Barrier to the Software Frontier?, Paul Messina, California Institute of Technology

Wednesday, October 30

10:30 am - 12 pm Concurrent Sessions

Session 5A: Scheduling 2, Danny Z. Chen, University of Notre Dame

Session 5B: SIMD, David Schimmel, Georgia Institute of Technology

1:30 -3:30 pm Concurrent Sessions

Session 6A: I/O Techniques, Steven Hotovy, Cornell Theory Center

Session 6B: Memory Management, Larry Meadows, The Portland Group

4:00 -5:30 pm Panel Session

Panel Session Petaflops Alternative Paths, Paul Messina, California Institute of Technology

Thursday, October 31

10:30 am - 12 pm Concurrent Sessions

Session 8A: Synchronization, William Carlson, IDA Center for Computing Science

Session 8B: Networks, H. J. Siegel, Purdue University

1:30 pm -3:30 pm Concurrent Sessions

Session 9A: Performance Analysis, Tarek El-Ghazawi, George Washington University

Session 9B: Petaflops Computing / Point Design Studies, Wally Kleinfelder, Supercomputer Research Center

Return to the Table of Contents

Workshops

Sunday, October 27 1:00 - 5:00
Monday, October 28 8:30 - 5:00

Workshop A: The Petaflops Frontier - Parts 1 & 2

George Lake, University of Washington, Chair

The second workshop in this series explores the scaling properties of application algorithms, alternative architecture models, and device technology as they contribute to the feasibility of achieving computing performance in the regime of 10^15 operations per second. This is the only such workshop open to the general community and is an important forum for presenting new ideas.

SUNDAY

13:00 Welcome, Objectives: GL and LS

13:15 Keynote, TERA Looks at Peta: Issues in Compilers and Operating Systems Burton Smith, Tera Computer Company

14:00 An Introduction to Petaflop Point Design Studies --John von Rosendale, The National Science Foundation

14:15 Architecture and Technology, Session A; Larry Snyder, Chair

15:30 BREAK

15:55 Architecture and Technology Session B; Larry Snyder, Chair

18:00 ADJOURN

MONDAY

08:00 Coffee

09:00 Architecture and Technology Session C; Larry Snyder, Chair

10:15 BREAK

10:45 Panel of Architecture and Technology Issues, Larry Snyder, Chair

Panel participants: All Speakers so far.

11:45 Lunch

13:00 Applications and System Software Session; Peter Kogge, Chair

15:30 Break

16:00 Panel and Floor Discussion on any and all Petaflop Issues, George Lake

17:45 Closing Comments, GL and LS

18:00 ADJOURN WORKSHOP

Workshop B: Domain Specific Systems - Parts 1 & 2

Jose Fortes, Purdue University, Chair

This new series of workshops is intended to highlight systems architecture and software that exploit the opportunity of alternative structures and methods to achieve very high performance for possibly narrow ranges of applications. Topics include special purpose or embedded processors, reconfigurable architectures, SIMD, digital signal processors, image processors, data compression devices, and other application-driven designs.

SUNDAY, October 28

1.00 pm - 1.10 pm - Opening remarks, J. Fortes

1:10 pm - Session on Domain-specific hardware systems, J. Fortes, Chair

1:15 pm - 1:45 pm - Evaluating SIMD Arrays for Domain Specific Applications

Using ENPASSANT, M. Herbrordt and C. Weems, University of Houston and University of Massachusetts, USA.

1.45 pm - 2.15 pm - Efficient RMESH Algorithms for String Matching, H. C. Lee and F. Ercal, Dept. of Computer Science, University of Missouri-Rolla, USA.

2:15 pm - 2:45 pm - A 3.2 TOPS Enhanced Motion Estimation System, J. Berns and T. Noll, University of Technology RWTH Aachen, Germany.

2:45 pm - 3:00 pm BREAK

3:00 pm - 3:30 pm - Highly Parallel Computing for Solving Partial Differential Equations, I. Martin-Llorente and F. Tirado, Universidad Complutense, Spain.

3.30 pm - 4.00 pm - Parallel Programming by Data Queue Processing, P. Baglietto, M. Maresca, M. Migilardi, N. Zingirian, University of Genova and University of Padua, Italy.

4:00 pm - 4:30 pm - Session closing discussion and adjourn.

MONDAY, October 28

8:30 am - 10:30 am - Working session on Reconfigurable Architectures

Chair: Dennis Hawver, Annapolis Micro Systems

10:30 am - 10:45 am - Break

10:45 am - 12:30 pm - Working session on Reconfigurable Architectures

Moderator: Dennis Hawver, Annapolis Micro Systems

2:00 pm - Session on Domain-specific software and heterogeneous systems,

Jose Fortes, Chair

2:00 pm - 2:30 pm - Ct++: An Approach for a Retargetable High-Level Programming Environment for Image Processing Architecture, M. Pic, F. Bodin and H. Essafi, LETI and IRISA, France.

2:30 pm - 3:00 pm - An Application Domain Specific Language for Describing Board Games, J. Romein, H. Bal and D. Grune, Vrije Universiteit, The Netherlands.

3:00 pm - 3:30 pm - Products and Processes that Drive Domain-specific Architectures, L. Coglianese, LGA, Inc., USA

3:30 pm - 4:00 pm - Domain-specific Processing in the Context Heterogeneous Computing, H. J. Siegel, Purdue University, USA

4:00 pm - 5:00 pm - Floor discussion and Adjourn

Return to the Table of Contents

Technical Program

Tuesday, October 29

8:45 - 9:00 am

Opening Remarks: Thomas Sterling, JPL/California Institute of Technology

9:00 - 10:00 am

Session 1: Invited Speaker - John Hopson, Accelerated Strategic Computing Initiative (ASCI)
"From ASCI to Teraflops"


10:00 - 10:30 am Break

10:30 am - 12 noon Concurrent Sessions

Session 2A: Scheduling 1

Gang Scheduling for Highly Efficient Distributed Multiprocessor Systems,
H. Franke, IBM T. J. Watson Research Center, P. Pattnaik, L. Rudolph, Hebrew University

Integrating Polling, Interrupts, and Thread Management,
K. Langendoen, J. Romein, R. Bhoedjang, Vrije Universiteit

A Practical Processor Design for Multithreading,
L. Amamiya, T. Kawano, H. Tomiyasu, S. Kusakabe, Kyushu University

Session 2B: Routing

Deadlock-Free Path-Based Wormhole Multicasting in Meshes*,
E. Fleury, P. Fraigniaud, Ecole Normale Superieure de Lyon

Efficient Multicast in Wormhole-Routed 2D Mesh/Torus Multicomputers: A Network-Partitioning Approach.
S. Y. Wang, C. W. Ho, National Central University , Y. C. Tseng, Chung-Hua Polytechnic University

Turn Grouping for Supporting Efficient Multicast in Wormhole Mesh Networks,
K. P. Fan, C. T. King, National Tsing Hua University

12 noon - 1:30 pm Conference Luncheon - Ceremony to Transfer MPP from Goddard Space Flight Center to Smithsonian Institution
Jim Fischer, NASA Goddard Space Flight Center; Paul Ceruzzi, Curator, Smithsonian Institute

1:30 pm -3:30 pm Concurrent Sessions

Session 3A: Applications & Algorithms

A3: A Simple and Asymptotically Accurate Model for Parallel Computation,
V. Kumar, A. Grama, University of Minnesota, S. Ranka, University of Florida, V. Singh, IBM T. J. Watson Research Center

Fault Tolerant Matrix Operations Using Checksum and Reverse Computation,
Y. Kim, J. Plank, J. Dongarra, University of Tennessee

A Statistically-Based Multi-Algorithmic Approach for Load-Balancing Sparse Matrix Computations,
S. Nastea, O. Frieder, T. El-Ghazawi, George Washington University

Session 3B: Petaflops Computing / Point Design Studies

Pursuing a Petaflop: Point Designs for 100 TF Computers Using PIM Technologies,
P. M. Kogge, S. C. Bass, J. B. Brockman, D. Z. Chen, E. H. M. Sha, University of Notre Dame

Hybrid Technology Multi-Threaded Architecture,
G. Gao, McGill University, K. Likharev, SUNY, Stony Brook, P. Messina, California Institute of Technology, T. Sterling, JPL/Caltech

Design Studies on Petaflops Special-Purpose Hardware for Astrophysical Particle Simulations,
S. L.W. McMillan, Drexel University, P. Hut, Institute for Advanced Study, Princeton, J. Makino, University of Tokyo, M. L. Norman, University of Illinois at Urbana-Champaign, F. J. Summers, Princeton University

The Illinois Aggressive Cache-Only Memory Architecture Multiprocessor - (I-ACOMA)
J. Torrellas, D. Padua, University of Illinois at Urbana-Champaign

3:30 pm - 4:00 pm Break

4:00 pm - 5:30 pm Session 3B Panel Session - "How Do We Break the Barrier to the Software Frontier?"
Rick Stevens, Argonne National Laboratory, Panel Chair

5:30 pm - 7:00 pm Cocktail Party and Reception

Wednesday, October 30

9:00 - 10:00 am Session 4: Invited Speaker - Feng-Hsiung Hsu, IBM Research, TJ Watson Laboratory
"Parallelism in the Deep Blue Chess Automoton"


10:00 - 10:30 am Break

10:30 am - 12 noon Concurrent Sessions

Session 5A: Scheduling 2

Largest-Job-First-Scan-All Scheduling Policy for 2D Mesh-Connected Systems,
S. M. Yoo, University of Texas at Arlington

Scheduling for Large-Scale Parallel Video Servers,
M. Y. Wu, State University of New York at Buffalo

Effect of Variation in Compile Time Costs on Scheduling Tasks on Distributed Memory Systems,
S. Darbha, Rutgers University, S. Pande, University of Cincinnati

Session 5B: SIMD

Processor Autonomy and It's Effect on Parallel Program Execution,
D. Hawver, G. Adams III, Purdue University

Particle-Mesh Techniques on the MasPar,
C. Mobarry, NASA Goddard Space Flight Center, K. Olson, George Mason University

Solving Irregular Problems on SIMD Architectures,
M. Y. Wu, W. Shu, State University of New York at Buffalo

12 noon - 1:30 pm Lunch (on your own)

1:30 pm -3:30 pm Concurrent Sessions

Session 6A: I/O Techniques

Intelligent, Adaptive File System Policy Selection,
T. Madhyastha, D. Reed, University of Illinois, Urbana

An Abstract-Device Interface for Implementing Portable Parallel-I/O Interfaces,
R. Thakur, W. Gropp, E. Lusk, Argonne National Laboratory

PMPIO - A Portable Implementation of MPI-IO,
S. Fineberg, NAS/NASA Ames Research Center, P. Wong, B. Nitzberg, C. Kuszmaul, MRJ., Inc.

Disk Resident Arrays: An Array-Oriented I/O Library for Out-Of-Core Computations,
J. Nieplocha, Pacific Northwest National Laboratory, I. Foster, Argonne National Laboratory

Session 6B: Memory Management

Hardware-Controlled Prefeching in Directory-based Cache Coherent Systems,
W. Hu, P. Xia, Institute of Computing Technology, Academia Sinica

Preliminary Insights on Shared Memory PIC Code Performance on the Convex Exemplar SPP1000,
P. MacNeice, Hughes STX, C. Mobarry, NASA Goddard Space Flight Center, J. Crawford, USRA, Thomas Sterling, JPL/Caltech

Scalability of Dynamic Storage Allocation Algorithms,
A. Iyengar, IBM Thomas J. Watson Research Center

An Interprocedural Framework for Determining Efficient Data Redistributions in Distributed Memory Machines,
S. Gupta, S. Krishnamurthy, Ohio University

3:30 pm - 4:00 pm Break

4:00 pm - 5:30 pm Panel Session - "Petaflops Alternative Paths"
David Bailey, NASA Ames Research Center, Panel Chair

5:30 pm - 7:00 pm Conference Banquet:
Invited Speaker - John Toole, National Coordination Office for HPCC
"Direction and Goals in Advanced Computing - A Perspective on the National Agenda"

Thursday, October 31

9:00 - 10:00 am
Session 7: Invited Speaker - Steven Wallach, HP-Convex
"Independence Day"


10:00 - 10:30 am Break

10:30 am - 12 noon Concurrent Sessions

Session 8A: Synchronization

A Fair Fast Distributed Concurrent-Reader Exclusive-Writer,
T. Johnson, J. Yoon, University of Florida

Locks Improvement Technique for Release Consistency in Distributed Shared Memory Systems,
S. Fu, N. F. Tzeng, University of Southwestern Louisiana

A Quasi-Barrier Technique to Improve Performance of An Irregular Application,
H. Shah, J. Fortes, Purdue University

Session 8B: Networks

Performance Analysis and Fault Tolerance of Randomized Routing on Close Networks,
M. Bhatia, Bowie State University, A. Youssef, George Washington University

Performing BMMC Permutations in Two Passes Through the Expanded Delta Network and MasPar MP-2,
L. Wisniewski, Thinking Machines Corporation, T. Cormen, T. Sundquist, Dartmouth College

Macro-Star Networks: Efficient Low-Degree Alternatives to Star Graphs for Large-Scale Parallel Architectures,
C. H. Yeh, E. Varvarigos, University of California, Santa Barbara

12 noon - 1:30 pm Lunch (on your own)

1:30 pm -3:30 pm Concurrent Sessions

Session 9A: Performance Analysis

Modeling and Identifying Bottlenecks in EOSDIS,
J. Demmel, M. Ivory, S. L. Smith, University of California, Berkeley

Tools-Supported HPF and MPI Parallelization of the NAS Parallel Benchmarks,
C. Clemencon, W. Sawyer, K. M. Decker, J. Fritscher, B. J. N. Wylie, V. R. Deshpande, P. A. R. Lorenzo, R. Ruhl, Swiss Center for Scientific Computing, A. Endo, N. Masuda, F. Zimmerman, NEC European Supercomputer Systems

A Comparison of Workload Traces from Two Production Parallel Machines,
K. Windisch, V. Lo, University of Oregon, D. Feitelson, Hebrew University, B. Nitzberg, NASA Ames Research Center, R. Moore, San Diego Supercomputer Center

Morphological Image Processing on Parallel Machines,
H. J. Siegel, M. D. Theys, R. Born, M. Allemang, Purdue University

Session 9B: Petaflops Computing / Point Design Studies

MORPH: A Flexible Architecture for Executing Component Software at 100 TeraOPS,
A. A. Chien, R. K. Gupta, University of Illinois at Urbana-Champaign

Architecture, Algorithms and Applications for Future Generation Supercomputers,
V. Kumar, A. Sameh, University of Minnesota

Hierarchical Processors-and-Memory Architecture for High Performance Computing,
J. A.B. Fortes, R. Eigenmann, Purdue University, V. Taylor, Northwestern University
A Scalable-Feasible Parallel Computer Implementing Electronic and Optical Interconnections for 156 TeraOPS Minimum Performance,
S. G. Ziavras, H. Grebel, New Jersey Institute of Technology, A. T. Chronopoulos, Wayne State University

3:30 pm Adjourn

Return to the Table of Contents