DRAFT Recommendations

Petaflops Architecture WorkShop
Oxnard, California
April 22-25, 1996
by: Dr. Thomas Sterling, Senior Staff Scientist
Center of Excellence in Space Data and Information Sciences
NASA Goddard Space Flight Center

RECOMMENDATIONS

The Petaflops Architecture WorkShop identified many critical areas in which advances can and should be made to enhance our understanding of potential approaches and reduce the risk to achieving petaflops computing capability in the earlierst possible time frame. While a large number of specific possible activities were considered, a few were recognized as key to overall progress or appropriate for immediate action. The following section highlights the most significant recommendations. Section 2 presents a few important actions that can be undertaken immediately and would have important consequences for this emerging discipline. Finally, Section 3 provides details of many additional recommendations, most of which are rather specific but which should be performed at some point in the near future.

1.0 MAJOR RECOMMENDATIONS

EVALUATE AN EVOLUTIONARY COTS BASED MIMD PETAFLOPS ARCHITECTURE:
Conduct a detailed design study of a MIMD petaflops architecture implemented with commercial off-the-shelf components available in 2014. This will provide the important baseline against which to assess all more radical architecture approaches.

PERFORM DETAILED APPLICATIONS STUDIES: :
Develop a set of petaflops scale applications and characterize them in detail to expose operational factors that determine performance. Evaluate and project the performance of the three archtypical architectures including COTS-based MIMD, PIM, and Hybrid-Technology Multi-threaded using the petaflops application test suite.

DEMONSTRATE 100 K CONCURRENT THREADS: :
Through analysis and simulation, demonstrate that applications of economic and societal value can expose more than 100 thousand-way concurrency by supplying computing demands to between 100 K and 1 M parallel threads. Show that a petaflops architecture can manage and exploit this level of parallelism.

DEVELOP TECHNIQUES FOR LATENCY MANAGEMENT AT PETAFLOPS SCALE COMPUTING: :
Pursue approaches in hardware, software, and algorithms for minimizing the impact of system latency on performance. Develop a new family of algorithmic techniques for latency tolerance. Derive system structures that minimize average latency for a broad range of applications. Devise hardware mechanisms for efficient hiding of latency. Incorporate latency management strategies and policies in resource management system software.

DESIGN AND EVALUATE AN ENHANCED PIM PETAFLOPS ARCHITECTURE: :
Extend the results of the PIM architecture point design study to incorporate cache-only-memory-architecture (COMA) and possible latency management mechanisms for general purpose processing and evaluate through simulation.

DESIGN AND EVALUATE A HYBRID TECHNOLOGY PETAFLOPS ARCHITECTURE: :
Extend the results of the HTMT architecture point design study to achieve a detailed system design and evaluate through simulation. This will provide the basis for a future prototyping initiative.

ADVANCE SUPERCONDUCTING RSFQ TECHNOLOGY: :
Accelerate research in superconducting technology with an emphasis on rapid single flux quantum logic (RSFQ) to determine its potential as a basis for petaflops architecture and to provide the technological means for implementing a petaflops computer. Critical areas include RSFQ cache memory, submicron fabrication of LSI density superconducting devices, and high-bandwidth low-power external interconnect.

EXPLORE ALGORITHMS FOR SPECIAL PURPOSE AND RECONFIGURABLE STRUCTURES: :
Study algorithms to identify opportunities for performance acceleration through special purpose structures. By clustering similar algorithms, examine the possibility of exploiting reconfigurable logic to enhance generality of specialized architecture hardware.

CONDUCT STUDY OF OPTICAL MEMORY TECHNOLOGY FOR PETAFLOPS APPLICATIONS: :
Conduct a detailed analysis of the potential of optical holographic memory for significant reduction of semiconductor main memory. Identify key technological advances required to enable this technology as a practical storage mechanism.

ESTABLISH PROGRAMMATIC FRAMEWORK FOR ADVANCING GOALS OF PETAFLOPS COMPUTING: :
Required are formal procedures for capturing results of intermediate studies, assessing implications and directions, and establishing new inter-agency collaborative initiatives.


2.0 IMMEDIATE ACTIONS

3.0 DETAILED RECOMMENDATIONS

Go To:


lpicha:5/30/96