Você está na página 1de 7

 Software:

Sun HPC ClusterTools

Ubiquitous Parallel Computing

Ferhat Hatay, Rolf vandeVaart, Josh Simons

Sun Microsystems, Inc.

Abstract well as in commercial computing circles has


awaited the maturation and stabilization of a
Sun HPC ClusterTools Software is a robust software development environment.
complete software environment to support The Message Passing Interface (MPI)
parallel high performance and technical standard was developed in the early 1990s
computing applications ubiquitously over by a forum of hardware and software
single as well as networked Sun computer vendors and developer community. They
system solutions. Sun HPC ClusterTools created a platform- independent
Software has long offered industry standard programming paradigm that could
programming interfaces along with support realistically achieve scalable high
for multiple communication protocols and performance on different system
mixed-parallel programming paradigms. architectures. Over the years, the MPI
Consequently, the Sun high performance standard has achieved unprecedented
computing (HPC) user community has industry-wide adoption among hardware and
developed and deployed their parallel software vendors, as well as scientific and
applications without being land-locked in
engineering application developers. The
superficial technology enclaves. The newly
original MPI functionality was later
released Sun HPC ClusterTools 4 Software
extended through the additional features
introduces an open and expandable
introduced by the MPI-2 standard. Most
architecture with the new Loadable Protocol
recently, mixed programming techniques
Modules enabling the seamless integration
of existing and future network protocols. have been gathering interest, in which MPI
Parallel user applications remain unaffected distributed parallel techniques are used in
with no performance compromises conjunction with thread-parallel and shared-
throughout. memory parallel techniques such as those
employed in the OpenMP standard.
Sun readily provides high performance and
1. Introduction technical computing (HPTC) customers with
The potential power and economic appeal of powerful and scalable Symmetric
parallel computing can hardly be questioned. Multiprocessing (SMP) compute servers
However, the wide acceptance of parallel with up to 64 CPUs. HPTC clusters of Sun
computing in research and development as SMP systems enabled by Sun HPC

1
ClusterTools Software provide further safety feature of Sun MPI allows hybrid
scalability and performance levels to meet programming techniques to be applied
the most challenging high performance within a single application.
computing demands of research,
In this paper, the key features in the recently
development, design, and production
released Sun HPC ClusterTools 4 Software
communities involved in the art and science
are discussed. The outstanding capabilities
of computational modeling in their
of Sun MPI, Sun's native implementation of
respective fields.
the MPI standard, are also covered for the
Sun HPC ClusterTools Software provides a benefit of parallel application developers
complete development environment for the who are making strategic choices for
analysis, debugging, and performance programming paradigms, system
monitoring of parallel MPI applications on architectures, and high performance
Sun system solutions, which are based on clustering infrastructures.
the UltraSPARC processor architecture
In addition to providing the source code
and the Solaris Operating Environment.
under the Sun Community Source Code
Sun HPC ClusterTools Software enables the
License (SCSL) program, Sun HPC
deployment of distributed parallel
ClusterTools 4 Software also introduces a
applications with continuous and portable
new open and expandable architecture
scalability from one to thousands of
framework to support network protocols of
processes. The components included in the
the present and future within one robust
Sun HPC ClusterTools Software package
software development environment. This
are:
paper is an introduction to the new
• Sun Cluster Runtime Environment (CRE) architecture as well as an invitation to the
• Sun MPI communication library high performance computing developer
community for collaboration.
• Prism graphical programming
environment
• Sun Scalable Scientific Subroutine 2. What is new in Sun HPC
Library (S3L) ClusterTools 4 Software?
• Sun Parallel File System (PFS) Sun HPC ClusterTools 4 software is the new
release following the Sun HPC ClusterTools
The Sun MPI library is the core component
3.1 software in product offerings. The key
of the Sun HPC ClusterTools toolkit. It is a
new features in the Sun HPC ClusterTools 4
highly optimized, native MPI Software include:
implementation that includes most of the
extended MPI-2 standard. A notable • Scalability to 2048 processes and 64
exception is MPI one-sided communication nodes (SMPs).
between SMP nodes, which will appear in a Sun HPC ClusterTools 4 supports the
later release. The Sun MPI library includes a execution of parallel MPI applications with
full, native implementation of the MPI I/O up to 2048 processes, running across up to
part of the MPI-2 standard extensions which 64 computational nodes where each node
allows access to the Sun Parallel File System can be an SMP of any size.
from Sun MPI applications. The thread-

2
• Support for UltraSPARC III based Sun S3L library now features new direct and
systems. iterative sparse matrix solvers, linear
programming and equity option pricing
The Sun S3L (Scalable Scientific Subroutine
functions as well as additional mathematical
Library) provides a set of parallel and
transformations.
scalable functions widely used in scientific
and engineering computing. The Sun S3L • Support for dynamically spawned MPI
library now includes support for the processes in the Sun Prism graphical
TM

UltraSPARC III processor architecture along programming environment.


with the UltraSPARC II architecture in the The spawn feature of the MPI 2 standard
same library, and transparently provided to allows programmers to create sets of related
the user applications. processes. Sun HPC ClusterTools 4 includes
• Support for Loadable Protocol Modules. a new feature that allows the developers to
debug codes using MPI_SPAWN
Sun HPC ClusterTools Software supports
functionality of the MPI 2 standard.
multiple communication protocols to
achieve scalable performance on parallel • Improved security, error logging and
high performance computing applications. corefile handling services in Sun CRE.
With the modularity introduced by Loadable Sun CRE is a cluster administration and job
Protocol Modules, new network and launching facility. With the Sun HPC
communication fabrics are supported ClusterTools 4 Software release, the errors
natively by the Sun MPI framework without detected by Sun CRE can now be localized
requiring a recompilation or relinking of the to the node level with messages and
Sun MPI library or the existing parallel user recommended actions for appropriate
applications. owners. Sun CRE, as are all components in
• Infrastructure to support the use of Sun HPC ClusterTools 4 Software, is
Remote Shared Memory (RSM) protocol scalable to 2048 processes over 64 nodes
within Sun MPI over the next generation with fast parallel job execution startup and
Sun interconnect hardware. shutdown operations. New adaptive time-out
functionality in Sun CRE introduces
The RSM protocol provides direct and low-
variability with job size for improved fault
latency memory-to-memory communication
and error identification.
between cluster nodes over RSM-capable
interconnects. Applications demanding low- • Sun HPC ClusterTools 4 / 3.1 software
latency communication framework will coexistence.
benefit from the direct remote memory Users are able to have both Sun HPC
access capability in a high performance ClusterTools 3.1 software and Sun HPC
cluster over RSM capable interconnects. Sun ClusterTools 4 software installed on the
HPC ClusterTools 4 Software provides built- same system and can run either version
in support from the Sun MPI library for next under the Solaris 8 Operating Environment.
generation Sun interconnect hardware Consequently, customers can provisionally
through the Loadable Protocol Module upgrade a cluster to Sun HPC ClusterTools 4
technology. software while keeping the option of
• Enhancements and additions to Sun S3L returning to Sun HPC ClusterTools 3.1
parallel performance library. software if desired. User applications that

3
are compiled and linked with Sun HPC the library at any given time. In addition,
ClusterTools 3.1 release will work under the both 32- and 64-bit versions of the library
new Sun HPC ClusterTools 4 Software are included with Sun HPC ClusterTools and
without any changes. Sun MPI programs may be debugged and
• Free Web download. tuned with the Prism parallel development
environment.
Sun HPC ClusterTools 4 Software is now
available for free download from the Web Figure 1 illustrates the relationship between
directly for unlimited use. the components of the Sun HPC
ClusterTools Software and other software
development products available from Sun.
3. Thread-safety: support for
multiple programming
paradigms
Two primary high performance computing
programming models are supported in the
Sun environment: a single-process model
and a multiprocess model. The single-
process model includes all types of Figure 1 Sun HPC ClusterTools Software in
multithreaded applications. These may be context
automatically parallelized by Sun's high
performance compilers using parallelization
directives (e.g., OpenMP) or explicitly 4. Support for multiple network
parallelized with user-inserted Solaris or protocols: open and expandable
POSIX threads. The multiprocess model, Loadable Protocol Module
which is the topic of this paper, supports the Architecture
MPI standard for parallel applications that
run both on single SMPs and on clusters of Sun MPI, which currently supports
SMPs or thin nodes. It should be noted that a applications spanning up to 64 compute
third, hybrid model is also supported: the nodes and consuming up to 2048 processes,
mixing of threads and MPI parallelism to includes extensive optimizations for running
create applications that use MPI for both within and between SMP nodes.
communication between cooperating Versions of Sun HPC ClusterTools Software
processes and threads within each process. prior to the Release 4 included code for
Such codes may make most efficient use of communication between processes across
the capabilities of individual SMP nodes in TCP networks, shared memory interfaces
the high performance cluster environment. (SHM), and remote shared memory (RSM).
Hybrid parallel programming paradigm can To accommodate communication across a
only be considered in a thread-safe wider variety of networks and interfaces,
framework. Sun HPC ClusterTools 4 has extracted the
communications code into individual
Sun MPI is fully thread-safe with locking Loadable Protocol Modules that can be
pushed as low as possible within the called by the main Sun MPI library. Each
implementation to allow for concurrency: protocol module enables Sun HPC
multiple user threads may be active within ClusterTools communication across a

4
network with a different communications The architecture of the Sun MPI library is
protocol. shown in Figure 2. The lowest level of the
Protocol modules offer several advantages library, called the Protocol Module Layer,
over embedded code: includes support for several communication
mechanisms. When communication between
• New communications protocols can be
two processes in a Sun MPI job is initiated,
developed and supported without the
the library chooses the most efficient
need for a release of the Sun MPI library
transport pathway between those two
and the Sun CRE software.
processes. For example, two processes co-
• No recompilation or relinking is located on a single SMP node will
necessary for existing Sun MPI parallel communicate via shared memory segments
applications to use the new and additional under control of the shared-memory protocol
dynamic Loadable Protocol Modules. module, whereas two processes on different
• Individual protocol modules can be nodes will use the protocol module
patched without patching the entire Sun corresponding to the best available network
MPI Library. connection between those nodes. These
connection decisions are made automatically
on a pairwise basis at run time.

The TCP protocol module allows Sun MPI


jobs to run across any available TCP-capable
interconnects. While this allows the use of
numerous commodity interconnects offering
a wide variety of interconnect bandwidths, it
does not address the issue of low-latency
communication that is of interest for many
classes of HPTC applications.
Sun HPC ClusterTools 4 Software includes a
Remote Shared Memory (RSM) protocol
module for low-latency communications.
RSM protocol module addresses the latency
issue by allowing nodes connected with
capable interconnects to bypass the Solaris
Figure 2 Sun MPI architecture Operating Environment and perform MPI
data transfers with user-space load and store
operations. The Dynamic Loadable Protocol
Module open architecture allows the
development and integration of other low-
latency protocol modules within the Sun
MPI framework.

5
5. Sun HPC ClusterTools Sun 6. Myrinet2000 Loadable Protocol
Community Source License Module Development
(SCSL) program Myricom, Inc. has been developing a
In addition to being available as a supported dynamically Loadable Protocol Module
product, the source code for the Sun HPC under Sun Community Source License
ClusterTools Software has been available to program for their latest Myrinet2000
the HPTC community available through interconnect. Because these modules are
Sun's Community Source License (SCSL) loaded at run time by the standard Sun MPI
program since November 1999. This library, users of such third-party modules do
mechanism provides free access to source not need access to Sun HPC ClusterTools
code to those individuals and organizations source code to make use of this capability.
who wish to experiment with or contribute Furthermore, ISV applications readily
to the development of these tools. compiled and certified with the current
version of Sun HPC ClusterTools may also
In the new release 4, Sun HPC ClusterTools take advantage of such third-party
Software introduces an open software interconnects without re-compilation or re-
architecture to support different network linking.
protocols natively under the ubiquitous Sun
MPI framework as explained in the
previous section. In addition to the standard 7. Summary
Protocol Modules for shared memory,
The Sun HPC ClusterTools toolkit is a full-
remote shared memory, and TCP, the Sun
featured suite that supports the development
MPI supports the concept of dynamically
and the execution of high performance,
Loadable Protocol Modules.
distributed-memory, parallel applications
Under the Sun HPC ClusterTools Software across all Sun system solutions. The toolkit
SCSL program, third-party interconnect currently supports applications that span up
vendors can develop Loadable Protocol to 64 SMP nodes and that contain up to
Modules and provide native support for their 2048 processes. Sun's MPI implementation
communication hardware and software is shared-memory aware, thread-safe, and
within the Sun MPI framework. The open supports a low-latency RSM communication
architecture of Loadable Protocol Module protocol over capable interconnects.
enables such partners to integrate their Scalable, parallel I/O is supported from MPI
hardware and low-level interconnect applications. In addition, the Sun HPC
software with the Sun MPI library to offer ClusterTools Software includes a robust
the full capabilities of their interconnect debugging, performance analysis, and data
under the Sun HPTC software and stack. visualization environment. In the new
Users of Sun HPTC solutions, in return, release 4, Sun HPC ClusterTools Software
experience a stable and continuous software introduces an open software architecture to
environment, protecting their existing support multiple different network protocols
software investments while allowing natively under the ubiquitous Sun MPI
upgrades of their cluster interconnects with framework.
new solutions.

6
8. Further Information Copyright © 2001 Sun Microsystems, Inc., 901
San Antonio Road, Palo Alto, California
The following references provide additional 94303, U.S.A. All rights reserved.
technical detail on a number of the
capabilities covered in this paper. Sun Microsystems, Inc. has intellectual
property rights relating to technology
Information about Sun HPC ClusterTools embodied in this product. In particular, and
Software product can be found at this without limitation, these intellectual
location: property rights may include one or more of
the U.S. patents listed at
http://www.sun.com/software/hpc
http://www.sun.com/patents and one or more
additional patents or pending patent
applications in the U.S. and other countries.
Sun HPC ClusterTools Software can be
downloaded for unlimited use from: This product is distributed under licenses
restricting its use, copying distribution,
http://www.sun.com/ \
and decompilation. No part of this product
software/hpc/tryandbuy.html may be reproduced in any form by any means
without prior written authorization of Sun
and its licensors, if any.
The following page contains a selection of
Third-party software, including font
HPC-related whitepapers, including a
technology, is copyrighted and licensed from
technical whitepaper on Sun PFS and Sun suppliers.
additional information on S3L:
Sun, Sun Microsystems, the Sun logo, Solaris,
http://www.sun.com/ \ Sun HPC ClusterTools, and Prism are
software/solutions/hpc/docs/ trademarks or registered trademarks of Sun
Microsystems, Inc. in the U.S. and other
countries.
Sun Community Source web site for the Sun
HPC ClusterTools toolkit provides All SPARC trademarks are used under license
and are trademarks or registered trademarks
information on how to join the Sun HPC
of SPARC International, Inc. in the U.S. and
ClusterTools community and how to
other countries. Products bearing SPARC
download the source code. trademarks are based upon architecture
http://www.sun.com/ \ developed by Sun Microsystems, Inc.

software/solutions/hpc/communitysource

Sun's primary site for online documentation


contains the full manual set for the Sun
HPC ClusterTools Software is:
http://docs.sun.com

Você também pode gostar