|
¡@
The Computational Molecular Science (CMS)
cluster computing facility has been built up with funding from a variety
of sources at The University of Queensland, including DVC(Research),
BACS and EPSA faculties, IMB, AIBN, and core user group contributions.
The strategy for its usage has been to advance research at UQ in the
general area of molecular computations. In line with this strategy, the
facility is generally available for usage by the research groups of the
Associate
Investigators of CCMS.
¡@
¡@
The CMS cluster facility consists of three distributed memory
clusters and two shared memory machines. Below are the technical
specifications of the currently installed hardware. The first
cluster is used by the CCMS and the research groups of its associate
investigators at UQ. The second cluster was funded primarily through
a Qld Government Smart State Grant, applied for by the Hypersonics
centre at UQ with Smith as a CI, and is mainly used by researchers
from the Hypersonics centre and the CCMS. The third cluster is a
90-node version of the cluster above, with upgraded opteron
processors (2.4GHz) and disk (73GB per node). 63 of these nodes are
being acquired by the CCMS and 27 are being acquired for the
research group of Federation Fellow Professor Alan Mark.
¡@
1.1 Cluster
Overview-Hardware
The specifications of
the currently installed machines are as follows:
Xeon
Cluster:
-
128 Sun Fire
v20z servers
-
CPU: 2 x Xeon
(32-bit) Processor (2.8GHz)
-
Physical
memory:3 GB RAM
-
Disk space: 1
x 36GB SCSI Hard Disk
-
Network
connection:2 x Gigabit Ethernet interfaes
2.2GHz
Opteron Cluster:
-
66 Sun Fire
v20z servers
-
CPU: 2 x AMD
Opteron 248 Processors (2.2GHz)
-
Physical
memory: 4GB RAM
-
Disk space: 1
x 36 GB SCSI Hard Disk
-
Network
connection: 2 x Gigabit Ethernet interfaces
-
Service
Processor: basic remote administration through IMPI (Intelligent
Platform Management Interface).
2.4GHz
Opteron Cluster:
-
90 Sun Fire
v20z servers
-
CPU: 2 x AMD
Opteron 250 Processors (2.4 GHz)
-
Physical
Memory: 4GB RAM
-
Disk space: 1x
73 GB SCSI Hard Disk
-
Network
connection: 2x Gigabit Ethernet interfaces
-
Service
Processor: basic remote administration through IMPI (Intelligent
Platform Management Interface).
The above
hard ware has been arranged into the following 4 clusters:
¡@
HPC Cluster: Giza
Giza is used to run
Gaussian, Molpro and Gamess calculations and mainly single processor
jobs using home grown Fortran and C/C++ codes.
-
Compute nodes: 92 Sun
Fire v60x servers
-
CPU: 2 x Xeon (32-bit)
processor (2.8GHz)
-
Physical memory: 3GB RAM
-
Disk space: 1 x 36GB
SCSI Hard Disk
-
Network connection: 2 x Gigabit Ethernet Interfaces
-
Batch and scheduling
system: Sun Grid Engine, Enterprise edition 5.3
-
Operating System: Linux
¡@ |
 |
|
¡P
32
compute nodes have been taken out of the cluster due to limitations in
space and cooling. They can be used individually if the need arises but
not as part of the cluster. This current setup is likely to change in
future when more space and appropriate cooling facilities become
available.
HPC Cluster: Blackhole
The calculations run
on Blackhole are mainly home grown parallel MPI codes in Fortran and
C/C++. A few nodes are set aside for single processor calculations
like Molpro, Gamess and home-grown codes.
-
Compute nodes: 33 Sun Fire v20z
servers
-
CPU: 2 x
AMD Opteron 248 Processors (2.2GHz)
-
Physical
memory: 4GB RAM
-
Disk space: 1 x 36 GB SCSI Hard Disk
-
Compute nodes: 60 Sun Fire v20z severs
-
CPU: 2x AMD Opteron 250 Processors (2.4GHz)
-
Physical
memory: 4GB RAM
-
Disk space: 1x 73 GB SCSI Hard Disk
-
Network
connection: 2 x Gigabit Ethernet Interfaces
-
Service
Processor: basic remote administration through IMPI (Intelligent
Platform Management Interface)
-
Batch
and scheduling system: Sun Grid Engine, Enterprise edition 5.3
-
Operating System: Linux
|
 |
| ¡@ |
¡@ |
¡@
HPC Cluster: Grape
-
Compute nodes: 30 Sun
Fire v20z Servers
-
CPU: 2 x AMD Opteron 250
Processors (2.4 GHz)
-
Physical memory: 4 GB RAM
-
Disk space: 1 x 73 GB
SCSI Hard Disk
-
Network connection: 2 x
Gigabit Ethernet Interfaces
-
Service
Processor: basic remote administration through IMPI (Intelligent
Platform Management Interface)
-
Operating
System: Linux
¡@
HPC Cluster: Baby-Blackhole
This is a separate
cluster which is housed by the Hypersonics Centre. This cluster will be
used as a test cluster for testing not just codes but also new operating
systems and cluster configurations. This will enable us to test new
systems first before we port them to the other 3 clusters.
-
Compute nodes: 33 Sun
Fire v20z Servers
-
CPU: 2 x AMD Opteron 248
Processors (2.4 GHz)
-
Physical memory: 4 GB RAM
-
Disk space: 1 x 36 GB
SCSI Hard Disk
-
Network connection: 2 x
Gigabit Ethernet Interfaces
-
Service
Processor: basic remote administration through IMPI (Intelligent
Platform Management Interface)
-
Operating
System: Linux
¡@
Shared Memory
Workstation: Redback
-
Sun Fire
280R
-
CPU: 2 x
UltraSPARC III 1.2GHz
-
Physical
memory: 8GB
-
Disk
space: 2 x 73GB SCSI Hard Disk
-
Network connection: Gigabit Ethernet Interface
-
Remote
System Control (RSC) module
-
Operating system: Solaris 9
|
 |
¡P
¡@
Shared Memory
Workstation: Huntsman
-
Sun Fire
V880
-
CPU: 8 x
UltraSPARC III 1.2GHz
-
Physical memory: 32GB
-
Disk space: 6 x 73GB SCSI Hard Disk
-
Network connection: Gigabit Ethernet Interface
-
Remote
System Control (RSC) module
-
Operating system: Solaris 9
|
 |
¡P
Other Hardware
1.2 Cluster Overview-Set Up
Giza, Blackhole, Grape, Huntsman and Redback are located at the CCMS
in the Chemistry building (Baby-Blackhole is housed in the
Hypersonics centre in the Mechanical Engineering building). The
clusters and memory machines are housed in two separate neighbouring
rooms. The first room houses, the 3 racks of Blackhole, Redback and
Huntsman. The second room houses the 3 racks of Giza, the one rack
of Grape, the switch, the 3 desktops running the queues, the
firewall and the front ends for Giza, Blackhole and Grape.

The cluster network is on an internal network behind a fire wall
which controls the access to our clusters from the out side world.
The only access from the outside world is to Giza's, Blackhole's and
Grape's front end or Redback. There is no direct access from the
outside to any of the other hardware on the internal network. Users
can access either Giza's or Blackhole's or Grape's front end. The
front end of the clusters is where users have their home
directories, can edit and compile their codes and submit their jobs
to the cluster. Users have no access to the compute nodes.

Giza has 92 compute nodes, 32 to each rack (rack3 has 4 additional
nodes that are used for testing and for network services like DHCP
and image server), which are named and numbered Sphinx001 to
Sphinx092. Blackhole has 93 compute nodes, 33 in rack1 (these are
the nodes with the 2.2GHz processors and the 36GB SCSI disks) and 30
in each of rack2 and rack3 (these are the nodes with the 2.4GHz
processors and the 73GB SCSI disks), which are named and numbered
Star01 to Star93. Grape has 30 compute nodes which are named and
numbered Merlot01 to Merlot30. Each cluster is managed by a separate
queueing system running the Sun Grid Engine Enterprise version 5.3
software. Below is a sketch of the principle set up of the three
clusters and the two SMP machines.

Users can only submit jobs to the Sphinx compute nodes from Giza, to
the Star compute nodes from Blackhole and to the Merlot compute
nodes from Grape. The home directories on Blackhole are mounted on
all the Star nodes, the home directories on Giza are mounted on all
the Sphinx nodes and the home directories on Grape are mounted on
all the Merlot nodes. This enables users to read and write from and
to their home directories during a calculation. For large temporary
files all nodes (Sphinx, Star and Merlot) have 21GB (or 53GB) of
local scratch space available which can be accessed by a job running
on the node.
¡@
Huntsman and Redback are stand alone machines. Users have their home
directories local on each machine and also run jobs directly on
Huntsman or Redback. There is no queueing system as both machines
only have a very small and exclusive user community. The SMP
machines are utilised to perform large memory calculations employing
Gaussian, Molpro or Gamess.
¡@
We recently acquired a server (funnelweb) which houses additional
temporary file space for Huntsman and Redback. This server is
currently being set up. |