Parallel Computing
Sub-System
The Parallel Computing Hardware
The Parallel Computing Sub-System is the “performance
provider” of SMART-PIV. It consists of two major modules:
- The Parallel Computing Hardware
- The Parallel Computing SW
| The Parallel Computing Hardware
(Figure 1) aims at providing a high-performance,
scalable platform for the parallel evaluation of acquired images
and the parallel numerical analysis simulation. In order to
achieve this, the Parallel Computing HW is built upon a new
philosophy based on the use of commodity components to build
high-performance and cost-efficient parallel computing platforms.
This new approach assembles commodity PCs and interconnects
them with the use of a widely used interconnection network to
implement a so called PC Cluster. |
|
Figure 1: Photo of SMART-PIV
Parallel Computing HW |
PC Clusters are an increasingly popular parallel
computing platform, having already dominated the small and medium-scale
parallel applications and constantly penetrating the large-scale
ones. In November 2002, it was reported that 18% of the 500 most
powerful supercomputers worldwide were PC clusters, while in June
2003 this percentage had grown to 29.8%.
There are two key features of PC Clusters, responsible
for their success and popularity:
Commodity HW enables for cost-effective, up-to-date systems to be
built that are easily scalable and provide high computational power.
Note that using the most up-to-date processors one can always deliver
the highest possible performance to the parallel applications, whereas
this is not the case for vendor-built parallel machines. When a
new parallel architecture project is launched by a manufacturer,
the latest microprocessor is used to design and implement the configuration.
However, at the time the architecture reaches the market (at least
six months later) this CPU has become rather obsolete.
Open-Source SW extensively used in computer clusters (Linux OS,
GCC compiler, MPI parallel library), keeps the installation cost
low and provides ease of maintenance.
PC Clusters provide the highest performance to cost
ratio, and are consequently the most popular solution for small
and medium-scale parallel applications. As far as the computational
needs of the SMART-PIV application are concerned (image processing
and numerical analysis), it is believed that a Linux Cluster is
the most suitable parallel architecture. Built upon the latest processor
technology, the SMART-PIV cluster is expected to deliver the required
performance. Due to its scalability, more processing nodes can be
added at any time in the future, further contributing to performance
if necessary. It is also a fact that, at the same cost, no other
supercomputing infrastructure can compete with the performance and
scalability offered by a PC Cluster. It seems that PC Clusters are
the unique affordable solution for similar small to medium-scale
applications, similar to the image processing and numerical analysis
for SMART-PIV.
The Parallel Computing HW has already been implemented.
The basic components of the platform are:
4 Processing nodes:
| Processor type |
Intel Pentium 4 2.66GHz, 533MHz FSB |
| Memory |
512MB DDR PC2700-333MHz |
| Motherboard |
QDI Superb 4LE-6AL |
| Chipset |
SiS 651 |
| NIC |
3Com 3C905CX-TX-M PCI 10/100 BaseTX |
| Hard disk |
Western Digital WDC400BB 40GB, 7200 rpm, UDMA/100 |
| Power supply |
300W |
1
File Server:
| Processor type |
Dual Intel Xeon 2.4GHz (533MHz FSB) |
| Memory |
2x 512MB DDR 266MHz ECC Registered |
| Motherboard |
Supermicro X5DPA-GG-O |
| Chipset |
Intel E7501 |
| NIC |
2x Intel 82541 Gigabit Ethernet (onboard) |
| Hard disk |
Western Digital WDC400BB 40GB, 7200 rpm, UDMA/100 |
| Power supply |
400W |
Interconnection Network:
| Fast Ethernet Switch |
3Com, 3C1670108 , Office Connect Switch with Gigabit uplink
(8*10/100/1000BaseT) |
| Gigabit Ethernet Cards |
Intel 82541 Gigabit Ethernet (onboard) |
| Fast Ethernet Cards |
3Com 3C905CX-TX-M PCI 10/100 BaseTX |
The general design is shown in Figure 2.
The interconnection network between the processing nodes is Fast
Ethernet. The Cluster of the processing nodes is connected to the
File Server with a Gigabit Ethernet link. The link between the File
Server and the Recording PC is Gigabit Ethernet as well.
A typical operational scenario would require the
camera attached at the Recording PC to record images at a rate of
15-60 Mbytes/sec. These images are transferred directly to the File
Server where they are made available for the cluster processing
nodes. The nodes read the images from the File.
Server perform image processing and evaluation write
the results (vector fields) back to the File Server for further
processing and/or visualization.

Figure 2: Design of Parallel Computing HW |