![]() |
ITS currently operates and maintains a 42-node beowulf cluster. This cluster was implemented to augment the current computational infrastructure ITS supports for the research needs of the Brock community. |
|
Hardware Main Node -> beowulf.ac.brocku.ca
Compute Nodes -> node2.brockwulf.ca to node42.brockwulf.ca
File System All beowulf
compute nodes have a local file system (CentOS release 5.4) with a NFS
mounted /home directory
swap
Access Secure shell access is available to the main node from the local .brocku.ca network eg: Access to each compute node
is through the main node. PBS Brock's beowulf uses PBS (Portable Batch System) as its resource manager to ensure a balanced use of the available resources and also to submit jobs. MAUI is used to schedule the jobs through PBS to ensure fair usage. The main node (beowulf.ac.brocku.ca) is used for logging in and compiling programs. The compute nodes are used for CPU intensive computations (node2.brockwulf.ca to node42.brockwulf.ca). PLEASE DO NOT RUN CPU intensive programs on the main node as this affects other users and their ability to logon and use Brock's Beowulf cluster. The PBS environment is configured to use a "whole pool" approach where all jobs are submitted to PBS through a single queue (workq). THe MAUI scheduler then evaluates each job and decides when it can run. The following example scripts can be used as a templates for requesting resources through PBS. PBS Example Scripts A single pbs job -> single_job.pbs A pvm pbs job -> pvm_job.pbs A MPICH MPI pbs job -> mpich-mpi_job.pbs A LAM MPI pbs job -> lam-mpi_job.pbs An interactive pbs session that brings up a xterm window-> xterm_ssh.pbs PBS Commands You can then submit your jobs to PBS using the qsub command. qsub -V single_job.pbsFor a 7 processor LAM-MPI job use: qsub -l nodes=7:ppn=1 lam-mpi_job.pbsTo see all the jobs in the queue use: qstat -nTo see a list of jobs you have submitted: qstat -nu login_idOr a specific job qstat -n 2533To delete a job, use qdel and specify the job number: qdel 2533Type man qsub for more information or look at the PBS documentation Job Stdout and Stderr PBS directs the stdout and stderr from running jobs into temporary files in the users $HOME directory. job#.beowulf.OU job#.beowulf.ER Upon job completion these are copied to $PBS_O_WORKDIR
(where the job was started from) to files specifed in pbs submit script
by the If you would like to see the stdout and stderr of a running job you can use the
qjob_ou and qjob_er commands and specify the job number: qjob_ou 2533 qjob_er 2533These commands are equivilant to the standard less text paging command. Type man less for more information. Utilities
C3 tools The Cluster Command Control (C3) tools
are a suite of cluster tools developed at Oak Ridge National Laboratory that are useful
for executing commands on cluster nodes such as file distribution and gathering, and
process query and termination.
cget - retrieves files or directories from all cluster nodes
cpush - distribute files or directories to all cluster nodes
ckill - terminates a user specified process on all cluster nodes
crm - remove files or directories from all cluster nodes Compilers and Libraries Brockwulf has Intel's C/C++ (icc) and Fortran (ifc) compilers installed along with the GNU familiy of compilers (gcc, g77). You can use the following optimization flags with the Intel compilers -O2 -xW -tpp7which will optimize your code for the P4 processor For the GNU family of compilers you can use -O2 -march=pentium4Intel's Math Kernel Library is also avaliable and provides vendor optimized LAPACK and BLAS libraries. They can be linked using -L/opt/intel/mkl70cluster/lib/32/ -lmkl_lapack -lmkl_p4 -lguide -lpthreadBoth versions 3.0.1 and 2.1.5 of the FFTW implementation of DFT are avaliable on Brockwulf.
Additional Information The official beowulf FAQ: http://www.canonical.org/~kragen/beowulf-faq.txt Programming environments LAM/MPI: http://www.lam-mpi.org/. PVM (Parallel Virtual Machine): http://www.epm.ornl.gov/pvm/. Local Intel PDFs containing information on:
Local additional PBS documentation: pbs
system monitoring: http://beowulf.ac.brocku.ca/ganglia disk usage monitoring: http://beowulf.ac.brocku.ca/usage |