You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 5 Next »

Job queues

Isabella cluster nodes are divided into multiple queues according to the architecture and purpose :

  • a12.q
    • queue for all jobs with no limitations
    • servers HP ProLiant SL250s (12)
    • parallel jobs can spread exclusively within individual nodes because Infiniband network is not available to them 
  • a16.q
    • queue for all jobs with no limitations
    • servers HP ProLiant SL250s
  • gpu.X.q
    • queue for jobs that use graphical processors (GPUs) with a maximum duration of 7 days
    • servers Dell EMC PowerEdge C4140
  • p20.q
    • queue for parallel jobs with a maximum duration of 7 days
    • servers HP ProLiant SL230s Gen8
  • p28.q
    • queue for parallel jobs with a maximum duration of 7 days
    • servers Lenovo NeXtScale nx360 M5
  • p28-big.q
    • queue for parallel jobs that  
      • require number of cores is the multiple of number 28 and
      • duration up to 7 days
    • servers Lenovo NeXtScale nx360 M5
  • p28-long.q
    • queue for parallel jobs with a maximum duration of 30 days
    • servers Lenovo NeXtScale nx360 M5
  • test.q
    • queue for the needs of application testing with a maximum duration of 24 hours
    • servers HP ProLiant SL250s
  • vsmp.q
    • queue for the use of ScaleMP vSMP nodes or jobs that require a lot of RAM or processing cores on a single node
    • servers HP ProLiant SL230s Gen8.

The server's technical specifications are available on the official Isabella computer cluster webpage.

Default job queue is a16q. Selection of job queue is done by setting the appropriate parallel environment.

Parallel environments

Parallel environment is an SGE mechanism that enables multi-core processing for the needs of parallel jobs. Since SGE does not have the option of detailed core division on nodes, Isabella cluster has three parallel environments defined: 

  • *mpi

    • SGE will try to assign processes on the same node

    • when a node is at full capacity, processes will be assigned to the next node

  • *mpisingle

    • SGE will assign all processes on the same node

    • number of requested cores must not exceed the number of cores on a node

  • *mpifull

    • SGE will reserve the whole node

    • number of cores must be a multiple of the cores available on a node

Parallel environments available on Isabella are assigned to queues:

  • a12.q
    • a12-mpisingle – all cores are assigned on a single node
  • a16.q
    • a16-mpi – general parallel jobs
    • a16-mpisingle – all assigned cores are on a single node
    • a16-mpifull - cores are assigned as a multiple of number 16 (whole nodes are reserved)
  • gpu.X.q
    • gpu – general jobs that require graphical processors (GPUs)
    • gpusingle – all GPUs are assigned to a single node
    • gpufull - GPUs are assigned as a multiple of number 4 (whole nodes are reserved)
  • p20.q
    • p20-mpi – general parallel jobs
    • p20-mpisingle – all assigned cores are on a single node
    • p20-mpifull - cores are assigned as a multiple of number 20 (whole nodes are reserved)
  • p28.q
    • p28-mpi – general parallel jobs
    • p28-mpisingle – all assigned cores are on a single node
    • p28-mpifull - cores are assigned as a multiple of number 28 (whole nodes are reserved)
  • p28-big.q
    • p28-mpifull - cores are assigned as a multiple of number 28 (whole nodes are reserved)
  • p28-long.q
    • p28-mpi-long – general parallel jobs
    • p28-mpisingle-long – all assigned cores are on a single node
    • p28-mpifull-long - cores are assigned as a multiple of number 28 (whole nodes are reserved)
  • vsmp.q
    • vsmp – cores are assigned to ScaleMP vSMP nodes
  • test.q
    • test – access to test node

Jobs can be assigned to any queue by command :

-pe *mpi X

or for mpisingle job:

-pe *mpisingle X

  • No labels