Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Isabella cluster nodes are divided into multiple queues according to the architecture and purpose:

  • a12a28.q
    • queue for all jobs with no limitations
    • servers HP ProLiant SL250s (12)
    • parallel jobs can spread exclusively within individual nodes because Infiniband network is not available to them 
  • a16.q
    • queue for all jobs with no limitations
    • servers HP ProLiant SL250s
    • Lenovo NeXtScale nx360 M5
  • gpu.X.q
    • queue for jobs that use graphics processors (GPUs) with a maximum duration of 7 days
    • servers Dell EMC PowerEdge C4140
    p20.q
    • queue for parallel jobs with a maximum duration of 7 days
    • servers HP ProLiant SL230s Gen8
  • p28.q
    • queue for parallel jobs with a maximum duration of 7 days
    • servers Lenovo NeXtScale nx360 M5
  • p28-big.q
    • queue for parallel jobs
      • require number of cores that is a multiple of number 28 and
      • with a maximum duration of 7 days
    • servers Lenovo NeXtScale nx360 M5
  • p28-long.q
    • queue for parallel jobs with a maximum duration of 30 days
    • servers Lenovo NeXtScale nx360 M5
  • test.q
    • queue for the needs of application testing with a maximum duration of 24 hours
    • servers HP ProLiant SL250s
  • vsmp.q
    • queue for the use of ScaleMP vSMP nodes or jobs that require a lot of RAM or processing cores on a single node
    • servers HP ProLiant SL230s Gen8.

The server's technical specifications are available on the official Isabella computer cluster webpage.

Default job queue is a16qa28q. Selection of job queue is done by setting the appropriate parallel environment.

...

Parallel environments available on Isabella are assigned to queues:

  • a12a28.q
    • a12-mpisingle – all cores are assigned on a single node
    a16.q
    • a16-a28-mpi – general parallel jobs
    • a16a28-mpisingle – all assigned cores are on a single node
    • a16a28-mpifull - cores are assigned as a multiple of number 16 28 (whole nodes are reserved)
  • gpu.X.q
    • gpu – general jobs that require graphics processors (GPUs)
    • gpusingle – all GPUs are assigned to a single node
    • gpufull - GPUs are assigned as a multiple of number 4 (whole nodes are reserved)
    p20.q
    • p20-mpi – general parallel jobs
    • p20-mpisingle – all assigned cores are on a single node
    • p20-mpifull - cores are assigned as a multiple of number 20 (whole nodes are reserved)
  • p28.q
    • p28-mpi – general parallel jobs
    • p28-mpisingle – all assigned cores are on a single node
    • p28-mpifull - cores are assigned as a multiple of number 28 (whole nodes are reserved)
  • p28-big.q
    • p28-mpifull - cores are assigned as a multiple of number 28 (whole nodes are reserved)
  • p28-long.q
    • p28-mpi-long – general parallel jobs
    • p28-mpisingle-long – all assigned cores are on a single node
    • p28-mpifull-long - cores are assigned as a multiple of number 28 (whole nodes are reserved)
  • vsmp.q
    • vsmp – cores are assigned to ScaleMP vSMP nodes
  • test.q
    • test – access to test node

Jobs can be assigned to any queue by command :

...