Aug 23rd, 2013

Fighting with slurm on n0011. After changing the executables to the old(er) version, replacing the corresponding libraries, and fixing passwordless ssh to the node, n0011 appears in the sinfo list.

Next targets are

  • fix slurm.conf to allow more than one job to run on n0011 simultaneously ⇒ Done
  • update NAMDjob to allow (semi)-automatic usage of new node ⇒ Done
  • Use LSI WebBIOS to prepare a RAID 1 (mirroring) ⇒ Done
  • format n0011's disks and export them as /home2 with mode 1777, to be used as a second writing device ⇒ Done
  • Try to export n0011's RAID to all nodes (not just norma) ⇒ Done
  • Add n0011 to the various scripts (load, shutdown, …)
  • Find a suitable CUDA-enabled card for the box
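
Note on the 1777 mode used for /home2: it means world-writable with the sticky bit set, the same as /tmp, so anyone can create files but only the owner (or root) can delete or rename them. A minimal Python sketch to decode the bits; it does not touch n0011 or /home2, and the helper name describe_mode is made up for illustration:

```python
import stat


def describe_mode(mode: int) -> str:
    """Return the ls-style permission string for a directory with these mode bits."""
    # S_IFDIR marks the entry as a directory so filemode() prints a leading 'd'.
    return stat.filemode(stat.S_IFDIR | mode)


if __name__ == "__main__":
    # Sticky, world-writable directory (what 1777 asks for).
    print(describe_mode(0o1777))
    # World-writable directory without the sticky bit, for comparison.
    print(describe_mode(0o0777))
```

The trailing "t" in the first string is the sticky bit; without it any user could remove other users' files from the shared directory.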

Should it be RAID 0 instead of RAID 1?
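
On the RAID 0 vs RAID 1 question, the trade-off is usable capacity (and striped throughput) against surviving a disk failure. A rough sketch of the arithmetic, assuming identical disks; the disk count, the 2 TB size, and the function name raid_summary are placeholders, not n0011's actual hardware:

```python
def raid_summary(level: int, n_disks: int, disk_tb: float) -> dict:
    """Usable capacity and tolerated disk failures for RAID 0 or RAID 1."""
    if n_disks < 2:
        raise ValueError("need at least two disks")
    if level == 0:
        # Striping: all capacity is usable, but losing any one disk loses the array.
        return {"usable_tb": n_disks * disk_tb, "failures_tolerated": 0}
    if level == 1:
        # Mirroring: every disk holds a full copy, so capacity equals one disk.
        return {"usable_tb": disk_tb, "failures_tolerated": n_disks - 1}
    raise ValueError("only RAID 0 and RAID 1 are handled")


if __name__ == "__main__":
    # Hypothetical example: two 2 TB disks.
    for level in (0, 1):
        print(f"RAID {level}:", raid_summary(level, n_disks=2, disk_tb=2.0))
```

If /home2 only holds scratch output that can be regenerated, RAID 0 may be acceptable; otherwise RAID 1 keeps the data through a single disk failure.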

maintenance/aug_23rd_2013.txt · Last modified: 2013/08/26 12:33 (external edit)