Tips, tricks, FAQ
All the little things to save you time on the clusters
Deleting model_output/ (or any big folder) takes too long on the cluster
Yes, it takes ages because I/O on the cluster can be slow and there are many small files. If you are in a hurry, you can run:
mv model_output/ model_output_old
rm -r model_output_old &
The first command renames (moves) model_output/, which is practically instantaneous. You can now re-run something. To delete the renamed folder, run the second command. The & at the end makes it execute in the background.
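If you do this often, the two commands can be wrapped in a small shell function. This is only a sketch: the name quick_delete and the timestamped suffix are illustrative choices, not part of any cluster tooling. The suffix is there so that a leftover model_output_old from an earlier run does not get in the way of the rename.

```shell
# Hypothetical helper (name and suffix are assumptions): move a directory
# out of the way at once, then delete the renamed copy in the background.
quick_delete() {
    dir="${1%/}"    # strip one trailing slash, if present
    if [ ! -d "$dir" ]; then
        echo "not a directory: $dir" >&2
        return 1
    fi
    # Timestamp plus shell PID keeps the new name from clashing with an older one.
    old="${dir}_old_$(date +%Y%m%d_%H%M%S)_$$"
    mv -- "$dir" "$old" || return 1
    rm -r -- "$old" &    # the & runs the slow deletion in the background
}
```

Calling quick_delete model_output/ returns as soon as the rename is done, so the original name is free for the next run while the deletion continues.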
Use seff to analyze a job
After a job has run (whether it completed, was terminated, or failed), you may run:
seff JOB_ID
to find out how many resources your job used on its node, what caused it to terminate, and so on. If you don't remember the JOB_ID, look for the number in the filename of the Slurm log (slurm_{JOB_ID}.out).
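Recovering the JOB_ID from the log filename can also be scripted. The sketch below assumes the log is named exactly slurm_{JOB_ID}.out with a plain numeric ID, as described above; the function name job_id_from_log is made up for illustration.

```shell
# Hypothetical helper: print the job ID embedded in a Slurm log filename
# of the form slurm_{JOB_ID}.out (assumes a plain numeric JOB_ID).
job_id_from_log() {
    name="${1##*/}"    # drop any leading directory part
    id="${name#slurm_}"    # drop the "slurm_" prefix
    id="${id%.out}"    # drop the ".out" suffix
    case "$id" in
        ''|*[!0-9]*)
            echo "cannot find a job ID in: $1" >&2
            return 1
            ;;
        *)
            printf '%s\n' "$id"
            ;;
    esac
}

# Example use on the cluster (left commented out because it needs seff and a real log):
# latest=$(ls -t slurm_*.out | head -n 1)    # most recently modified log
# seff "$(job_id_from_log "$latest")"
```

If your logs follow a different naming pattern, the prefix and suffix stripped in the function need to be adjusted accordingly.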
