%%bash
cat example_001/preprocess.sh
hpcrunner.pl submit_jobs --infile preprocess.sh
[2016/11/03 08:50:03] Beginning to submit jobs to the scheduler
[2016/11/03 08:50:03] Schedule is preprocess
[2016/11/03 08:50:03] Submitting all preprocess job types
[2016/11/03 08:50:04] Submitted batch job 23162
[2016/11/03 08:50:04] Submited job /scratch/gencore/nov_dalma_training/example_001/hpc-runner/scratch/001_preprocess.sh
With Slurm jobid 23162
[2016/11/03 08:50:04] There are 6 batches for job type preprocess
%%bash
tree example_001
Each individual task gets its own output file. The path encodes, in order, the date of submission, the job type, the date of execution, the task count, and the process ID.
%%bash
cat example_001/hpc-runner/logs/2016-11-03-001_preprocess/2016-11-03-CMD_001-PID_13774.md
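The naming scheme described above can be unpacked programmatically, which helps when collecting logs for many tasks. The following is a minimal sketch: the pattern is inferred only from the single example path shown above, and `parse_task_log_path` is a hypothetical helper, not part of hpcrunner. Note that the job type component keeps its numeric prefix (`001_preprocess`).

```python
import re

# Assumption: layout inferred from the one example path in this notebook:
# <submission date>-<job type>/<execution date>-CMD_<task count>-PID_<process ID>.md
LOG_PATH_RE = re.compile(
    r"(?P<submitted>\d{4}-\d{2}-\d{2})-(?P<jobtype>[^/]+)/"
    r"(?P<executed>\d{4}-\d{2}-\d{2})-CMD_(?P<task>\d+)-PID_(?P<pid>\d+)\.md$"
)


def parse_task_log_path(path):
    """Split a task log path into its named components (hypothetical helper)."""
    match = LOG_PATH_RE.search(str(path))
    if match is None:
        raise ValueError(f"unrecognized task log path: {path}")
    fields = match.groupdict()
    # Task count and process ID are numeric; convert them for easier sorting.
    fields["task"] = int(fields["task"])
    fields["pid"] = int(fields["pid"])
    return fields


info = parse_task_log_path(
    "example_001/hpc-runner/logs/2016-11-03-001_preprocess/"
    "2016-11-03-CMD_001-PID_13774.md"
)
print(info)
```

If the log layout differs in other versions of the tool, the pattern would need adjusting; the helper raises `ValueError` rather than guessing.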
Additionally, all output from the scheduler is logged. This is useful when debugging submissions. If, for instance, we had mistyped a module name, submitted to the wrong queue, or requested impossible resources, the resulting error would be recorded here.
%%bash
cat example_001/hpc-runner/logs/2016-11-03-hpcrunner_logs/001_preprocess.log
The process table covers the whole job. Each row records the version (more on this later), the Slurm scheduler ID, the Slurm job name, any task tags, the process ID, the exit code, and the duration.
It is highly recommended to record this table in a project management tool.
%%bash
cat example_001/hpc-runner/logs/2016-11-03-hpcrunner_logs/001-process_table.md
Version | Scheduler Id | Jobname | Task Tags | ProcessID | ExitCode | Duration |
---|---|---|---|---|---|---|
0.0 | 23162 | 001_preprocess | | 16551 | 0 | 0 years, 00 months, 0 days, 00 hours, 00 minutes, 30 seconds |
0.0 | 23167 | 001_preprocess | | 16553 | 0 | 0 years, 00 months, 0 days, 00 hours, 00 minutes, 30 seconds |
0.0 | 23163 | 001_preprocess | | 13774 | 0 | 0 years, 00 months, 0 days, 00 hours, 00 minutes, 30 seconds |
0.0 | 23166 | 001_preprocess | | 16555 | 0 | 0 years, 00 months, 0 days, 00 hours, 00 minutes, 30 seconds |
0.0 | 23165 | 001_preprocess | | 13776 | 0 | 0 years, 00 months, 0 days, 00 hours, 00 minutes, 30 seconds |
0.0 | 23164 | 001_preprocess | | 13778 | 0 | 0 years, 00 months, 0 days, 00 hours, 00 minutes, 30 seconds |
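Since the process table records an exit code per task, it is a convenient place to check whether anything failed. The sketch below is one way to do that, assuming the file is a pipe-delimited Markdown table with the seven columns listed above and an empty Task Tags cell when no tags are set. `parse_process_table` is a hypothetical helper, and the sample text is copied from two rows of the table in this notebook.

```python
def parse_process_table(text):
    """Parse a pipe-delimited Markdown table into a list of dicts (hypothetical helper)."""
    header = None
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Drop at most one leading and one trailing pipe so empty cells survive.
        if line.startswith("|"):
            line = line[1:]
        if line.endswith("|"):
            line = line[:-1]
        cells = [cell.strip() for cell in line.split("|")]
        if header is None:
            header = cells
            continue
        # Skip the separator row (cells made only of dashes and colons).
        if all(cell and set(cell) <= set("-:") for cell in cells):
            continue
        rows.append(dict(zip(header, cells)))
    return rows


sample = """\
Version | Scheduler Id | Jobname | Task Tags | ProcessID | ExitCode | Duration |
---|---|---|---|---|---|---|
0.0 | 23162 | 001_preprocess | | 16551 | 0 | 0 years, 00 months, 0 days, 00 hours, 00 minutes, 30 seconds |
0.0 | 23163 | 001_preprocess | | 13774 | 0 | 0 years, 00 months, 0 days, 00 hours, 00 minutes, 30 seconds |
"""

rows = parse_process_table(sample)
# Exit codes are kept as strings; anything other than "0" counts as a failure here.
failed = [row for row in rows if row["ExitCode"] != "0"]
print(f"{len(rows)} tasks, {len(failed)} failed")
```

To run this against a real job, read the contents of `001-process_table.md` and pass them to `parse_process_table` in place of `sample`.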