Commit 985f9467 authored by Mani Tofigh's avatar Mani Tofigh

Merge branch 'mt-dev'

parents 8a7ab99e f6b81970
...@@ -133,6 +133,9 @@ http://127.0.0.1:9001/?token=...(your token is here)... ...@@ -133,6 +133,9 @@ http://127.0.0.1:9001/?token=...(your token is here)...
``` ```
Copy that address and paste it into your browser, and you must successfuly access Jupyter's GUI. Copy that address and paste it into your browser, and you must successfuly access Jupyter's GUI.
### Apptainer TensorFlow batch job example
## Interactive jobs ## Interactive jobs
### Starting an Interactive job ### Starting an Interactive job
To start an interactive job, you use the srun command with specific parameters that define your job's resource requirements. Here's an example: To start an interactive job, you use the srun command with specific parameters that define your job's resource requirements. Here's an example:
...@@ -291,7 +294,7 @@ mpirun -n $NUM ./$OBJ ...@@ -291,7 +294,7 @@ mpirun -n $NUM ./$OBJ
``` ```
This script compiles the MPI program using `mpicc` and runs it with `mpirun`, and specifies the number of processes with `-n`. This script compiles the MPI program using `mpicc` and runs it with `mpirun`, and specifies the number of processes with `-n`.
Next, prepare a SLURM batch job script named `job-test-mpi.sh` to submit your MPI job. This script requests cluster resources and runs your MPI program through `mpi_hello_world.sh`: Next, prepare a SLURM batch job script named `job-test-mpi.sbatch` to submit your MPI job. This script requests cluster resources and runs your MPI program through `mpi_hello_world.sh`:
```bash ```bash
#!/bin/bash #!/bin/bash
#SBATCH --job-name=mpi_job_test #SBATCH --job-name=mpi_job_test
...@@ -300,13 +303,14 @@ Next, prepare a SLURM batch job script named `job-test-mpi.sh` to submit your MP ...@@ -300,13 +303,14 @@ Next, prepare a SLURM batch job script named `job-test-mpi.sh` to submit your MP
#SBATCH --nodelist=gpu1,gpu2,cn01 #SBATCH --nodelist=gpu1,gpu2,cn01
#SBATCH --time=10:00 #SBATCH --time=10:00
#SBATCH --mem-per-cpu=1000 #SBATCH --mem-per-cpu=1000
module load openmpi4 module load openmpi4
echo "run mpi program using parallel processes" echo "run mpi program using parallel processes"
sh mpi_hello_world.sh $1 sh mpi_hello_world.sh $1
``` ```
This script sets up a job with the name mpi_job_test, specifies output and error files, requests resources (All 3 nodes from the cluster), and loads the OpenMPI module. It then runs the `mpi_hello_world.sh` script and passes the number of processes as an argument. This script sets up a job with the name mpi_job_test, specifies output and error files, requests resources (All 3 nodes from the cluster), and loads the OpenMPI module. It then runs the `mpi_hello_world.sh` script and passes the number of processes as an argument.
In the end, submit your parallel MPI job to SLURM using the sbatch command, specifying the desired number of parallel processes with -n. For example, to run with 8 parallel processes: In the end, submit your parallel MPI job to SLURM using the sbatch command `sbatch job-test-mpi.sbatch`, specifying the desired number of parallel processes with `-n`. For example, to run with 8 parallel processes:
```bash ```bash
sbatch -n 8 job-test-mpi.sh 8 sbatch -n 8 job-test-mpi.sh 8
``` ```
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment