diff --git a/01-access-machines.md b/01-access-machines.md
index 92a9439..02bbdc8 100644
--- a/01-access-machines.md
+++ b/01-access-machines.md
@@ -1,22 +1,20 @@
---
-author: Alexandre Strube // Sabrina Benassou
-title: Accessing the machines, intro
+author: Alexandre Strube // Sabrina Benassou // Javad Kasravi
+title: Bringing Deep Learning Workloads to JSC supercomputers
# subtitle: A primer in supercomputers
-date: June 25, 2024
+date: November 19, 2024
---

## Communication:

Links for the complementary parts of this course:

-- [Zoom](https://go.fzj.de/bringing-dl-workloads-to-jsc-zoom)
-- [Slack](https://go.fzj.de/bringing-dl-workloads-to-jsc-slack)
-- [JSC Training Page](https://go.fzj.de/bringing-dl-workloads-to-jsc-course)
-- [Judoor project page invite](https://go.fzj.de/bringing-dl-workloads-to-jsc-project-join)
-- [This document: https://go.fzj.de/bringing-dl-workloads-to-jsc](https://go.fzj.de/bringing-dl-workloads-to-jsc)
+- [Event page](https://go.fzj.de/dl-in-neuroscience-course)
+- [Judoor project page invite](https://go.fzj.de/dl-in-neuroscience-project-join)
+- [This document: https://go.fzj.de/dl-in-neuroscience](https://go.fzj.de/dl-in-neuroscience)
- Our mailing list for [AI news](https://lists.fz-juelich.de/mailman/listinfo/ml)
-- [Survey at the end of the course](https://go.fzj.de/bringing-dl-workloads-to-jsc-survey)
+- [Survey at the end of the course](https://go.fzj.de/dl-in-neuroscience-survey)
- [Virtual Environment template](https://gitlab.jsc.fz-juelich.de/kesselheim1/sc_venv_template)
-- [SOURCE of the course/slides on Github](https://go.fzj.de/bringing-dl-workloads-to-jsc-repo)
+- [SOURCE of the course/slides on Github](https://go.fzj.de/dl-in-neuroscience-repo)

![](images/Logo_FZ_Juelich_rgb_Schutzzone_transparent.svg)

@@ -44,34 +42,38 @@ Links for the complimentary parts of this course:
:::: {.col}
![Sabrina Benassou](pics/sabrina.jpg)
::::
+:::: {.col}
+![Javad Kasravi](pics/javad.jpg)
+::::
+
:::
![](images/Logo_FZ_Juelich_rgb_Schutzzone_transparent.svg)

---

-### Schedule for day 1
+### Schedule

| Time | Title |
| ------------- | ----------- |
-| 10:00 - 10:15 | Welcome |
-| 10:15 - 11:00 | Introduction |
-| 11:00 - 11:15 | Coffee break |
-| 11:16 - 11:30 | Judoor, Keys |
-| 11:30 - 12:00 | SSH, Jupyter, VS Code |
+| 09:00 - 09:15 | Welcome |
+| 09:15 - 10:00 | Introduction |
+| 10:00 - 10:15 | Coffee break |
+| 10:15 - 10:30 | Judoor, Keys |
+| 10:30 - 11:00 | Jupyter-JSC |
+| 11:00 - 11:15 | Coffee Break |
+| 11:15 - 12:00 | Running services on the login and compute nodes |
| 12:00 - 12:15 | Coffee Break |
-| 12:15 - 13:00 | Running services on the login and compute nodes |
-| 13:00 - 13:15 | Coffee Break |
-| 13:30 - 14:00 | Sync (everyone should be at the same point) |
+| 12:30 - 13:00 | Sync (everyone should be at the same point) |

---

### Note

Please open this document on your own browser! We will need it for the exercises.

-[https://go.fzj.de/bringing-dl-workloads-to-jsc](https://go.fzj.de/bringing-dl-workloads-to-jsc)
+[https://go.fzj.de/dl-in-neuroscience](https://go.fzj.de/dl-in-neuroscience)

-![Mobile friendly, but you need it on your computer, really](images/bringing-dl-workloads-to-jsc.png)
+![Mobile friendly, but you need it on your computer, really](images/dl-in-neuroscience.png)

---

@@ -228,12 +230,12 @@ Please open this document on your own browser! We will need it for the exercises

### Connecting to Jureca DC
#### Getting compute time
-- Go to [https://go.fzj.de/bringing-dl-workloads-to-jsc-project-join](https://go.fzj.de/bringing-dl-workloads-to-jsc-project-join)
-- Join the course project `training2425`
+- Go to [https://go.fzj.de/dl-in-neuroscience-project-join](https://go.fzj.de/dl-in-neuroscience-project-join)
+- Join the course project `training2441`
- Sign the Usage Agreements ([Video](https://drive.google.com/file/d/1mEN1GmWyGFp75uMIi4d6Tpek2NC_X8eY/view))
- Compute time allocation is based on compute projects. 
For every compute job, a compute project pays.
-- Time is measured in core-hours. One hour of Jureca DC is 48 core-hours.
-- Example: Job runs for 8 hours on 64 nodes of Jureca DC: 8 * 64 * 48 = 24576 core-h!
+- Time is measured in core-hours. One hour on one Jureca DC node (128 cores) costs 128 core-hours.
+- Example: Job runs for 8 hours on 64 nodes of Jureca DC: 8 * 64 * 128 = 65536 core-h!

---

@@ -250,277 +252,32 @@ Please open this document on your own browser! We will need it for the exercises

## Jupyter

-#### Pay attention to the partition - DON'T RUN IT ON THE LOGIN NODE!!!

![](images/jupyter-partition.png)

---

-## Connecting to Jureca DC
-
----
-
-## VSCode
-
-- [Download VScode: code.visualstudio.com](https://code.visualstudio.com/download)
-- Install and run it
-  - On the local terminal, type `code`
-- Install [Remote Development Tools](https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.vscode-remote-extensionpack)
-- Install [Remote: SSH](https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.remote-ssh)
-- If you have Windows, you need WSL as explained on the email.
-
----
-
-## VSCode
-
-### Now with the remote explorer tab
-![](images/vscode-welcome.png)
-
-
----
-
-#### SSH
-- SSH is a secure shell (terminal) connection to another computer
-- You connect from your computer to the LOGIN NODE
-- Security is given by public/private keys
-- A connection to the supercomputer needs a
-  1. Key,
-  2. Configuration
-  3. Key/IP address known to the supercomputer
-
----
-
-### SSH
-
-#### Create key in VSCode's Terminal (menu View->Terminal)
-
-```bash
-mkdir ~/.ssh/
-ssh-keygen -a 100 -t ed25519 -f ~/.ssh/id_ed25519-JSC
-```
-
-```bash
-$ ssh-keygen -a 100 -t ed25519 -f ~/.ssh/id_ed25519-JSC
-Generating public/private ed25519 key pair. 
-Enter passphrase (empty for no passphrase): -Enter same passphrase again: -Your identification has been saved in /Users/strube1/.ssh/id_ed25519-JSC -Your public key has been saved in /Users/strube1/.ssh/id_ed25519-JSC.pub -The key fingerprint is: -SHA256:EGNNC1NTaN8fHwpfuZRPa50qXHmGcQjxp0JuU0ZA86U strube1@Strube-16 -The keys randomart image is: -+--[ED25519 256]--+ -| *++oo=o. . | -| . =+o .= o | -| .... o.E..o| -| . +.+o+B.| -| S =o.o+B| -| . o*.B+| -| . . = | -| o . | -| . | -+----[SHA256]-----+ -``` - ---- - -### SSH - -#### Configure SSH session - -```bash -code $HOME/.ssh/config -``` - -Windows users, from Ubuntu WSL -(Change username for your user on windows) - -```bash -ls -la /mnt/c/Users/ -mkdir /mnt/c/Users/USERNAME/.ssh/ -cp $HOME/.ssh/* /mnt/c/Users/USERNAME/.ssh/ -``` - - ---- - -### SSH - -#### Configure SSH session - -```bash -Host jureca - HostName jureca.fz-juelich.de - User [MY_USERNAME] # Here goes your username, not the word MY_USERNAME. - AddressFamily inet - IdentityFile ~/.ssh/id_ed25519-JSC - MACs hmac-sha2-512-etm@openssh.com -``` - -Copy contents to the config file and save it - -**REPLACE [MY_USERNAME] WITH YOUR USERNAME!!! 🤦‍♂️** - ---- - -### SSH - -#### JSC restricts from where you can login -#### So we need to: -1. Find our ip range -2. Add the range and key to [Judoor](https://judoor.fz-juelich.de) - ---- - -### SSH - -#### Find your ip/name range - -Open **[https://www.whatismyip.com](https://www.whatismyip.com)** - ---- - -### SSH - -#### Find your ip/name range - -![](images/whatismyip.png) - -- Let's keep this inside vscode: `code key.txt` and paste the number you got - ---- - -### SSH - -Did everyone get their **own** ip address? 
- ---- - -### SSH - EXAMPLE - -- I will use the number `93.199.55.163` -- **YOUR NUMBER IS DIFFERENT** -- Seriously - ---- - -### SSH - Example: `93.199.55.163` - -- Go to VSCode and make it simpler, replace the 2nd half with `"0.0/16"`: - - It was `93.199.55.163` - - Becomes `93.199.0.0/16` (with YOUR number, not with the example) -- Add a `from=""` around it -- So, it looks like this, now: `from="93.199.0.0/16"` -- Add a second magic number, with a comma: `,10.0.0.0/8` 🧙‍♀️ -- I promise, the magic is worth it 🧝‍♂️ (If time allows) -- In the end it looks like this: `from="93.199.0.0/16,10.0.0.0/8"` 🎬 -- Keep it open, we will use it later -- If you are from FZJ, also add "134.94.0.0/16" with a comma - ---- - -### SSH - Example: `93.199.0.0/16` - -#### Copy your ssh key -- Terminal: `code ~/.ssh/id_ed25519-JSC.pub` -- Something like this will open: - -- ```bash -ssh-ed25519 AAAAC3NzaC1lZDE1NTA4AAAAIHaoOJF3gqXd7CV6wncoob0DL2OJNfvjgnHLKEniHV6F strube@demonstration.fz-juelich.de -``` - -- Paste this line at the same `key.txt` which you just opened - ---- - -### SSH - -#### Example: `93.199.0.0/16` - -- Put them together and copy again: -- ```bash -from="93.199.0.0/16,10.0.0.0/8" ssh-ed25519 AAAAC3NzaC1lZDE1NTA4AAAAIHaoOJF3gqXd7CV6wncoob0DL2OJNfvjgnHLKEniHV6F strube@demonstration.fz-juelich.de -``` - ---- - -### SSH - -- Let's add it on [Judoor](https://judoor.fz-juelich.de) -- ![](images/manage-ssh-keys.png) -- Do it for JURECA and JUDAC with the same key - ---- - -### SSH - -#### Add new key to [Judoor](https://judoor.fz-juelich.de) - -![](images/manage-ssh-keys-from-and-key.png){ width=850px } - -This might take some minutes - ---- - -### SSH: Exercise - -That's it! Give it a try (and answer yes) +## Working with the supercomputer's software -```bash -$ ssh jureca -The authenticity of host 'jrlogin03.fz-juelich.de (134.94.0.185)' cannot be established. -ED25519 key fingerprint is SHA256:ASeu9MJbkFx3kL1FWrysz6+paaznGenChgEkUW8nRQU. 
-This key is not known by any other names
-Are you sure you want to continue connecting (yes/no/[fingerprint])? Yes
-**************************************************************************
-* Welcome to Jureca DC *
-**************************************************************************
-...
-...
-strube1@jrlogin03~ $
-```
+## Working with the supercomputer's software
+
+- We have literally thousands of software packages, hand-compiled for the specifics of the supercomputer.
+- [Full list](https://www.fz-juelich.de/en/ias/jsc/services/user-support/using-systems/software)
+- [Detailed documentation](https://apps.fz-juelich.de/jsc/hps/jureca/software-modules.html)

---

-### SSH: Exercise
-#### Make sure you are connected to the supercomputer
-
-```bash
-# Create a folder for myself
-mkdir $PROJECT_training2425/$USER
-
-# Create a shortcut for the project on the home folder
-rm -rf ~/course ; ln -s $PROJECT_training2425/$USER ~/course
-
-# Enter course folder and
-cd ~/course
-
-# Where am I?
-pwd
-
-# We well need those later
-mkdir ~/course/.cache
-mkdir ~/course/.config
-mkdir ~/course/.fastai
-rm -rf $HOME/.cache ; ln -s ~/course/.cache $HOME/
-rm -rf $HOME/.config ; ln -s ~/course/.config $HOME/
-rm -rf $HOME/.fastai ; ln -s ~/course/.fastai $HOME/
-```
-
----
+## Launcher in Jupyter-JSC
+![](images/launcher-jupyter-jsc.png)
+
+## Software
+
+### Connect to terminal
+
+![](images/jupyter-terminal.png)
+
+---
+
-## Working with the supercomputer's software
-
-- We have literally thousands of software packages, hand-compiled for the specifics of the supercomputer. 
-- [Full list](https://www.fz-juelich.de/en/ias/jsc/services/user-support/using-systems/software)
-- [Detailed documentation](https://apps.fz-juelich.de/jsc/hps/jureca/software-modules.html)

---

-## Software
-
-#### Tool for finding software: `module spider`
+### Tool for finding software: `module spider`

```bash
strube1$ module spider PyTorch
@@ -644,49 +401,25 @@ The following modules match your search criteria: "toml"
```

---

-## VSCode
-#### Editing files on the supercomputers
+### How to run it on the login node

-![](images/vscode-remotes.png)
+#### Create a python file
+![](images/open-new-file-jp.png)

---

-## VSCode
-
-![](images/vscode-jusuf.png)
+#### Create a python file
+![](images/rename-matrix-python-file.png)

---

-## VSCode
-
-- You can have a terminal inside VSCode:
-    - Go to the menu View->Terminal
-
----
-
-## VSCode
-
-- From the VSCode's terminal, navigate to your "course" folder and to the name you created earlier.
-
-- ```bash
-cd $HOME/course/
-pwd
-```
-
-- This is out working directory. We do everything here.
+#### Create a python file
+![](images/open-editor-matrix-python.png)

---

-### Demo code
-#### Create a new file "`matrix.py`" on VSCode on Jureca DC
-
-```bash
-code matrix.py
-```
-
-Paste this into the file:
-
-``` {.python .number-lines}
+#### Create a python file
+``` {.python .number-lines}
import torch

matrix1 = torch.randn(3,3)
@@ -701,8 +434,12 @@ print("The result is:\n", result)

---

-### How to run it on the login node
+#### Create a python file
+![](images/create-python-file.png)
+---
+
+#### Run the code on the login node
```
module load Stages/2023
module load GCC OpenMPI PyTorch
@@ -738,11 +475,11 @@ Simple Linux Utility for Resource Management

### Slurm submission file example

-`code jureca-matrix.sbatch`
+Create a file named `jureca-matrix.sbatch` as described in the previous section, and copy the following content into it. 
``` {.bash .number-lines}
#!/bin/bash
-#SBATCH --account=training2425 # Who pays?
+#SBATCH --account=training2441 # Who pays?
#SBATCH --nodes=1 # How many compute nodes
#SBATCH --job-name=matrix-multiplication
#SBATCH --ntasks-per-node=1 # How many mpi processes/node
@@ -751,7 +488,7 @@
#SBATCH --error=error.%j
#SBATCH --time=00:01:00 # For how long can it run?
#SBATCH --partition=dc-gpu # Machine partition
-#SBATCH --reservation=training2425 # For today only
+#SBATCH --reservation=training2441 # For today only

module load Stages/2024
module load GCC OpenMPI PyTorch # Load the correct modules on the compute node(s)
@@ -800,7 +537,7 @@ squeue --me

### Reservations

- Some partitions have reservations, which means that only certain users can use them at certain times.
-- For this course, it's called `training2441`
+- For this course, it's called `training2441`

---

@@ -816,13 +553,7 @@ scancel

#### By now you should have output and error log files on your directory. Check them!

-```bash
-# Notice that this number is the job id. It's different for every job
-cat output.412169
-cat error.412169
-```
-
-Or simply open it on VSCode!
+Simply open `output.412169` and `error.412169` in the editor! The number is the job id; it is different for every job. 
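---

### Finding the newest log files

When several jobs have run in the same directory, picking the right `output.<jobid>`/`error.<jobid>` pair by hand gets tedious. As a small illustrative sketch only (the `latest_job_logs` helper below is ours, not part of the course material), you can locate the newest pair programmatically:

```python
from pathlib import Path

def latest_job_logs(directory="."):
    """Return the most recently modified (output, error) log pair.

    Slurm writes one output.<jobid> and one error.<jobid> file per job,
    so we sort the output files by modification time and take the newest.
    """
    outputs = sorted(Path(directory).glob("output.*"),
                     key=lambda p: p.stat().st_mtime)
    if not outputs:
        return None
    newest = outputs[-1]
    job_id = newest.suffix.lstrip(".")  # e.g. "412169"
    return newest, Path(directory) / f"error.{job_id}"

if __name__ == "__main__":
    logs = latest_job_logs()
    if logs:
        for log in logs:
            print(f"--- {log} ---")
            print(log.read_text())
```

Run it from the directory where `sbatch` wrote the logs; it prints the newest pair.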
--- @@ -932,7 +663,7 @@ code fastai.sbatch ```bash #!/bin/bash -#SBATCH --account=training2425 +#SBATCH --account=training2441 #SBATCH --mail-user=MYUSER@fz-juelich.de #SBATCH --mail-type=ALL #SBATCH --nodes=1 @@ -943,7 +674,7 @@ code fastai.sbatch #SBATCH --error=error.%j #SBATCH --time=00:20:00 #SBATCH --partition=dc-gpu -#SBATCH --reservation=training2425 # For today only +#SBATCH --reservation=training2441 # For today only cd $HOME/course/ source sc_venv_template/activate.sh # Now we finally use the fastai module @@ -996,7 +727,7 @@ The following modules were not unloaded: - If you run it longer, you will get the actual error: - ```python Traceback (most recent call last): - File "/p/project/training2425/strube1/cats.py", line 5, in + File "/p/project/training2441/strube1/cats.py", line 5, in path = untar_data(URLs.PETS)/'images' ... ... @@ -1159,7 +890,7 @@ A tunnel which exposes the supercomputer's port 3000 as port 1234 locally](image --- -## Port forwarding demo: + ### Tensorboard on Jureca DC @@ -1196,164 +927,3 @@ As of now, I expect you managed to: ## ANY QUESTIONS?? #### Feedback is more than welcome! - ---- - -### Helmholtz Blablador - -![](images/blablador.png) - ---- - -### Blablador - -- Blablador is our Large Language Model inference server (eg. ChatGPT) -- It's a service for the Helmholtz Association. - - It's fast, free and PRIVATE - I don't record your conversations! 
-- Anyone here can use it - ---- - -### Blablador - -![https://helmholtz-blablador.fz-juelich.de](images/blablador-qrcode.png){width=500px} - ---- - -## VScode + Continue.dev - -![](images/continue-ask-code.png) - ---- - -### Obtaining a token - -- Go to helmholtz codebase at [http://codebase.helmholtz.cloud](http://codebase.helmholtz.cloud) -- Log in with your email -- On the left side, click on your profile, and then on "Preferences" -- On "Access tokens", click "Add new token", - - give it a name, - - put an expiration date (max 1 year) - - and choose "api" in the "scopes" section -- Click "Create Personal Access Token" - - You will see a "............................." - copy this and save somewhere. - ---- - -### Blablador - -![](images/blablador-api-scope.png){width=800px} - ---- - -### Blablador on VSCode! - -- Add [continue.dev](https://marketplace.visualstudio.com/items?itemName=Continue.continue) extension to VSCode -- On Continue, choose to add model, choose Other OpenAI-compatible API -- Click in Open Config.json at the end - ---- - -## Blablador: VScode + Continue.dev - -- Inside config.json, add at the `"models"` section: - -- ```json - { - "title": "Mistral helmholtz", - "provider": "openai", - "contextLength": 16384, - "model": "alias-code", - "apiKey": "ADD-YOUR-TOKEN-HERE", - "apiBase": "https://helmholtz-blablador.fz-juelich.de:8000" - }, -``` - -- REPLACE THE APIKEY WITH YOUR OWN TOKEN!!!! - ---- - -### Blablador on VSCode - -- Click on the "Continue.dev extension on the left side of VSCode. -- Select some code from our exercises, select it and send it to continue with cmd-shift-L (or ctrl-shift-L) -- Ask it to add unit tests, for example. - ---- - -## Backup slides - ---- - -## There's more! - -- Remember the magic? 🧙‍♂️ -- Let's use it now to access the compute nodes directly! 
- ---- - -## Proxy Jump - -#### Accessing compute nodes directly - -- If we need to access some ports on the compute nodes -- ![](images/proxyjump-magic.svg) - ---- - -## Proxy Jump - SSH Configuration - -Type on your machine "`code $HOME/.ssh/config`" and paste this at the end: - -```ssh - -# -- Compute Nodes -- -Host *.jureca - User [ADD YOUR USERNAME HERE] - StrictHostKeyChecking no - IdentityFile ~/.ssh/id_ed25519-JSC - ProxyJump jureca -``` - ---- - -## Proxy Jump: Connecting to a node - -Example: A service provides web interface on port 9999 - -On the supercomputer: - -```bash -srun --time=00:05:00 \ - --nodes=1 --ntasks=1 \ - --partition=dc-gpu \ - --account training2425 \ - --cpu_bind=none \ - --pty /bin/bash -i - -bash-4.4$ hostname # This is running on a compute node of the supercomputer -jwb0002 - -bash-4.4$ cd $HOME/course/ -bash-4.4$ source sc_venv_template/activate.sh -bash-4.4$ tensorboard --logdir=runs --port=9999 serve -``` - ---- - -## Proxy Jump - -On your machine: - -- ```bash -ssh -L :3334:localhost:9999 jrc002i.jureca -``` - -- Mind the `i` letter I added at the end of the hostname - -- Now you can access the service on your local browser at [http://localhost:3334](http://localhost:3334) - ---- - -### Now that's really the end! 😓 - diff --git a/03-parallelize-training.md b/02-parallelize-training.md similarity index 80% rename from 03-parallelize-training.md rename to 02-parallelize-training.md index 5a65bc3..77b7672 100644 --- a/03-parallelize-training.md +++ b/02-parallelize-training.md @@ -1,8 +1,36 @@ --- -author: Alexandre Strube // Sabrina Benassou +author: Alexandre Strube // Sabrina Benassou // Javad Kasravi title: Bringing Deep Learning Workloads to JSC supercomputers subtitle: Parallelize Training -date: June 25, 2024 +date: November 19, 2024 + +--- + +## Good practice + +- Always store your code in the project folder. 
In our case
+- ```bash
+/p/project/training2441/$USER
+```
+
+- Store data in the scratch directory for faster I/O access. Files in scratch are deleted after 90 days of inactivity.
+- ```bash
+/p/scratch/training2441/$USER
+```
+
+- Store the data in `$DATA_datasets` for a more permanent location. This location is not accessible by compute nodes.
+You have to join the [project](https://judoor.fz-juelich.de/projects/datasets/) in order to store and access data there.
+
+
+---
+
+## We need to download some code
+
+```bash
+cd $HOME/course
+git clone https://github.com/HelmholtzAI-FZJ/2024-11-course-deep-learning-in-neuroscience
+```
+

---

## The ResNet50 Model
@@ -10,6 +38,17 @@ date: June 25, 2024

---

+
+## The ImageNet dataset
+#### Large Scale Visual Recognition Challenge (ILSVRC)
+- An image dataset organized according to the [WordNet hierarchy](https://wordnet.princeton.edu).
+- Extensively used in algorithms for object detection and image classification at large scale.
+- It has 1000 classes, comprising 1.2 million training images and 50,000 validation images.
+
+![](images/imagenet_banner.jpeg)
+
+---
+
## ImageNet class

```python
@@ -20,8 +59,8 @@ class ImageNet(Dataset):

        self.root = root

-        with open(os.path.join(root, "imagenet_{}.json".format(split)), "rb") as f:
-            data = json.load(f)
+        with open(os.path.join(root, "imagenet_{}.pk".format(split)), "rb") as f:
+            data = pickle.load(f)

        self.samples = list(data.keys())
        self.targets = list(data.values())
@@ -74,7 +113,8 @@ class ImageNetDataModule(pl.LightningDataModule):
class resnet50Model(pl.LightningModule):
    def __init__(self):
        super().__init__()
-        self.model = resnet50(pretrained=True)
+        weights = ResNet50_Weights.DEFAULT
+        self.model = resnet50(weights=weights)

    def forward(self, x):
        return self.model(x)
@@ -103,7 +143,7 @@ transform = transforms.Compose([
])

# 1. 
Organize the data
-datamodule = ImageNetDataModule("/p/scratch/training2425/data/", 256, \
+datamodule = ImageNetDataModule("/p/scratch/training2441/", 256, \
                                int(os.getenv('SLURM_CPUS_PER_TASK')), transform)
# 2. Build the model using desired Task
model = resnet50Model()
@@ -124,13 +164,13 @@ trainer.save_checkpoint("image_classification_model.pt")
#SBATCH --nodes=1
#SBATCH --gres=gpu:1
#SBATCH --ntasks-per-node=1
-#SBATCH --cpus-per-task=96
+#SBATCH --cpus-per-task=128
#SBATCH --time=06:00:00
#SBATCH --partition=dc-gpu
-#SBATCH --account=training2425
+#SBATCH --account=training2441
#SBATCH --output=%j.out
#SBATCH --error=%j.err
-#SBATCH --reservation=training2425
+#SBATCH --reservation=training2441

# To get number of cpu per task
export SRUN_CPUS_PER_TASK="$SLURM_CPUS_PER_TASK"
@@ -152,10 +192,69 @@ real 342m11.864s

## But what about many GPUs?

+::: {.container}
+:::: {.col}
+
+
+
+
+
+- We make use of the GPUs of our supercomputer and distribute our training to make it faster.
- It's when things get interesting
+::::
+:::: {.col}
+![](images/GPUs.svg)
+::::
+:::

+---
+
+## Distributed Training
+
+
+- Parallelizes the training across multiple nodes.
+- Significantly enhances training speed and model accuracy.
+- It is particularly beneficial for large models and computationally intensive tasks, such as deep learning.[[1]](https://pytorch.org/tutorials/distributed/home.html)
+
+
+---
+
+
+
+
+
+
+
+---
+
+
+
+
+## Parallel Training with PyTorch DDP
+
+- [PyTorch's DDP (Distributed Data Parallel)](https://lightning.ai/docs/pytorch/stable/accelerators/gpu_intermediate.html) works as follows:
+    - Each GPU across each node gets its own process.
+    - Each GPU gets visibility into a subset of the overall dataset. It will only ever see that subset.
+    - Each process inits the model.
+    - Each process performs a full forward and backward pass in parallel.
+    - The gradients are synced and averaged across all processes.
+    - Each process updates its optimizer. 
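---

### Gradient averaging, sketched

The "synced and averaged" step above is an all-reduce. As a toy illustration only (plain Python lists standing in for per-GPU gradient tensors, no real NCCL or torch collectives involved), the averaging that DDP performs looks like this:

```python
def allreduce_mean(per_process_grads):
    """Toy stand-in for DDP's gradient all-reduce: after the collective,
    every process holds the element-wise mean of all gradients."""
    n = len(per_process_grads)
    length = len(per_process_grads[0])
    mean = [sum(g[i] for g in per_process_grads) / n for i in range(length)]
    # Every process receives the same averaged gradient.
    return [list(mean) for _ in range(n)]

# 4 "processes" (GPUs), each with a gradient computed on its own data shard
grads = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
print(allreduce_mean(grads)[0])  # [4.0, 5.0]
```

Because every process ends up with identical gradients, each local optimizer step keeps all model replicas in sync.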
+
+---
+
## Data Parallel

![](images/data-parallel.svg)
@@ -183,13 +282,13 @@ real 342m11.864s
#SBATCH --nodes=1
#SBATCH --gres=gpu:4 # Use the 4 GPUs available
#SBATCH --ntasks-per-node=4 # When using pl it should always be set to 4
-#SBATCH --cpus-per-task=24 # Divide the number of cpus (96) by the number of GPUs (4)
+#SBATCH --cpus-per-task=32 # Divide the number of cpus (128) by the number of GPUs (4)
#SBATCH --time=02:00:00
#SBATCH --partition=dc-gpu
-#SBATCH --account=training2425
+#SBATCH --account=training2441
#SBATCH --output=%j.out
#SBATCH --error=%j.err
-#SBATCH --reservation=training2425
+#SBATCH --reservation=training2441

export CUDA_VISIBLE_DEVICES=0,1,2,3 # Very important to make the GPUs visible
export SRUN_CPUS_PER_TASK="$SLURM_CPUS_PER_TASK"
@@ -236,6 +335,163 @@ real 89m15.923s

---

+## DDP steps
+
+1. Set up the environment variables for the distributed mode (WORLD_SIZE, RANK, LOCAL_RANK ...)
+
+- ```python
+# The number of total processes started by Slurm.
+ntasks = os.getenv('SLURM_NTASKS')
+# Index of the current process.
+rank = os.getenv('SLURM_PROCID')
+# Index of the current process on this node only.
+local_rank = os.getenv('SLURM_LOCALID')
+# The number of nodes
+nnodes = os.getenv("SLURM_NNODES")
+```
+
+---
+
+## DDP steps
+
+2. Initialize a sampler to specify the sequence of indices/keys used in data loading.
+3. Implement data parallelism of the model.
+4. Allow only one process to save checkpoints.
+
+- ```python
+datamodule = ImageNetDataModule("/p/scratch/training2441/", 256, \
+                                int(os.getenv('SLURM_CPUS_PER_TASK')), transform)
+trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes)
+trainer.fit(model, datamodule=datamodule)
+trainer.save_checkpoint("image_classification_model.pt")
+```
+
+---
+
+## Multi-Node training
+
+```python
+transform = transforms.Compose([
+    transforms.ToTensor(),
+    transforms.Resize((256, 256))
+])
+
+# 1. The number of nodes
+nnodes = os.getenv("SLURM_NNODES")
+# 2. 
Organize the data +datamodule = ImageNetDataModule("/p/scratch/training2441/", 128, \ + int(os.getenv('SLURM_CPUS_PER_TASK')), transform) +# 3. Build the model using desired Task +model = resnet50Model() +# 4. Create the trainer +trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes) +# 5. Train the model +trainer.fit(model, datamodule=datamodule) +# 6. Save the model! +trainer.save_checkpoint("image_classification_model.pt") +``` + +--- + +## Multi-Node training + +16 nodes and 4 GPU each + +```bash +#!/bin/bash -x +#SBATCH --nodes=16 # This needs to match Trainer(num_nodes=...) +#SBATCH --gres=gpu:4 # Use the 4 GPUs available +#SBATCH --ntasks-per-node=4 # When using pl it should always be set to 4 +#SBATCH --cpus-per-task=32 # Divide the number of cpus (128) by the number of GPUs (4) +#SBATCH --time=00:15:00 +#SBATCH --partition=dc-gpu +#SBATCH --account=training2441 +#SBATCH --output=%j.out +#SBATCH --error=%j.err +#SBATCH --reservation=training2441 + +export CUDA_VISIBLE_DEVICES=0,1,2,3 # Very important to make the GPUs visible +export SRUN_CPUS_PER_TASK="$SLURM_CPUS_PER_TASK" + +source $HOME/course/$USER/sc_venv_template/activate.sh +time srun python3 ddp_training.py +``` + +```bash +real 6m56.457s +``` + +--- + +## Multi-Node training + +With 4 nodes: + +```bash +real 24m48.169s +``` + +With 8 nodes: + +```bash +real 13m10.722s +``` + +With 16 nodes: + +```bash +real 6m56.457s +``` + +With 32 nodes: + +```bash +real 4m48.313s +``` +--- + +## Data Parallel + + + +- It was +- ```python +trainer = pl.Trainer(max_epochs=10, accelerator="gpu") +``` +- Became +- ```python +nnodes = os.getenv("SLURM_NNODES") +trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes) +``` + +--- + +## Data Parallel + + + +- It was +- ```bash +#SBATCH --nodes=1 +#SBATCH --gres=gpu:1 +#SBATCH --ntasks-per-node=1 +#SBATCH --cpus-per-task=128 +``` +- Became +- ```bash +#SBATCH --nodes=16 # This needs to match Trainer(num_nodes=...) 
+#SBATCH --gres=gpu:4 # Use the 4 GPUs available +#SBATCH --ntasks-per-node=4 # When using pl it should always be set to 4 +#SBATCH --cpus-per-task=32 # Divide the number of cpus (128) by the number of GPUs (4) +export CUDA_VISIBLE_DEVICES=0,1,2,3 # Very important to make the GPUs visible +``` + +--- + +## DEMO + +--- + ## Before we go further... - Data parallel is usually good enough 👌 @@ -431,187 +687,6 @@ real 89m15.923s --- - -## Parallel Training with PyTorch DDP - -- [PyTorch's DDP (Distributed Data Parallel)](https://lightning.ai/docs/pytorch/stable/accelerators/gpu_intermediate.html) works as follows: - - Each GPU across each node gets its own process. - - Each GPU gets visibility into a subset of the overall dataset. It will only ever see that subset. - - Each process inits the model. - - Each process performs a full forward and backward pass in parallel. - - The gradients are synced and averaged across all processes. - - Each process updates its optimizer. - ---- - - -## Terminologies - -- WORLD_SIZE: number of processes participating in the job. -- RANK: the rank of the process in the network. -- LOCAL_RANK: the rank of the process on the local machine. -- MASTER_PORT: free port on machine with rank 0. - - ---- - -## DDP steps - -1. Set up the environement variables for the distributed mode (WORLD_SIZE, RANK, LOCAL_RANK ...) - -- ```python -# The number of total processes started by Slurm. -ntasks = os.getenv('SLURM_NTASKS') -# Index of the current process. -rank = os.getenv('SLURM_PROCID') -# Index of the current process on this node only. -local_rank = os.getenv('SLURM_LOCALID') -# The number of nodes -nnodes = os.getenv("SLURM_NNODES") -``` - ---- - -## DDP steps - -2. Initialize a sampler to specify the sequence of indices/keys used in data loading. -3. Implements data parallelism of the model. -4. Allow only one process to save checkpoints. 
- -- ```python -datamodule = ImageNetDataModule("/p/scratch/training2425/data/", 256, \ - int(os.getenv('SLURM_CPUS_PER_TASK')), transform) -trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes) -trainer.fit(model, datamodule=datamodule) -trainer.save_checkpoint("image_classification_model.pt") -``` - ---- - -## DDP steps - -```python -transform = transforms.Compose([ - transforms.ToTensor(), - transforms.Resize((256, 256)) -]) - -# 1. The number of nodes -nnodes = os.getenv("SLURM_NNODES") -# 2. Organize the data -datamodule = ImageNetDataModule("/p/scratch/training2425/data/", 128, \ - int(os.getenv('SLURM_CPUS_PER_TASK')), transform) -# 3. Build the model using desired Task -model = resnet50Model() -# 4. Create the trainer -trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes) -# 5. Train the model -trainer.fit(model, datamodule=datamodule) -# 6. Save the model! -trainer.save_checkpoint("image_classification_model.pt") -``` - ---- - -## DDP training - -16 nodes and 4 GPU each - -```bash -#!/bin/bash -x -#SBATCH --nodes=16 # This needs to match Trainer(num_nodes=...) 
-#SBATCH --gres=gpu:4 # Use the 4 GPUs available -#SBATCH --ntasks-per-node=4 # When using pl it should always be set to 4 -#SBATCH --cpus-per-task=24 # Divide the number of cpus (96) by the number of GPUs (4) -#SBATCH --time=00:15:00 -#SBATCH --partition=dc-gpu -#SBATCH --account=training2425 -#SBATCH --output=%j.out -#SBATCH --error=%j.err -#SBATCH --reservation=training2425 - -export CUDA_VISIBLE_DEVICES=0,1,2,3 # Very important to make the GPUs visible -export SRUN_CPUS_PER_TASK="$SLURM_CPUS_PER_TASK" - -source $HOME/course/$USER/sc_venv_template/activate.sh -time srun python3 ddp_training.py -``` - -```bash -real 6m56.457s -``` - ---- - -## DDP training - -With 4 nodes: - -```bash -real 24m48.169s -``` - -With 8 nodes: - -```bash -real 13m10.722s -``` - -With 16 nodes: - -```bash -real 6m56.457s -``` - -With 32 nodes: - -```bash -real 4m48.313s -``` ---- - -## Data Parallel - - - -- It was -- ```python -trainer = pl.Trainer(max_epochs=10, accelerator="gpu") -``` -- Became -- ```python -nnodes = os.getenv("SLURM_NNODES") -trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes) -``` - ---- - -## Data Parallel - - - -- It was -- ```bash -#SBATCH --nodes=1 -#SBATCH --gres=gpu:1 -#SBATCH --ntasks-per-node=1 -#SBATCH --cpus-per-task=96 -``` -- Became -- ```bash -#SBATCH --nodes=16 # This needs to match Trainer(num_nodes=...) -#SBATCH --gres=gpu:4 # Use the 4 GPUs available -#SBATCH --ntasks-per-node=4 # When using pl it should always be set to 4 -#SBATCH --cpus-per-task=24 # Divide the number of cpus (96) by the number of GPUs (4) -export CUDA_VISIBLE_DEVICES=0,1,2,3 # Very important to make the GPUs visible -``` - ---- - -## DEMO - ---- - ## TensorBoard - In resnet50.py @@ -645,7 +720,6 @@ tensorboard --logdir=[PATH_TO_TENSOR_BOARD] ## DAY 2 RECAP -- Access using FS, Arrow, and H5 files - Ran parallel code. - Can submit single node, multi-gpu and multi-node training. - Use TensorBoard on the supercomputer. 
@@ -657,7 +731,7 @@ tensorboard --logdir=[PATH_TO_TENSOR_BOARD] #### Feedback is more than welcome! -#### Link to [other courses at JSC](https://go.fzj.de/intro-sc-ai-2023-other-courses) +#### Link to [other courses at JSC](https://go.fzj.de/dl-in-neuroscience-all-courses) --- diff --git a/02-speedup-data-loading.md b/02-speedup-data-loading.md deleted file mode 100644 index 8d9d79b..0000000 --- a/02-speedup-data-loading.md +++ /dev/null @@ -1,444 +0,0 @@ ---- -author: Alexandre Strube // Sabrina Benassou -title: Bringing Deep Learning Workloads to JSC supercomputers -subtitle: Data loading -date: June 25, 2024 ---- - -### Schedule for day 2 - -| Time | Title | -| ------------- | ----------- | -| 10:00 - 10:15 | Welcome, questions | -| 10:15 - 11:30 | Data loading | -| 11:30 - 12:00 | Coffee Break (flexible) | -| 12:30 - 14:00 | Parallelize Training | - ---- - -## Let's talk about DATA - -- Some general considerations one should have in mind - ---- - -![Not this data](images/data-and-lore.jpg) - ---- - -## I/O is separate and shared - -#### All compute nodes of all supercomputers see the same files - -- Performance tradeoff between shared acessibility and speed -- It's simple to load data fast to 1 or 2 gpus. But to 100? 1000? 10000? - ---- - -### Jülich Supercomputers - -- Our I/O server is almost a supercomputer by itself -- ![JSC Supercomputer Stragegy](images/machines.png) - ---- - -## Where do I keep my files? - -- **`$PROJECT_projectname`** for code (`projectname` is `training2425` in this case) - - Most of your work should stay here -- **`$DATA_projectname`** for big data(*) - - Permanent location for big datasets -- **`$SCRATCH_projectname`** for temporary files (fast, but not permanent) - - Files are deleted after 90 days untouched - ---- - -## Data services - -- JSC provides different data services -- Data projects give massive amounts of storage -- We use it for ML datasets. 
Join the project at **[Judoor](https://judoor.fz-juelich.de/projects/join/datasets)** -- After being approved, connect to the supercomputer and try it: -- ```bash -cd $DATA_datasets -ls -la -``` - ---- - -## Data Staging - -- [LARGEDATA filesystem](https://apps.fz-juelich.de/jsc/hps/juwels/filesystems.html) is not accessible by compute nodes - - Copy files to an accessible filesystem BEFORE working -- Imagenet-21K copy alone takes 21+ minutes to $SCRATCH - - We already copied it to $SCRATCH for you - ---- - -## Data loading - -![Fat GPUs need to be fed FAST](images/nomnom.jpg) - ---- - -## Strategies - -- We have CPUs and lots of memory - let's use them - - multitask training and data loading for the next batch - - `/dev/shm` is a filesystem on ram - ultra fast ⚡️ -- Use big files made for parallel computing - - HDF5, Zarr, mmap() in a parallel fs, LMDB -- Use specialized data loading libraries - - FFCV, DALI, Apache Arrow -- Compression sush as squashfs - - data transfer can be slower than decompression (must be checked case by case) - - Beneficial in cases where numerous small files are at hand. - ---- - -## Libraries - -- Apache Arrow [https://arrow.apache.org/](https://arrow.apache.org/) -- FFCV [https://github.com/libffcv/ffcv](https://github.com/libffcv/ffcv) and [FFCV for PyTorch-Lightning](https://github.com/SerezD/ffcv_pytorch_lightning) -- Nvidia's DALI [https://developer.nvidia.com/dali](https://developer.nvidia.com/dali) - ---- - -## We need to download some code - -```bash -cd $HOME/course -git clone https://github.com/HelmholtzAI-FZJ/2024-06-course-Bringing-Deep-Learning-Workloads-to-JSC-supercomputers.git -``` - ---- - -## The ImageNet dataset -#### Large Scale Visual Recognition Challenge (ILSVRC) -- An image dataset organized according to the [WordNet hierarchy](https://wordnet.princeton.edu). -- Extensively used in algorithms for object detection and image classification at large scale. 
-- It has 1000 classes, that comprises 1.2 million images for training, and 50,000 images for the validation set. - -![](images/imagenet_banner.jpeg) - ---- - -## The ImageNet dataset - -```bash -ILSVRC -|-- Data/ - `-- CLS-LOC - |-- test - |-- train - | |-- n01440764 - | | |-- n01440764_10026.JPEG - | | |-- n01440764_10027.JPEG - | | |-- n01440764_10029.JPEG - | |-- n01695060 - | | |-- n01695060_10009.JPEG - | | |-- n01695060_10022.JPEG - | | |-- n01695060_10028.JPEG - | | |-- ... - | |... - |-- val - |-- ILSVRC2012_val_00000001.JPEG - |-- ILSVRC2012_val_00016668.JPEG - |-- ILSVRC2012_val_00033335.JPEG - |-- ... -``` ---- - -## The ImageNet dataset -imagenet_train.json - -```bash -{ - 'ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_8050.JPEG': 524, - 'ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_12728.JPEG': 524, - 'ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_9736.JPEG': 524, - ... - 'ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_7460.JPEG': 524, - ... - } -``` - -imagenet_val.json - -```bash -{ - 'ILSVRC/Data/CLS-LOC/val/ILSVRC2012_val_00008838.JPEG': 785, - 'ILSVRC/Data/CLS-LOC/val/ILSVRC2012_val_00008555.JPEG': 129, - 'ILSVRC/Data/CLS-LOC/val/ILSVRC2012_val_00028410.JPEG': 968, - ... - 'ILSVRC/Data/CLS-LOC/val/ILSVRC2012_val_00016007.JPEG': 709, - } -``` - ---- - -## Access File System - -```python -def __getitem__(self, idx): - x = Image.open(os.path.join(self.root, self.samples[idx])).convert("RGB") - if self.transform: - x = self.transform(x) - return x, self.targets[idx] - -``` - ---- - -## Inodes -- Inodes (Index Nodes) are data structures that store metadata about files and directories. -- Unique identification of files and directories within the file system. -- Efficient management and retrieval of file metadata. -- Essential for file operations like opening, reading, and writing. -- **Limitations**: - - **Fixed Number**: Limited number of inodes; no new files if exhausted, even with free disk space. 
- - **Space Consumption**: Inodes consume disk space, balancing is needed for efficiency. -![](images/inodes.png) - ---- - -## Pyarrow File Creation - -![](images/field.png) - -```python - binary_t = pa.binary() - uint16_t = pa.uint16() -``` - ---- - -## Pyarrow File Creation - -![](images/schema.png) - -```python - binary_t = pa.binary() - uint16_t = pa.uint16() - - schema = pa.schema([ - pa.field('image_data', binary_t), - pa.field('label', uint16_t), - ]) -``` - ---- - -## Pyarrow File Creation - -![](images/file.png){width=700 height=350} - -```python - with pa.OSFile( - os.path.join(args.target_folder, f'ImageNet-{split}.arrow'), - 'wb', - ) as f: - with pa.ipc.new_file(f, schema) as writer: -``` - ---- - -## Pyarrow File Creation - -![](images/batch.png){width=650 height=300} - -```python - - with open(sample, 'rb') as f: - img_string = f.read() - - image_data = pa.array([img_string], type=binary_t) - label = pa.array([label], type=uint16_t) - - batch = pa.record_batch([image_data, label], schema=schema) - - writer.write(batch) -``` - ---- - -## Pyarrow File Creation - -![](images/pyarrow.png){width=650 height=300} - -```python - - with open(sample, 'rb') as f: - img_string = f.read() - - image_data = pa.array([img_string], type=binary_t) - label = pa.array([label], type=uint16_t) - - batch = pa.record_batch([image_data, label], schema=schema) - - writer.write(batch) -``` - ---- - -## Access Arrow File - -::: {.container} -:::: {.col} -![](images/pyarrow.png){width=500 height=300} -:::: -:::: {.col} -```python -def __getitem__(self, idx): - if self.arrowfile is None: - self.arrowfile = pa.OSFile(self.data_root, 'rb') - self.reader = pa.ipc.open_file(self.arrowfile) - - row = self.reader.get_batch(idx) - - img_string = row['image_data'][0].as_py() - target = row['label'][0].as_py() - - with io.BytesIO(img_string) as byte_stream: - with Image.open(byte_stream) as img: - img = img.convert("RGB") - - if self.transform: - img = self.transform(img) - - return img, 
target - -``` -:::: -::: - ---- - -## HDF5 - -![](images/h5.png) - -```python - -with h5py.File(os.path.join(args.target_folder, 'ImageNet.h5'), "w") as f: - -``` - ---- - -## HDF5 - -::: {.container} -:::: {.col} -```python - -group = g.create_group(split) - -``` -:::: -:::: {.col} -![](images/groups.png) -:::: -::: - ---- - -## HDF5 - - -::: {.container} -:::: {.col} -``` python -dt_sample = h5py.vlen_dtype(np.dtype(np.uint8)) -dt_target = np.dtype('int16') - -dset = group.create_dataset( - 'images', - (len(samples),), - dtype=dt_sample, - ) - -dtargets = group.create_dataset( - 'targets', - (len(samples),), - dtype=dt_target, - ) -``` -:::: -:::: {.col} -![](images/datasets.png){width=400 height=350} -:::: -::: - ---- - -## HDF5 - - -![](images/first_iter.png){width=750 height=350} - -```python -for idx, (sample, target) in tqdm(enumerate(zip(samples, targets))): - with open(sample, 'rb') as f: - img_string = f.read() - dset[idx] = np.array(list(img_string), dtype=np.uint8) - dtargets[idx] = target -``` - ---- - -## HDF5 - - -![](images/last_iter.png){width=750 height=350} - -```python -for idx, (sample, target) in tqdm(enumerate(zip(samples, targets))): - with open(sample, 'rb') as f: - img_string = f.read() - dset[idx] = np.array(list(img_string), dtype=np.uint8) - dtargets[idx] = target -``` - ---- - -## HDF5 - - -![](images/hdf5.png) - ---- - -## Access h5 File - -```python -def __getitem__(self, idx): - if self.h5file is None: - self.h5file = h5py.File(self.train_data_path, 'r')[self.split] - self.imgs = self.h5file["images"] - self.targets = self.h5file["targets"] - - img_string = self.imgs[idx] - target = self.targets[idx] - - with io.BytesIO(img_string) as byte_stream: - with Image.open(byte_stream) as img: - img = img.convert("RGB") - - if self.transform: - img = self.transform(img) - - return img, target -``` - ---- - -## DEMO - ---- - -## Exercise - -- Could you create an arrow file for the flickr dataset stored in 
-```/p/scratch/training2402/data/Flickr30K/``` -and read it using a dataloader ? \ No newline at end of file diff --git a/email-template.md b/email-template.md index c515cb8..b498c13 100644 --- a/email-template.md +++ b/email-template.md @@ -1,20 +1,16 @@ --- -author: Alexandre Strube // Sabrina Benassou -title: Course: Bringing Deep Learning Workloads to JSC supercomputers +author: Alexandre Strube // Sabrina Benassou // Javad Kasravi +title: Deep Learning in Neuroscience // on the Supercomputers of the Jülich Supercomputing Centre # subtitle: A primer in supercomputers` -date: June 25, 2024 +date: November 19, 2024 --- Dear students, the next "Bringing Deep Learning Workloads to JSC supercomputers" course is approaching! Thank you all very much for your participation. -The course is online, over zoom. It might be recorded. This is the link: -https://go.fzj.de/bringing-dl-workloads-to-jsc-zoom - - ********* -IMPORTANT - Please check all steps! Some things need to be done a day BEFORE the course!!! +IMPORTANT - Please check all steps! Some things need to be done some days BEFORE the course!!! ********* Checklist for BEFORE the course: @@ -22,43 +18,29 @@ Checklist for BEFORE the course: - If you don't have one, make an account on JuDOOR, our portal: https://judoor.fz-juelich.de/register Instruction video: https://drive.google.com/file/d/1-DfiNBP4Gta0av4lQmubkXIXzr2FW4a-/view -- Joining the course's project: https://go.fzj.de/bringing-dl-workloads-to-jsc-project-join +- Joining the course's project: https://go.fzj.de/dl-in-neuroscience-project-join - Sign the usage agreements, as shown in this video: https://drive.google.com/file/d/1mEN1GmWyGFp75uMIi4d6Tpek2NC_X8eY/view -- Install software (see below). On windows you DO need administrator rights. We can't support other softwares during the course. - -- We will use Slack for communication. 
Please log in BEFORE the course: https://go.fzj.de/bringing-dl-workloads-to-jsc-slack - - +If you did not complete the above checklist before the course, unfortunately, it will not be possible to use the supercomputers. --- What software is necessary for this course? The course is platform-independent. It can even be followed by a Windows user, but if possible, avoid it. In general. Forever. -- Visual Studio Code: it's a free editor which we will demo on this course. Get it from https://code.visualstudio.com/download - -- Visual Studio Code Remote Development: https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.vscode-remote-extensionpack - -- Visual Studio: Remote - SSH: https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.remote-ssh - -- (WINDOWS ONLY): WSL. This installs the WSL support for Visual Studio Code, which will install WSL itself (And Ubuntu). https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.remote-wsl - This is a long install, take your time. - PLEASE MAKE SURE WSL IS ACTUALLY INSTALLED - Try running it. Check this example: https://pureinfotech.com/install-windows-subsystem-linux-2-windows-10/ - - A terminal. On Linux and Mac, it's just called "Terminal". Little familiarity with it is required. On windows, the WSL installs it. -- The `ssh` command. It's installed by default on Mac and Linux, and should be on Windows after the aforementioned steps. - - Some knowledge of the Python language. --- -The course material is available at https://go.fzj.de/bringing-dl-workloads-to-jsc - I will be making some final commits to it, so make sure you reload it every now and then. + +The course material is available at https://go.fzj.de/dl-in-neuroscience - I will be making some final commits to it, so make sure you reload it every now and then. 
See you soon, -Alex and Sabrina +Alex, Sabrina and Javad diff --git a/public/01-access-machines.html b/public/01-access-machines.html index 2d41bf7..1aaad8f 100644 --- a/public/01-access-machines.html +++ b/public/01-access-machines.html @@ -3,9 +3,9 @@ - - - Accessing the machines, intro + + + Deep Learning in Neuroscience // on the Supercomputers of the Jülich Supercomputing Centre @@ -43,7 +43,7 @@ } @media print { pre > code.sourceCode { white-space: pre-wrap; } - pre > code.sourceCode > span { display: inline-block; text-indent: -5em; padding-left: 5em; } + pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; } } pre.numberSource code { counter-reset: source-line 0; } @@ -225,9 +225,11 @@
-

Accessing the machines, intro

-

Alexandre Strube // Sabrina Benassou

-

June 25, 2024

+

Deep Learning in Neuroscience // on the +Supercomputers of the Jülich Supercomputing Centre

+

Alexandre Strube // Sabrina Benassou // Javad +Kasravi

+

November 19, 2024

@@ -235,28 +237,22 @@

Communication:

Links for the complementary parts of this course:

Team:

+
+
+Javad Kasravi + +
+

-

Schedule for day 1

+

Schedule

@@ -311,39 +313,39 @@

Schedule for day 1

- + - + - + - + - - + + - + - + - + - + @@ -354,9 +356,9 @@

Schedule for day 1

Note

Please open this document on your own browser! We will need it for the exercises. https://go.fzj.de/bringing-dl-workloads-to-jsc

+href="https://go.fzj.de/dl-in-neuroscience">https://go.fzj.de/dl-in-neuroscience

-Mobile friendly, but you need it on your computer, really @@ -545,17 +547,17 @@

Connecting to Jureca DC

Getting compute time

  • Go to https://go.fzj.de/bringing-dl-workloads-to-jsc-project-join
  • +href="https://go.fzj.de/dl-in-neuroscience-project-join">https://go.fzj.de/dl-in-neuroscience-project-join
  • Join the course project -training2425
  • +training2441
  • Sign the Usage Agreements (Video)
  • Compute time allocation is based on compute projects. For every compute job, a compute project pays.
  • Time is measured in core-hours. One hour of Jureca -DC is 48 core-hours.
  • +DC is 128 core-hours.
  • Example: Job runs for 8 hours on 64 nodes of Jureca -DC: 8 * 64 * 48 = 24576 core-h!
  • +DC: 8 * 64 * 128 = 65536 core-h!
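The accounting rule above is a simple product; a minimal sketch (the 128 cores per Jureca DC node and the 8-hour, 64-node example are the numbers from the text):

```python
def core_hours(wall_hours, nodes, cores_per_node=128):
    # Accounting rule from above: wall-clock hours x nodes x cores per node
    return wall_hours * nodes * cores_per_node

print(core_hours(8, 64))  # 65536 core-h, matching the example above
```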
@@ -574,272 +576,8 @@

Jupyter

Jupyter

-

Pay -attention to the partition - DON’T RUN IT ON THE LOGIN NODE!!!

-
-

Connecting to Jureca DC

-
-
-

VSCode

- -
-
-

VSCode

-

Now with the remote explorer -tab

-

-
-
- -

SSH

-
    -
  • SSH is a secure shell (terminal) connection to -another computer
  • -
  • You connect from your computer to the LOGIN -NODE
  • -
  • Security is given by public/private keys
  • -
  • A connection to the supercomputer needs a -
      -
    1. Key,
    2. -
    3. Configuration
    4. -
    5. Key/IP address known to the supercomputer
    6. -
  • -
-
-
- -

SSH

-

Create key in -VSCode’s Terminal (menu View->Terminal)

-
mkdir ~/.ssh/
-ssh-keygen -a 100 -t ed25519 -f ~/.ssh/id_ed25519-JSC
-
$ ssh-keygen -a 100 -t ed25519 -f ~/.ssh/id_ed25519-JSC
-Generating public/private ed25519 key pair.
-Enter passphrase (empty for no passphrase): 
-Enter same passphrase again: 
-Your identification has been saved in /Users/strube1/.ssh/id_ed25519-JSC
-Your public key has been saved in /Users/strube1/.ssh/id_ed25519-JSC.pub
-The key fingerprint is:
-SHA256:EGNNC1NTaN8fHwpfuZRPa50qXHmGcQjxp0JuU0ZA86U strube1@Strube-16
-The keys randomart image is:
-+--[ED25519 256]--+
-|      *++oo=o. . |
-|     . =+o .= o  |
-|      .... o.E..o|
-|       .  +.+o+B.|
-|        S  =o.o+B|
-|          . o*.B+|
-|          . . =  |
-|           o .   |
-|            .    |
-+----[SHA256]-----+
-
-
- -

SSH

-

Configure SSH session

-
code $HOME/.ssh/config
-

Windows users, from Ubuntu WSL (Change username for your user on -windows)

-
ls -la /mnt/c/Users/
-mkdir /mnt/c/Users/USERNAME/.ssh/
-cp $HOME/.ssh/* /mnt/c/Users/USERNAME/.ssh/
-
-
- -

SSH

-

Configure SSH session

-
Host jureca
-        HostName jureca.fz-juelich.de
-        User [MY_USERNAME]   # Here goes your username, not the word MY_USERNAME.
-        AddressFamily inet
-        IdentityFile ~/.ssh/id_ed25519-JSC
-        MACs hmac-sha2-512-etm@openssh.com
-

Copy contents to the config file and save it

-

REPLACE [MY_USERNAME] WITH YOUR USERNAME!!! 🤦‍♂️

-
-
- -

SSH

-

JSC restricts from where -you can login

-

So we need to:

-
    -
  1. Find our ip range
  2. -
  3. Add the range and key to Judoor
  4. -
-
-
- -

SSH

-

Find your ip/name range

-

Open https://www.whatismyip.com

-
-
- -

SSH

-

Find your ip/name range

-

-
    -
  • Let’s keep this inside vscode: -code key.txt and paste the number you got
  • -
-
-
- -

SSH

-

Did everyone get their own ip address?

-
-
- -

SSH - EXAMPLE

-
    -
  • I will use the number -93.199.55.163
  • -
  • YOUR NUMBER IS DIFFERENT
  • -
  • Seriously
  • -
-
-
- -

SSH - Example: -93.199.55.163

-
    -
  • Go to VSCode and make it simpler, replace the 2nd -half with "0.0/16": -
      -
    • It was 93.199.55.163
    • -
    • Becomes 93.199.0.0/16 (with YOUR -number, not with the example)
    • -
  • -
  • Add a from="" around it
  • -
  • So, it looks like this, now: -from="93.199.0.0/16"
  • -
  • Add a second magic number, with a comma: -,10.0.0.0/8 🧙‍♀️
  • -
  • I promise, the magic is worth it 🧝‍♂️ (If time -allows)
  • -
  • In the end it looks like this: -from="93.199.0.0/16,10.0.0.0/8" 🎬
  • -
  • Keep it open, we will use it later
  • -
  • If you are from FZJ, also add “134.94.0.0/16” with -a comma
  • -
-
-
- -

SSH - Example: -93.199.0.0/16

-

Copy your ssh key

-
    -
  • Terminal: -code ~/.ssh/id_ed25519-JSC.pub

  • -
  • Something like this will open:

  • -
  • ssh-ed25519 AAAAC3NzaC1lZDE1NTA4AAAAIHaoOJF3gqXd7CV6wncoob0DL2OJNfvjgnHLKEniHV6F strube@demonstration.fz-juelich.de
  • -
  • Paste this line at the same key.txt -which you just opened

  • -
-
-
- -

SSH

-

Example: 93.199.0.0/16

-
    -
  • Put them together and copy again:
  • -
  • from="93.199.0.0/16,10.0.0.0/8" ssh-ed25519 AAAAC3NzaC1lZDE1NTA4AAAAIHaoOJF3gqXd7CV6wncoob0DL2OJNfvjgnHLKEniHV6F strube@demonstration.fz-juelich.de
  • -
-
-
- -

SSH

-
    -
  • Let’s add it on Judoor
  • -
  • -
  • Do it for JURECA and JUDAC with the same key
  • -
-
-
- -

SSH

-

Add new key to Judoor

-

-

This might take some minutes

-
-
- -

SSH: Exercise

-

That’s it! Give it a try (and answer yes)

-
$ ssh jureca
-The authenticity of host 'jrlogin03.fz-juelich.de (134.94.0.185)' cannot be established.
-ED25519 key fingerprint is SHA256:ASeu9MJbkFx3kL1FWrysz6+paaznGenChgEkUW8nRQU.
-This key is not known by any other names
-Are you sure you want to continue connecting (yes/no/[fingerprint])? Yes
-**************************************************************************
-*                            Welcome to Jureca DC                   *
-**************************************************************************
-...
-...
-strube1@jrlogin03~ $ 
-
-
- -

SSH: Exercise

-

Make sure you -are connected to the supercomputer

-
# Create a folder for myself
-mkdir $PROJECT_training2425/$USER
-
-# Create a shortcut for the project on the home folder
-rm -rf ~/course ; ln -s $PROJECT_training2425/$USER ~/course
-
-# Enter course folder and
-cd ~/course
-
-# Where am I?
-pwd
-
-# We well need those later
-mkdir ~/course/.cache
-mkdir ~/course/.config
-mkdir ~/course/.fastai
-
-rm -rf $HOME/.cache ; ln -s ~/course/.cache $HOME/
-rm -rf $HOME/.config ; ln -s ~/course/.config $HOME/
-rm -rf $HOME/.fastai ; ln -s ~/course/.fastai $HOME/
-

Working with the supercomputer’s software

@@ -854,27 +592,36 @@

Working with the supercomputer’s software

documentation
+
+

Launcher in Jupyter-JSC

+

+

Software

-

Tool for finding -software: module spider

-
strube1$ module spider PyTorch
-------------------------------------------------------------------------------------
-  PyTorch:
-------------------------------------------------------------------------------------
-    Description:
-      Tensors and Dynamic neural networks in Python with strong GPU acceleration. 
-      PyTorch is a deep learning framework that puts Python first.
-
-     Versions:
-        PyTorch/1.7.0-Python-3.8.5
-        PyTorch/1.8.1-Python-3.8.5
-        PyTorch/1.11-CUDA-11.5
-        PyTorch/1.12.0-CUDA-11.7
-     Other possible modules matches:
-        PyTorch-Geometric  PyTorch-Lightning
-...
+

Connect to terminal

+

+
+
+ +

Tool for finding +software: module spider

+
strube1$ module spider PyTorch
+------------------------------------------------------------------------------------
+  PyTorch:
+------------------------------------------------------------------------------------
+    Description:
+      Tensors and Dynamic neural networks in Python with strong GPU acceleration. 
+      PyTorch is a deep learning framework that puts Python first.
+
+     Versions:
+        PyTorch/1.7.0-Python-3.8.5
+        PyTorch/1.8.1-Python-3.8.5
+        PyTorch/1.11-CUDA-11.5
+        PyTorch/1.12.0-CUDA-11.7
+     Other possible modules matches:
+        PyTorch-Geometric  PyTorch-Lightning
+...

What do we have?

@@ -911,31 +658,31 @@

Example: PyTorch

Example: PyTorch

(make sure you are still connected to Jureca DC)

-
$ python
--bash: python: command not found
+
$ python
+-bash: python: command not found

Oh noes! 🙈

Let’s bring Python together with PyTorch!

Example: PyTorch

Copy and paste these lines

-
# This command fails, as we have no proper python
-python 
-# So, we load the correct modules...
-module load Stages/2024
-module load GCC OpenMPI Python PyTorch
-# And we run a small test: import pytorch and ask its version
-python -c "import torch ; print(torch.__version__)" 
+
# This command fails, as we have no proper python
+python 
+# So, we load the correct modules...
+module load Stages/2024
+module load GCC OpenMPI Python PyTorch
+# And we run a small test: import pytorch and ask its version
+python -c "import torch ; print(torch.__version__)" 

Should look like this:

-
$ python
--bash: python: command not found
-$ module load Stages/2024
-$ module load GCC OpenMPI Python PyTorch
-$ python -c "import torch ; print(torch.__version__)" 
-2.1.0
+
$ python
+-bash: python: command not found
+$ module load Stages/2024
+$ module load GCC OpenMPI Python PyTorch
+$ python -c "import torch ; print(torch.__version__)" 
+2.1.0

Python Modules

@@ -943,78 +690,63 @@

Python Modules

id="some-of-the-python-softwares-are-part-of-python-itself-or-of-other-softwares.-use-module-key">Some of the python softwares are part of Python itself, or of other softwares. Use “module key” -
module key toml
-The following modules match your search criteria: "toml"
-------------------------------------------------------------------------------------
-
-  Jupyter: Jupyter/2020.2.5-Python-3.8.5, Jupyter/2021.3.1-Python-3.8.5, Jupyter/2021.3.2-Python-3.8.5, Jupyter/2022.3.3, Jupyter/2022.3.4
-    Project Jupyter exists to develop open-source software, open-standards, and services for interactive computing across dozens of programming languages.
-    
-
-  PyQuil: PyQuil/3.0.1
-    PyQuil is a library for generating and executing Quil programs on the Rigetti Forest platform.
-
-  Python: Python/3.8.5, Python/3.9.6, Python/3.10.4
-    Python is a programming language that lets you work more quickly and integrate your systems more effectively.
-
-------------------------------------------------------------------------------------
-
-
-

VSCode

-

Editing files on the -supercomputers

-

-
-
-

VSCode

-

-
-
-

VSCode

-
    -
  • You can have a terminal inside VSCode: -
      -
    • Go to the menu View->Terminal
    • -
  • -
+
module key toml
+The following modules match your search criteria: "toml"
+------------------------------------------------------------------------------------
+
+  Jupyter: Jupyter/2020.2.5-Python-3.8.5, Jupyter/2021.3.1-Python-3.8.5, Jupyter/2021.3.2-Python-3.8.5, Jupyter/2022.3.3, Jupyter/2022.3.4
+    Project Jupyter exists to develop open-source software, open-standards, and services for interactive computing across dozens of programming languages.
+    
+
+  PyQuil: PyQuil/3.0.1
+    PyQuil is a library for generating and executing Quil programs on the Rigetti Forest platform.
+
+  Python: Python/3.8.5, Python/3.9.6, Python/3.10.4
+    Python is a programming language that lets you work more quickly and integrate your systems more effectively.
+
+------------------------------------------------------------------------------------
-
-

VSCode

-
    -
  • From the VSCode’s terminal, navigate to your -“course” folder and to the name you created earlier.

  • -
  • cd $HOME/course/
    -pwd
  • -
  • This is out working directory. We do everything -here.

  • -
+
+ +

How to run it on the login +node

+

create a python file

+

-

Demo code

-

Create a new -file “matrix.py” on VSCode on Jureca DC

-
code matrix.py
-

Paste this into the file:

-
import torch
-
-matrix1 = torch.randn(3,3)
-print("The first matrix is", matrix1)
-
-matrix2 = torch.randn(3,3)
-print("The second matrix is", matrix2)
-
-result = torch.matmul(matrix1,matrix2)
-print("The result is:\n", result)
+

create a python file

+

-

How to run it on the login -node

+

create a python file

+

+
+
+ +

create a python file

+
import torch
+
+matrix1 = torch.randn(3,3)
+print("The first matrix is", matrix1)
+
+matrix2 = torch.randn(3,3)
+print("The second matrix is", matrix2)
+
+result = torch.matmul(matrix1,matrix2)
+print("The result is:\n", result)
+
+
+ +

create a python file

+

+
+
+ +

Run code on the login node

module load Stages/2024
 module load GCC OpenMPI PyTorch
 python matrix.py
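For readers without the PyTorch modules loaded, what matrix.py computes can be sketched in pure Python (matmul here is a hypothetical stand-in for torch.matmul, not part of the course code):

```python
import random

def matmul(a, b):
    # Textbook triple-loop product of two matrices (lists of rows)
    rows, inner, cols = len(a), len(b), len(b[0])
    return [[sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for i in range(rows)]

# Two random 3x3 matrices, like torch.randn(3, 3) in matrix.py
m1 = [[random.gauss(0, 1) for _ in range(3)] for _ in range(3)]
m2 = [[random.gauss(0, 1) for _ in range(3)] for _ in range(3)]
print("The result is:", matmul(m1, m2))
```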
@@ -1048,32 +780,34 @@

Slurm submission file

Slurm submission file example

-

code jureca-matrix.sbatch

-
#!/bin/bash
-#SBATCH --account=training2425           # Who pays?
-#SBATCH --nodes=1                        # How many compute nodes
-#SBATCH --job-name=matrix-multiplication
-#SBATCH --ntasks-per-node=1              # How many mpi processes/node
-#SBATCH --cpus-per-task=1                # How many cpus per mpi proc
-#SBATCH --output=output.%j        # Where to write results
-#SBATCH --error=error.%j
-#SBATCH --time=00:01:00          # For how long can it run?
-#SBATCH --partition=dc-gpu         # Machine partition
-#SBATCH --reservation=training2425 # For today only
-
-module load Stages/2024
-module load GCC OpenMPI PyTorch  # Load the correct modules on the compute node(s)
-
-srun python matrix.py            # srun tells the supercomputer how to run it
+

Create a file named jureca-matrix.sbatch as described in +the previous section, and copy the following content into it.

+
#!/bin/bash
+#SBATCH --account=training2441           # Who pays?
+#SBATCH --nodes=1                        # How many compute nodes
+#SBATCH --job-name=matrix-multiplication
+#SBATCH --ntasks-per-node=1              # How many mpi processes/node
+#SBATCH --cpus-per-task=1                # How many cpus per mpi proc
+#SBATCH --output=output.%j        # Where to write results
+#SBATCH --error=error.%j
+#SBATCH --time=00:01:00          # For how long can it run?
+#SBATCH --partition=dc-gpu         # Machine partition
+#SBATCH --reservation=training2441 # For today only
+
+module load Stages/2024
+module load GCC OpenMPI PyTorch  # Load the correct modules on the compute node(s)
+
+srun python matrix.py            # srun tells the supercomputer how to run it

Submitting a job: SBATCH

-
sbatch jureca-matrix.sbatch
-
-Submitted batch job 412169
+
sbatch jureca-matrix.sbatch
+
+Submitted batch job 412169
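Scripts often need the job id that sbatch prints; a minimal parsing sketch (job_id is a hypothetical helper, the message format is the one shown above):

```python
def job_id(sbatch_output: str) -> int:
    # sbatch prints e.g. "Submitted batch job 412169"
    return int(sbatch_output.strip().rsplit(" ", 1)[-1])

print(job_id("Submitted batch job 412169"))  # 412169
```

The same id is what squeue shows and what scancel expects.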
@@ -1084,10 +818,10 @@

Are we there yet?

Are we there yet? 🐴

squeue --me

-
squeue --me
-   JOBID  PARTITION    NAME      USER    ST       TIME  NODES NODELIST(REASON)
-   412169 gpus         matrix-m  strube1 CF       0:02      1 jsfc013
+
squeue --me
+   JOBID  PARTITION    NAME      USER    ST       TIME  NODES NODELIST(REASON)
+   412169 gpus         matrix-m  strube1 CF       0:02      1 jsfc013

ST is status:

  • PD (pending),
  • @@ -1104,14 +838,14 @@

    Reservations

  • Some partitions have reservations, which means that only certain users can use them at certain times.
  • For this course, it’s called -training2425
  • +training2441

Job is wrong, need to cancel

-
scancel <JOBID>
+
scancel <JOBID>
@@ -1120,11 +854,8 @@

Check logs

id="by-now-you-should-have-output-and-error-log-files-on-your-directory.-check-them">By now you should have output and error log files on your directory. Check them! -
# Notice that this number is the job id. It's different for every job
-cat output.412169 
-cat error.412169 
-

Or simply open it on VSCode!

+

simply open output.412169 and error.412169 +using an editor!

Extra software, modules and kernels

@@ -1133,9 +864,9 @@

You want that extra

Venv/Kernel template

-
cd $HOME/course/
-git clone https://gitlab.jsc.fz-juelich.de/kesselheim1/sc_venv_template.git
+
cd $HOME/course/
+git clone https://gitlab.jsc.fz-juelich.de/kesselheim1/sc_venv_template.git

Example: Let’s install some software!

@@ -1154,11 +885,11 @@

Example: Let’s install
  • Edit the file sc_venv_template/requirements.txt

  • Add these lines at the end:

  • -
  • fastai
    -wandb
    -accelerate
    -deepspeed
  • +
  • fastai
    +wandb
    +accelerate
    +deepspeed
  • Run on the terminal: sc_venv_template/setup.sh

  • @@ -1168,27 +899,27 @@

    Example: Let’s install

    Example: Activating the virtual environment

      -
    • source sc_venv_template/activate.sh
    • +
    • source sc_venv_template/activate.sh

    Example: Activating the virtual environment

    -
    source ./activate.sh 
    -The activation script must be sourced, otherwise the virtual environment will not work.
    -Setting vars
    -The following modules were not unloaded:
    -  (Use "module --force purge" to unload all):
    - 1) Stages/2024
    -
    jureca01 $ python
    -Python 3.11.3 (main, Jun 25 2023, 13:17:30) [GCC 12.3.0]
    ->>> import fastai
    ->>> fastai.__version__
    -'2.7.14'
    +
    source ./activate.sh 
    +The activation script must be sourced, otherwise the virtual environment will not work.
    +Setting vars
    +The following modules were not unloaded:
    +  (Use "module --force purge" to unload all):
    + 1) Stages/2024
    +
    jureca01 $ python
    +Python 3.11.3 (main, Jun 25 2023, 13:17:30) [GCC 12.3.0]
    +>>> import fastai
    +>>> fastai.__version__
    +'2.7.14'
    @@ -1196,60 +927,60 @@

    Let’s train a 🐈 classifier!

    • This is a minimal demo, to show some quirks of the supercomputer

    code cats.py

    from fastai.vision.all import *
    from fastai.callback.tensorboard import *
    #
    print("Downloading dataset...")
    path = untar_data(URLs.PETS)/'images'
    print("Finished downloading dataset")
    #
    def is_cat(x): return x[0].isupper()
    # Create the dataloaders and resize the images
    dls = ImageDataLoaders.from_name_func(
        path, get_image_files(path), valid_pct=0.2, seed=42,
        label_func=is_cat, item_tfms=Resize(224))
    print("On the login node, this will download resnet34")
    learn = vision_learner(dls, resnet34, metrics=accuracy)
    cbs=[SaveModelCallback(), TensorBoardCallback('runs', trace_model=True)]
    # Trains the model for 6 epochs with this dataset
    learn.unfreeze()
    learn.fit_one_cycle(6, cbs=cbs)
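The labeling rule above relies on a quirk of the Oxford-IIIT Pets filenames: cat breeds are capitalized, dog breeds are lowercase. A minimal pure-Python sketch of that rule (the filenames below are illustrative, but they follow the dataset's real naming convention):

```python
# The Pets dataset encodes the class in the filename's case:
# cat breeds start with an uppercase letter, dog breeds with lowercase.
def is_cat(filename: str) -> bool:
    return filename[0].isupper()

print(is_cat("Bengal_101.jpg"))  # cat breed  -> True
print(is_cat("pug_52.jpg"))      # dog breed  -> False
```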

    Submission file for the classifier

    code fastai.sbatch

    #!/bin/bash
    #SBATCH --account=training2441
    #SBATCH --mail-user=MYUSER@fz-juelich.de
    #SBATCH --mail-type=ALL
    #SBATCH --nodes=1
    #SBATCH --job-name=cat-classifier
    #SBATCH --ntasks-per-node=1
    #SBATCH --cpus-per-task=128
    #SBATCH --output=output.%j
    #SBATCH --error=error.%j
    #SBATCH --time=00:20:00
    #SBATCH --partition=dc-gpu
    #SBATCH --reservation=training2441 # For today only

    cd $HOME/course/
    source sc_venv_template/activate.sh # Now we finally use the fastai module

    srun python cats.py

    Submit it

    sbatch fastai.sbatch

    Submission time

    Probably not much happening…

    • $ cat output.7948496
      The activation script must be sourced, otherwise the virtual environment will not work.
      Setting vars
      Downloading dataset...

    • $ cat error.7948496
      The following modules were not unloaded:
      (Use "module --force purge" to unload all):

      1) Stages/2024

    What happened?

  • Check the error.${JOBID} file
  • If you run it longer, you will get the actual error:
  • Traceback (most recent call last):
      File "/p/project/training2441/strube1/cats.py", line 5, in <module>
        path = untar_data(URLs.PETS)/'images'
        ...
        ...
        raise URLError(err)
    urllib.error.URLError: <urlopen error [Errno 110] Connection timed out>
    srun: error: jwb0160: task 0: Exited with exit code 1

    🤔…

    What is it doing?

    • This downloads the dataset:

      path = untar_data(URLs.PETS)/'images'

    • And this one downloads the pre-trained weights:

      learn = vision_learner(dls, resnet34, metrics=error_rate)
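Running the downloads on the login node works because login and compute nodes share the same filesystem: fast.ai and PyTorch cache what they fetch, so the compute node just reuses the cached copy. The pattern is simply "download only if not cached"; a hedged stdlib sketch (the path and fake download are illustrative, not the library's actual cache logic):

```python
import os
import tempfile
from pathlib import Path

def ensure_cached(path: str, download) -> str:
    """Download a file only when it is not already on the shared filesystem."""
    if not os.path.exists(path):
        download(path)   # needs internet: run this on the login node
    return path          # compute nodes just reuse the cached copy

# Illustrative: a fake "download" into a temporary directory
cache = os.path.join(tempfile.mkdtemp(), "resnet34.pth")
ensure_cached(cache, lambda p: Path(p).write_text("weights"))

# A second call finds the cache and never tries to download again:
def fail(p):
    raise RuntimeError("no internet on compute nodes")
ensure_cached(cache, fail)
print("cached at", cache)
```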

    Compute nodes have no internet connection

    On the login node:

    • Comment out the line which does the AI training:

      # learn.fit_one_cycle(6, cbs=cbs)

    • Call our code on the login node!

      source sc_venv_template/activate.sh # So that we have the fast.ai library
      python cats.py

    Run the downloader on the login node

    $ source sc_venv_template/activate.sh
    $ python cats.py
    Downloading dataset...
     |████████-------------------------------| 23.50% [190750720/811706944 00:08<00:26]
     Downloading: "https://download.pytorch.org/models/resnet34-b627a593.pth" to /p/project/ccstao/cstao05/.cache/torch/hub/checkpoints/resnet34-b627a593.pth
    100%|█████████████████████████████████████| 83.3M/83.3M [00:00<00:00, 266MB/s]

    Run it again on the compute nodes!

    • Un-comment the line that does the training:

      learn.fit_one_cycle(6, cbs=cbs)

    • Submit the job!

      sbatch fastai.sbatch

    Masochistically waiting for the job to run?

    watch squeue --me

    (To exit, type CTRL-C)

    Check output files

    • You can see them within VSCode

      The activation script must be sourced, otherwise the virtual environment will not work.
      Setting vars
      Downloading dataset...
      Finished downloading dataset
      epoch     train_loss  valid_loss  error_rate  time
      Epoch 1/1 : |-----------------------------------| 0.00% [0/92 00:00<?]
      Epoch 1/1 : |-----------------------------------| 2.17% [2/92 00:14<10:35 1.7452]
      Epoch 1/1 : |█----------------------------------| 3.26% [3/92 00:14<07:01 1.6413]
      Epoch 1/1 : |██---------------------------------| 5.43% [5/92 00:15<04:36 1.6057]
      ...
      Epoch 1/1 :
      epoch     train_loss  valid_loss  error_rate  time
      0         0.049855    0.021369    0.007442    00:42
    • 🎉
    • 🥳

    Tools for results analysis

    Tensorboard
  • And we already have the code for it in our example!

    cbs=[SaveModelCallback(), TensorBoardCallback('runs', trace_model=True)]

    Example: Tensorboard

    • The command

      tensorboard --logdir=runs  --port=9999 serve

    • Opens a connection on port 9999… OF THE SUPERCOMPUTER.
    • This port is behind the firewall. You can’t access it directly.

      Port Forwarding

      supercomputer’s port 3000 as port 1234 locally

    Port forwarding demo:

    • On VSCode’s terminal:

      cd $HOME/course/
      source sc_venv_template/activate.sh
      tensorboard --logdir=runs  --port=12345 serve

    • Note the tab PORTS next to the terminal

    • On the browser: http://localhost:12345

    Tensorboard on Jureca DC


    Day 1 recap

    ANY QUESTIONS??

    Feedback is more than welcome!


    Helmholtz Blablador

    Blablador

    • Blablador is our Large Language Model inference server (eg. ChatGPT)

    • It’s a service for the Helmholtz Association.

      • It’s fast, free and PRIVATE - I don’t record your conversations!

    • Anyone here can use it

    Blablador

    https://helmholtz-blablador.fz-juelich.de

    VScode + Continue.dev


    Obtaining a token

    • Go to the Helmholtz Codebase at http://codebase.helmholtz.cloud

    • Log in with your email

    • On the left side, click on your profile, and then on “Preferences”

    • On “Access tokens”, click “Add new token”:

      • give it a name,
      • put an expiration date (max 1 year),
      • and choose “api” in the “scopes” section

    • Click “Create Personal Access Token”

      • You will see a “………………………..” - copy this and save it somewhere.

    Blablador


    Blablador on VSCode!

    • Add the continue.dev extension to VSCode

    • On Continue, choose to add a model, then choose Other OpenAI-compatible API

    • Click on Open Config.json at the end

    Blablador: VScode + Continue.dev

    • Inside config.json, add at the "models" section:

        {
          "title": "Mistral helmholtz",
          "provider": "openai",
          "contextLength": 16384,
          "model": "alias-code",
          "apiKey": "ADD-YOUR-TOKEN-HERE",
          "apiBase": "https://helmholtz-blablador.fz-juelich.de:8000"
        },

    • REPLACE THE APIKEY WITH YOUR OWN TOKEN!!!!

    Blablador on VSCode

    • Click on the Continue.dev extension on the left side of VSCode.

    • Select some code from our exercises and send it to Continue with cmd-shift-L (or ctrl-shift-L)

    • Ask it to add unit tests, for example.

    Backup slides

    There’s more!

    • Remember the magic? 🧙‍♂️

    • Let’s use it now to access the compute nodes directly!

    Proxy Jump

    Accessing compute nodes directly

    • If we need to access some ports on the compute nodes

    Proxy Jump - SSH Configuration

    Type on your machine “code $HOME/.ssh/config” and paste this at the end:

    # -- Compute Nodes --
    Host *.jureca
            User [ADD YOUR USERNAME HERE]
            StrictHostKeyChecking no
            IdentityFile ~/.ssh/id_ed25519-JSC
            ProxyJump jureca

    Proxy Jump: Connecting to a node

    Example: A service provides a web interface on port 9999

    On the supercomputer:

    srun --time=00:05:00 \
         --nodes=1 --ntasks=1 \
         --partition=dc-gpu \
         --account training2441 \
         --cpu_bind=none \
         --pty /bin/bash -i

    bash-4.4$ hostname # This is running on a compute node of the supercomputer
    jwb0002

    bash-4.4$ cd $HOME/course/
    bash-4.4$ source sc_venv_template/activate.sh
    bash-4.4$ tensorboard --logdir=runs  --port=9999 serve

    Proxy Jump

    On your machine:

    • ssh -L :3334:localhost:9999 jrc002i.jureca

    • Mind the i letter I added at the end of the hostname

    • Now you can access the service on your local browser at http://localhost:3334

    Now that’s really the end! 😓

diff --git a/public/03-parallelize-training.html b/public/02-parallelize-training.html

    Bringing Deep Learning Workloads to JSC supercomputers

    Parallelize Training

    Alexandre Strube // Sabrina Benassou // Javad Kasravi

    November 19, 2024

    Good practice

    • Always store your code in the project folder. In our case:

      /p/project/training2441/$USER

    • Store data in the scratch directory for faster I/O access. Files in scratch are deleted after 90 days of inactivity.

      /p/scratch/training2441/$USER

    • Store the data in $DATA_dataset for a more permanent location. This location is not accessible by compute nodes. You have to join the project in order to store and access data.
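As a convenience, the locations above can be derived from the user name; a minimal sketch (the helper function is illustrative, with the course project name hard-coded):

```python
import os

PROJECT = "training2441"  # course project name

def course_paths(user: str) -> dict:
    """Return the storage locations recommended above for a given user."""
    return {
        "code":    f"/p/project/{PROJECT}/{user}",  # long-term, for code
        "scratch": f"/p/scratch/{PROJECT}/{user}",  # fast I/O, purged after 90 days
    }

paths = course_paths(os.getenv("USER", "demo"))
print(paths["code"])
```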

    We need to download some code

    cd $HOME/course
    git clone https://github.com/HelmholtzAI-FZJ/2024-11-course-deep-learning-in-neuroscience

    The ResNet50 Model


    The ImageNet dataset

    Large Scale Visual Recognition Challenge (ILSVRC)

    • An image dataset organized according to the WordNet hierarchy.

    • Extensively used in algorithms for object detection and image classification at large scale.

    • It has 1000 classes, comprising 1.2 million training images and 50,000 validation images.

    ImageNet class

    import os
    import pickle

    from PIL import Image
    from torch.utils.data import Dataset

    class ImageNet(Dataset):
        def __init__(self, root, split, transform=None):
            if split not in ["train", "val"]:
                raise ValueError("split must be either 'train' or 'val'")

            self.root = root

            with open(os.path.join(root, "imagenet_{}.pk".format(split)), "rb") as f:
                data = pickle.load(f)

            self.samples = list(data.keys())
            self.targets = list(data.values())
            self.transform = transform

        def __len__(self):
            return len(self.samples)

        def __getitem__(self, idx):
            x = Image.open(os.path.join(self.root, self.samples[idx])).convert("RGB")
            if self.transform:
                x = self.transform(x)
            return x, self.targets[idx]

    PyTorch Lightning Data Module

    class ImageNetDataModule(pl.LightningDataModule):
        def __init__(
            self,
            data_root: str,
            batch_size: int,
            num_workers: int,
            dataset_transforms: dict,
        ):
            super().__init__()
            self.data_root = data_root
            self.batch_size = batch_size
            self.num_workers = num_workers
            self.dataset_transforms = dataset_transforms

        def setup(self, stage: Optional[str] = None):
            self.train = ImageNet(self.data_root, "train", self.dataset_transforms)

        def train_dataloader(self):
            return DataLoader(self.train, batch_size=self.batch_size, \
                num_workers=self.num_workers)

    PyTorch Lightning Module

    class resnet50Model(pl.LightningModule):
        def __init__(self):
            super().__init__()
            weights = ResNet50_Weights.DEFAULT
            self.model = resnet50(weights=weights)

        def forward(self, x):
            return self.model(x)

        def training_step(self, batch):
            x, labels = batch
            pred = self.forward(x)
            train_loss = F.cross_entropy(pred, labels)
            self.log("training_loss", train_loss)

            return train_loss

        def configure_optimizers(self):
            return torch.optim.Adam(self.parameters(), lr=0.02)
    One GPU training

    transform = transforms.Compose([
        transforms.ToTensor(),
        transforms.Resize((256, 256))
    ])

    # 1. Organize the data
    datamodule = ImageNetDataModule("/p/scratch/training2441/", 256, \
        int(os.getenv('SLURM_CPUS_PER_TASK')), transform)
    # 2. Build the model using desired Task
    model = resnet50Model()
    # 3. Create the trainer
    trainer = pl.Trainer(max_epochs=10,  accelerator="gpu")
    # 4. Train the model
    trainer.fit(model, datamodule=datamodule)
    # 5. Save the model!
    trainer.save_checkpoint("image_classification_model.pt")

    One GPU training

    #!/bin/bash -x
    #SBATCH --nodes=1
    #SBATCH --gres=gpu:1
    #SBATCH --ntasks-per-node=1
    #SBATCH --cpus-per-task=128
    #SBATCH --time=06:00:00
    #SBATCH --partition=dc-gpu
    #SBATCH --account=training2441
    #SBATCH --output=%j.out
    #SBATCH --error=%j.err
    #SBATCH --reservation=training2441

    # To get number of cpu per task
    export SRUN_CPUS_PER_TASK="$SLURM_CPUS_PER_TASK"
    # activate env
    source $HOME/course/$USER/sc_venv_template/activate.sh
    # run script from above
    time srun python3 gpu_training.py

    real    342m11.864s

    DEMO

    But what about many GPUs?

    • We make use of the GPUs of our supercomputer and distribute our training to make it faster.
    • It’s when things get interesting

    Distributed Training

    • Parallelizes the training across multiple nodes, significantly enhancing training speed and model accuracy.

    • It is particularly beneficial for large models and computationally intensive tasks, such as deep learning.[1]

    Parallel Training with PyTorch DDP

    • PyTorch’s DDP (Distributed Data Parallel) works as follows:

      • Each GPU across each node gets its own process.

      • Each GPU gets visibility into a subset of the overall dataset. It will only ever see that subset.

      • Each process inits the model.

      • Each process performs a full forward and backward pass in parallel.

      • The gradients are synced and averaged across all processes.

      • Each process updates its optimizer.
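The "each GPU sees only a subset" point is what a distributed sampler provides under the hood; a pure-Python sketch of round-robin index partitioning (the real logic, including shuffling and padding, lives in torch.utils.data.distributed.DistributedSampler):

```python
def partition_indices(num_samples: int, world_size: int, rank: int) -> list:
    """Give each rank a disjoint, round-robin slice of the dataset indices."""
    return list(range(rank, num_samples, world_size))

# With 8 samples and 4 processes, each rank sees 2 distinct samples;
# together the slices are disjoint and cover every index exactly once.
for rank in range(4):
    print(rank, partition_indices(8, 4, rank))
```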

    Data Parallel


    Data Parallel - Averaging
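The averaging step sketched here is an all-reduce: every process contributes its local gradient and receives the mean. A toy simulation of just the arithmetic, with plain lists instead of real inter-process communication:

```python
def allreduce_mean(local_grads):
    """Average per-parameter gradients across simulated processes."""
    world_size = len(local_grads)
    return [sum(g[i] for g in local_grads) / world_size
            for i in range(len(local_grads[0]))]

# Two processes computed different gradients on their data shards:
grads = allreduce_mean([[1.0, 2.0], [3.0, 4.0]])
print(grads)  # [2.0, 3.0] - every process continues with the same averaged gradient
```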

    Multi-GPU training

    1 node and 4 GPU

    #!/bin/bash -x
    #SBATCH --nodes=1
    #SBATCH --gres=gpu:4                  # Use the 4 GPUs available
    #SBATCH --ntasks-per-node=4           # When using pl it should always be set to 4
    #SBATCH --cpus-per-task=32            # Divide the number of cpus (128) by the number of GPUs (4)
    #SBATCH --time=02:00:00
    #SBATCH --partition=dc-gpu
    #SBATCH --account=training2441
    #SBATCH --output=%j.out
    #SBATCH --error=%j.err
    #SBATCH --reservation=training2441

    export CUDA_VISIBLE_DEVICES=0,1,2,3    # Very important to make the GPUs visible
    export SRUN_CPUS_PER_TASK="$SLURM_CPUS_PER_TASK"

    source $HOME/course/$USER/sc_venv_template/activate.sh
    time srun python3 gpu_training.py

    real    89m15.923s

    DEMO


    Data Parallel - Multi Node

    Data Parallel - Multi Node


    DDP steps

    1. Set up the environment variables for the distributed mode (WORLD_SIZE, RANK, LOCAL_RANK, …)

      # The number of total processes started by Slurm.
      ntasks = os.getenv('SLURM_NTASKS')
      # Index of the current process.
      rank = os.getenv('SLURM_PROCID')
      # Index of the current process on this node only.
      local_rank = os.getenv('SLURM_LOCALID')
      # The number of nodes
      nnodes = os.getenv("SLURM_NNODES")
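Note that os.getenv returns strings (or None outside of a Slurm job), so these values are typically converted to integers with single-process defaults; a minimal sketch (the helper name is illustrative):

```python
import os

def slurm_env(name: str, default: int) -> int:
    """Read a Slurm environment variable as an int, with a fallback for local runs."""
    value = os.getenv(name)
    return int(value) if value is not None else default

world_size = slurm_env("SLURM_NTASKS", 1)   # total processes
rank       = slurm_env("SLURM_PROCID", 0)   # global index of this process
local_rank = slurm_env("SLURM_LOCALID", 0)  # index on this node
print(world_size, rank, local_rank)
```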

    DDP steps

    2. Initialize a sampler to specify the sequence of indices/keys used in data loading.

    3. Implement data parallelism of the model.

    4. Allow only one process to save checkpoints.

      datamodule = ImageNetDataModule("/p/scratch/training2441/", 256, \
          int(os.getenv('SLURM_CPUS_PER_TASK')), transform)
      trainer = pl.Trainer(max_epochs=10,  accelerator="gpu", num_nodes=nnodes)
      trainer.fit(model, datamodule=datamodule)
      trainer.save_checkpoint("image_classification_model.pt")
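"Only one process saves checkpoints" is usually enforced by guarding the save with a rank check (Lightning does this internally through its rank-zero utilities); the underlying idea in plain Python:

```python
def save_on_rank_zero(rank: int, save_fn) -> bool:
    """Run save_fn only on the process with global rank 0; report whether it ran."""
    if rank == 0:
        save_fn()
        return True
    return False

saved = []
# Simulate 4 processes all reaching the checkpoint step:
for rank in range(4):
    save_on_rank_zero(rank, lambda: saved.append("checkpoint.pt"))
print(saved)  # only one checkpoint is written
```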

    Multi-Node training

    transform = transforms.Compose([
        transforms.ToTensor(),
        transforms.Resize((256, 256))
    ])

    # 1. The number of nodes
    nnodes = int(os.getenv("SLURM_NNODES"))
    # 2. Organize the data
    datamodule = ImageNetDataModule("/p/scratch/training2441/", 128, \
        int(os.getenv('SLURM_CPUS_PER_TASK')), transform)
    # 3. Build the model using desired Task
    model = resnet50Model()
    # 4. Create the trainer
    trainer = pl.Trainer(max_epochs=10,  accelerator="gpu", num_nodes=nnodes)
    # 5. Train the model
    trainer.fit(model, datamodule=datamodule)
    # 6. Save the model!
    trainer.save_checkpoint("image_classification_model.pt")

    Multi-Node training

    16 nodes and 4 GPUs each

    #!/bin/bash -x
    #SBATCH --nodes=16                     # This needs to match Trainer(num_nodes=...)
    #SBATCH --gres=gpu:4                   # Use the 4 GPUs available
    #SBATCH --ntasks-per-node=4            # When using pl it should always be set to 4
    #SBATCH --cpus-per-task=32             # Divide the number of cpus (128) by the number of GPUs (4)
    #SBATCH --time=00:15:00
    #SBATCH --partition=dc-gpu
    #SBATCH --account=training2441
    #SBATCH --output=%j.out
    #SBATCH --error=%j.err
    #SBATCH --reservation=training2441

    export CUDA_VISIBLE_DEVICES=0,1,2,3    # Very important to make the GPUs visible
    export SRUN_CPUS_PER_TASK="$SLURM_CPUS_PER_TASK"

    source $HOME/course/$USER/sc_venv_template/activate.sh
    time srun python3 ddp_training.py

    real    6m56.457s

    Multi-Node training

    With 4 nodes:

    real    24m48.169s

    With 8 nodes:

    real    13m10.722s

    With 16 nodes:

    real    6m56.457s

    With 32 nodes:

    real    4m48.313s
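These wall-clock times translate into a near-linear speedup that flattens as communication overhead grows; a quick calculation against the single-GPU baseline (342m11.864s) using the timings above:

```python
# Wall-clock times from the runs above, in seconds
baseline = 342 * 60 + 11.864          # 1 GPU
runs = {
    4:  24 * 60 + 48.169,             # 4 nodes  = 16 GPUs
    8:  13 * 60 + 10.722,             # 8 nodes  = 32 GPUs
    16:  6 * 60 + 56.457,             # 16 nodes = 64 GPUs
    32:  4 * 60 + 48.313,             # 32 nodes = 128 GPUs
}

for nodes, seconds in runs.items():
    speedup = baseline / seconds
    efficiency = speedup / (nodes * 4)  # 4 GPUs per node
    print(f"{nodes:2d} nodes: speedup {speedup:5.1f}x, efficiency {efficiency:.0%}")
```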

    Data Parallel

    • It was

      trainer = pl.Trainer(max_epochs=10,  accelerator="gpu")

    • Became

      nnodes = os.getenv("SLURM_NNODES")
      trainer = pl.Trainer(max_epochs=10,  accelerator="gpu", num_nodes=nnodes)

    Data Parallel

    • It was

      #SBATCH --nodes=1
      #SBATCH --gres=gpu:1
      #SBATCH --ntasks-per-node=1
      #SBATCH --cpus-per-task=128

    • Became

      #SBATCH --nodes=16                   # This needs to match Trainer(num_nodes=...)
      #SBATCH --gres=gpu:4                 # Use the 4 GPUs available
      #SBATCH --ntasks-per-node=4          # When using pl it should always be set to 4
      #SBATCH --cpus-per-task=32           # Divide the number of cpus (128) by the number of GPUs (4)
      export CUDA_VISIBLE_DEVICES=0,1,2,3  # Very important to make the GPUs visible

    DEMO


    Before we go further…


      Recap


---

### Terminologies

- WORLD_SIZE: number of processes participating in the job.
- RANK: the rank of the process in the network.
- LOCAL_RANK: the rank of the process on the local machine.
- MASTER_PORT: free port on the machine with rank 0.
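Slurm hands these values to every process through environment variables. A quick sketch of how they relate (Slurm sets `SLURM_PROCID` directly as the global rank; the arithmetic below just shows the relationship, and all values are made up for a 2-node, 4-GPU-per-node job):

```python
import os

# Hypothetical values, as Slurm would set them for one particular process:
os.environ["SLURM_NNODES"] = "2"
os.environ["SLURM_NTASKS"] = "8"     # -> WORLD_SIZE
os.environ["SLURM_NODEID"] = "1"     # index of the node this process runs on
os.environ["SLURM_LOCALID"] = "2"    # -> LOCAL_RANK

world_size = int(os.environ["SLURM_NTASKS"])
local_rank = int(os.environ["SLURM_LOCALID"])
tasks_per_node = world_size // int(os.environ["SLURM_NNODES"])

# Global RANK: all ranks on earlier nodes, plus the local rank here.
rank = int(os.environ["SLURM_NODEID"]) * tasks_per_node + local_rank

print(world_size, rank, local_rank)  # 8 6 2
```

Note that the variables arrive as strings and must be converted with `int()` before any arithmetic.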

---

### DDP steps

1. Set up the environment variables for the distributed mode (WORLD_SIZE, RANK, LOCAL_RANK…)

```python
# The number of total processes started by Slurm.
ntasks = os.getenv('SLURM_NTASKS')
# Index of the current process.
rank = os.getenv('SLURM_PROCID')
# Index of the current process on this node only.
local_rank = os.getenv('SLURM_LOCALID')
# The number of nodes (as an int, for Trainer(num_nodes=...))
nnodes = int(os.getenv("SLURM_NNODES"))
```

---

### DDP steps

2. Initialize a sampler to specify the sequence of indices/keys used in data loading.
3. Implement data parallelism of the model.
4. Allow only one process to save checkpoints.

```python
datamodule = ImageNetDataModule("/p/scratch/training2425/data/", 256, \
    int(os.getenv('SLURM_CPUS_PER_TASK')), transform)
trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes)
trainer.fit(model, datamodule=datamodule)
trainer.save_checkpoint("image_classification_model.pt")
```

---

### DDP steps

```python
transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Resize((256, 256))
])

# 1. The number of nodes
nnodes = int(os.getenv("SLURM_NNODES"))
# 2. Organize the data
datamodule = ImageNetDataModule("/p/scratch/training2425/data/", 128, \
    int(os.getenv('SLURM_CPUS_PER_TASK')), transform)
# 3. Build the model using desired Task
model = resnet50Model()
# 4. Create the trainer
trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes)
# 5. Train the model
trainer.fit(model, datamodule=datamodule)
# 6. Save the model!
trainer.save_checkpoint("image_classification_model.pt")
```

---

### DDP training

16 nodes and 4 GPUs each:

```bash
#!/bin/bash -x
#SBATCH --nodes=16                     # This needs to match Trainer(num_nodes=...)
#SBATCH --gres=gpu:4                   # Use the 4 GPUs available
#SBATCH --ntasks-per-node=4            # With Lightning, one task per GPU, so 4
#SBATCH --cpus-per-task=24             # Divide the number of cpus (96) by the number of GPUs (4)
#SBATCH --time=00:15:00
#SBATCH --partition=dc-gpu
#SBATCH --account=training2425
#SBATCH --output=%j.out
#SBATCH --error=%j.err
#SBATCH --reservation=training2425

export CUDA_VISIBLE_DEVICES=0,1,2,3    # Very important to make the GPUs visible
export SRUN_CPUS_PER_TASK="$SLURM_CPUS_PER_TASK"

source $HOME/course/$USER/sc_venv_template/activate.sh
time srun python3 ddp_training.py
```

    real    6m56.457s

---

### DDP training

With 4 nodes:

    real    24m48.169s

With 8 nodes:

    real    13m10.722s

With 16 nodes:

    real    6m56.457s

With 32 nodes:

    real    4m48.313s
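From those wall-clock times we can estimate how well the run scales. A small helper (times copied from the runs above) computes speedup and parallel efficiency relative to the 4-node run; note that each doubling of nodes buys less than a 2x improvement:

```python
def to_seconds(minutes, seconds):
    """Convert a `real Xm Ys` timing to seconds."""
    return minutes * 60 + seconds

# real times reported above: nodes -> seconds
runs = {
    4:  to_seconds(24, 48.169),
    8:  to_seconds(13, 10.722),
    16: to_seconds(6, 56.457),
    32: to_seconds(4, 48.313),
}

base_nodes = 4
for nodes, t in runs.items():
    speedup = runs[base_nodes] / t
    efficiency = speedup / (nodes / base_nodes)  # 100% would be ideal scaling
    print(f"{nodes:2d} nodes: speedup {speedup:.2f}x, efficiency {efficiency:.0%}")
```

Data loading, communication, and per-epoch fixed costs all eat into efficiency as the node count grows.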

---

### Data Parallel

It was:

```python
trainer = pl.Trainer(max_epochs=10, accelerator="gpu")
```

Became:

```python
nnodes = int(os.getenv("SLURM_NNODES"))  # num_nodes expects an int, not the string from the environment
trainer = pl.Trainer(max_epochs=10, accelerator="gpu", num_nodes=nnodes)
```

---

### Data Parallel

It was:

```bash
#SBATCH --nodes=1
#SBATCH --gres=gpu:1
#SBATCH --ntasks-per-node=1
#SBATCH --cpus-per-task=96
```

Became:

```bash
#SBATCH --nodes=16                   # This needs to match Trainer(num_nodes=...)
#SBATCH --gres=gpu:4                 # Use the 4 GPUs available
#SBATCH --ntasks-per-node=4          # With Lightning, one task per GPU, so 4
#SBATCH --cpus-per-task=24           # Divide the number of cpus (96) by the number of GPUs (4)
export CUDA_VISIBLE_DEVICES=0,1,2,3  # Very important to make the GPUs visible
```

---

### DEMO

---

### TensorBoard

In resnet50.py:

```python
self.log("training_loss", train_loss)
```

---

### TensorBoard

```bash
source $HOME/course/$USER/sc_venv_template/activate.sh
tensorboard --logdir=[PATH_TO_TENSOR_BOARD]
```


    ANY QUESTIONS??

    Feedback is more than welcome!

---

### Bringing Deep Learning Workloads to JSC supercomputers

#### Data loading

Alexandre Strube // Sabrina Benassou

June 25, 2024

---

### Schedule for day 2

| Time | Title |
| ------------- | ----------- |
| 10:00 - 10:15 | Welcome, questions |
| 10:15 - 11:30 | Data loading |
| 11:30 - 12:00 | Coffee Break (flexible) |
| 12:30 - 14:00 | Parallelize Training |

---

### Let’s talk about DATA

- Some general considerations one should have in mind

*(image: Not this data)*

---

### I/O is separate and shared

#### All compute nodes of all supercomputers see the same files

- Performance tradeoff between shared accessibility and speed
- It’s simple to load data fast to 1 or 2 GPUs. But to 100? 1000? 10000?

---

### Jülich Supercomputers

- Our I/O server is almost a supercomputer by itself

*(image: JSC Supercomputer Strategy)*

---

### Where do I keep my files?

- `$PROJECT_projectname` for code (projectname is `training2425` in this case)
  - Most of your work should stay here
- `$DATA_projectname` for big data(*)
  - Permanent location for big datasets
- `$SCRATCH_projectname` for temporary files (fast, but not permanent)
  - Files are deleted after 90 days untouched

---

### Data services

- JSC provides different data services
- Data projects give massive amounts of storage
- We use it for ML datasets. Join the project at Judoor
- After being approved, connect to the supercomputer and try it:

```bash
cd $DATA_datasets
ls -la
```

---

### Data Staging

- The LARGEDATA filesystem is not accessible by compute nodes
  - Copy files to an accessible filesystem BEFORE working
- Copying Imagenet-21K to `$SCRATCH` alone takes 21+ minutes
  - We already copied it to `$SCRATCH` for you
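Staging boils down to a recursive copy to the fast filesystem before training starts. A minimal stdlib sketch (the `$DATA`/`$SCRATCH` stand-ins here are temporary directories, so it runs anywhere):

```python
import shutil
import tempfile
from pathlib import Path

# Stand-ins for $DATA_... (permanent, not compute-visible) and
# $SCRATCH_... (fast, temporary, compute-visible).
data_root = Path(tempfile.mkdtemp(prefix="data_"))
scratch_root = Path(tempfile.mkdtemp(prefix="scratch_"))

# Pretend the permanent storage holds a small dataset.
(data_root / "dataset").mkdir()
(data_root / "dataset" / "sample0.txt").write_text("pixel data")

# Stage: copy the dataset to the fast filesystem BEFORE training.
staged = shutil.copytree(data_root / "dataset", scratch_root / "dataset")

print(sorted(p.name for p in Path(staged).iterdir()))  # ['sample0.txt']
```

On the real machines you would do the equivalent with `cp -r` or `rsync` in the job script, from `$DATA_projectname` to `$SCRATCH_projectname`.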

---

### Data loading

*(image: Fat GPUs need to be fed FAST)*

---

### Strategies

- We have CPUs and lots of memory - let’s use them
  - Multitask: train while loading the next batch in the background
  - `/dev/shm` is a filesystem in RAM - ultra fast ⚡️
- Use big files made for parallel computing
  - HDF5, Zarr, mmap() in a parallel fs, LMDB
- Use specialized data loading libraries
  - FFCV, DALI, Apache Arrow
- Compression such as squashfs
  - Data transfer can be slower than decompression (must be checked case by case)
  - Beneficial in cases where numerous small files are at hand
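The first strategy - overlapping training with loading of the next batch - can be sketched with a background thread and a bounded queue. This is a plain-Python stand-in for what PyTorch's DataLoader workers do (all names below are made up for illustration):

```python
import queue
import threading

def batches(n):
    """Pretend each batch takes I/O time to load."""
    for i in range(n):
        yield [i] * 4  # a fake batch of 4 samples

def prefetch(iterable, depth=2):
    """Load items in a background thread while the consumer trains."""
    q = queue.Queue(maxsize=depth)  # bounded: loader stays `depth` batches ahead
    done = object()

    def producer():
        for item in iterable:
            q.put(item)
        q.put(done)

    threading.Thread(target=producer, daemon=True).start()
    while (item := q.get()) is not done:
        yield item

total = 0
for batch in prefetch(batches(3)):
    total += sum(batch)   # the "training" step overlaps with loading
print(total)  # 0*4 + 1*4 + 2*4 = 12
```

The bounded queue is the key design choice: it keeps the loader ahead of the GPU without letting it fill memory with unconsumed batches.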

---

### Libraries

---

### We need to download some code

```bash
cd $HOME/course
git clone https://github.com/HelmholtzAI-FZJ/2024-06-course-Bringing-Deep-Learning-Workloads-to-JSC-supercomputers.git
```

---

### The ImageNet dataset

#### Large Scale Visual Recognition Challenge (ILSVRC)

- An image dataset organized according to the WordNet hierarchy.
- Extensively used in algorithms for object detection and image classification at large scale.
- It has 1000 classes, comprising 1.2 million images for training and 50,000 images for the validation set.

---

### The ImageNet dataset

```
ILSVRC
|-- Data/
    `-- CLS-LOC
        |-- test
        |-- train
        |   |-- n01440764
        |   |   |-- n01440764_10026.JPEG
        |   |   |-- n01440764_10027.JPEG
        |   |   |-- n01440764_10029.JPEG
        |   |-- n01695060
        |   |   |-- n01695060_10009.JPEG
        |   |   |-- n01695060_10022.JPEG
        |   |   |-- n01695060_10028.JPEG
        |   |   |-- ...
        |   |...
        |-- val
            |-- ILSVRC2012_val_00000001.JPEG
            |-- ILSVRC2012_val_00016668.JPEG
            |-- ILSVRC2012_val_00033335.JPEG
            |-- ...
```
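The directory layout doubles as the label source: each `n…` folder under `train/` is one class. A stdlib sketch that builds a `(path, label)` list from such a tree (a tiny fake tree in a temporary directory, so it runs anywhere):

```python
import tempfile
from pathlib import Path

# Build a tiny fake ILSVRC-style tree: train/<wnid>/<image>.JPEG
root = Path(tempfile.mkdtemp()) / "ILSVRC" / "Data" / "CLS-LOC" / "train"
for wnid, n_files in {"n01440764": 2, "n01695060": 1}.items():
    (root / wnid).mkdir(parents=True)
    for i in range(n_files):
        (root / wnid / f"{wnid}_{i}.JPEG").touch()

# Map each class folder to an integer label, then list (path, label) pairs.
classes = sorted(p.name for p in root.iterdir())
class_to_idx = {name: i for i, name in enumerate(classes)}
samples = [
    (str(path), class_to_idx[path.parent.name])
    for path in sorted(root.rglob("*.JPEG"))
]
print(len(samples), class_to_idx)  # 3 {'n01440764': 0, 'n01695060': 1}
```

This scan is exactly what gets expensive on a parallel filesystem with 1.2 million files, which motivates the precomputed JSON index on the next slide.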

---

### The ImageNet dataset

imagenet_train.json

```python
{
    'ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_8050.JPEG': 524,
    'ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_12728.JPEG': 524,
    'ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_9736.JPEG': 524,
    ...
    'ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_7460.JPEG': 524,
    ...
}
```

imagenet_val.json

```python
{
    'ILSVRC/Data/CLS-LOC/val/ILSVRC2012_val_00008838.JPEG': 785,
    'ILSVRC/Data/CLS-LOC/val/ILSVRC2012_val_00008555.JPEG': 129,
    'ILSVRC/Data/CLS-LOC/val/ILSVRC2012_val_00028410.JPEG': 968,
    ...
    'ILSVRC/Data/CLS-LOC/val/ILSVRC2012_val_00016007.JPEG': 709,
}
```
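Such a mapping is trivial to turn into the index-aligned `samples`/`targets` pair a dataset class needs (the dict below is a small made-up subset of the real file):

```python
import json

# A made-up two-entry subset of imagenet_train.json: relative path -> class index.
raw = json.loads("""{
  "ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_8050.JPEG": 524,
  "ILSVRC/Data/CLS-LOC/train/n03146219/n03146219_12728.JPEG": 524
}""")

# One flat list of paths and one of integer labels, index-aligned.
samples = list(raw.keys())
targets = list(raw.values())

print(len(samples), targets)  # 2 [524, 524]
```

Loading one JSON file touches a single inode instead of walking millions of directory entries.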

---

### Access File System

```python
def __getitem__(self, idx):
    x = Image.open(os.path.join(self.root, self.samples[idx])).convert("RGB")
    if self.transform:
        x = self.transform(x)
    return x, self.targets[idx]
```

---

### Inodes

- Inodes (Index Nodes) are data structures that store metadata about files and directories.
- Unique identification of files and directories within the file system.
- Efficient management and retrieval of file metadata.
- Essential for file operations like opening, reading, and writing.
- Limitations:
  - Fixed Number: Limited number of inodes; no new files if exhausted, even with free disk space.
  - Space Consumption: Inodes consume disk space; balancing is needed for efficiency.
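You can see a file's inode number from Python via `os.stat`. A quick check on a temporary file - the point being that every file costs one inode, so millions of tiny images cost millions of inodes on the shared filesystem:

```python
import os
import tempfile

# One file -> one inode. A dataset of millions of tiny files
# consumes millions of inodes, which is what strains a parallel fs.
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(b"one tiny file, one inode")
    path = f.name

info = os.stat(path)
print(info.st_ino, info.st_size)   # the inode number and size in bytes
os.unlink(path)
```

Packing the whole dataset into one big Arrow or HDF5 file (next slides) reduces this to a handful of inodes.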

---

### Pyarrow File Creation

```python
    binary_t = pa.binary()
    uint16_t = pa.uint16()
```

---

### Pyarrow File Creation

```python
    binary_t = pa.binary()
    uint16_t = pa.uint16()

    schema = pa.schema([
        pa.field('image_data', binary_t),
        pa.field('label', uint16_t),
    ])
```

---

### Pyarrow File Creation

```python
    with pa.OSFile(
            os.path.join(args.target_folder, f'ImageNet-{split}.arrow'),
            'wb',
    ) as f:
        with pa.ipc.new_file(f, schema) as writer:
```

---

### Pyarrow File Creation

```python
    with open(sample, 'rb') as f:
        img_string = f.read()

    image_data = pa.array([img_string], type=binary_t)
    label = pa.array([label], type=uint16_t)

    batch = pa.record_batch([image_data, label], schema=schema)

    writer.write(batch)
```

---

### Access Arrow File

```python
def __getitem__(self, idx):
    if self.arrowfile is None:
        self.arrowfile = pa.OSFile(self.data_root, 'rb')
        self.reader = pa.ipc.open_file(self.arrowfile)

    row = self.reader.get_batch(idx)

    img_string = row['image_data'][0].as_py()
    target = row['label'][0].as_py()

    with io.BytesIO(img_string) as byte_stream:
        with Image.open(byte_stream) as img:
            img = img.convert("RGB")

    if self.transform:
        img = self.transform(img)

    return img, target
```

---

### HDF5

```python
with h5py.File(os.path.join(args.target_folder, 'ImageNet.h5'), "w") as f:
```

---

### HDF5

```python
group = f.create_group(split)
```

---

### HDF5

```python
dt_sample = h5py.vlen_dtype(np.dtype(np.uint8))
dt_target = np.dtype('int16')

dset = group.create_dataset(
                'images',
                (len(samples),),
                dtype=dt_sample,
            )

dtargets = group.create_dataset(
        'targets',
        (len(samples),),
        dtype=dt_target,
    )
```

---

### HDF5

```python
for idx, (sample, target) in tqdm(enumerate(zip(samples, targets))):
    with open(sample, 'rb') as f:
        img_string = f.read()
        dset[idx] = np.array(list(img_string), dtype=np.uint8)
        dtargets[idx] = target
```

---

### Access h5 File

```python
def __getitem__(self, idx):
    if self.h5file is None:
        self.h5file = h5py.File(self.train_data_path, 'r')[self.split]
        self.imgs = self.h5file["images"]
        self.targets = self.h5file["targets"]

    img_string = self.imgs[idx]
    target = self.targets[idx]

    with io.BytesIO(img_string) as byte_stream:
        with Image.open(byte_stream) as img:
            img = img.convert("RGB")

    if self.transform:
        img = self.transform(img)

    return img, target
```

---

### DEMO

---

### Exercise

- Could you create an Arrow file for the Flickr dataset stored in `/p/scratch/training2402/data/Flickr30K/` and read it using a dataloader?