Verified commit bc5ef9dc, authored by Benoit SEIGNOVERT

Update Welcome notebook with planning

parent 902c58ee
%% Cell type:markdown id:0ef87507-4419-480b-9e05-2e1444af60d1 tags:
<center><img src="https://s3.glicid.fr/nuts/workshop-banner.svg"/></center>
# Welcome to the NuTS workshop practicals
The practicals were prepared and will be presented by [Léonard Seydoux](https://leonard-seydoux.github.io/) with the help of [Benoît Seignovert](https://benoit.seignovert.fr).
> This Jupyter environment **is already preconfigured** for you: you just have to click on the links below to open the notebooks and edit them.
> If you want to, you can download the notebooks individually with the file explorer (on the left).
> All the notebooks are also available in the [NuTS Gitlab repo](https://gitlab.univ-nantes.fr/nuts/tp/machine-learning).
## Thursday, June 1<sup>st</sup>
### Atelier 1: First steps with data preparation (8h-12h)
Here we will investigate the basic statistical properties of the data and get a feeling for it. Do we have enough training points? Is the data stationary? Which features will be relevant for the task at hand? Can we avoid overfitting? A minimal illustration of these checks is sketched after the notebook list below.
- [Basics of programming with Python and scientific libraries](machine-learning/1_inspection/1_check_basics.ipynb)
- [River load sensor calibration](machine-learning/1_inspection/2_calibration.ipynb)
- [Classification of Iris dataset using various classifiers](machine-learning/2_iris/iris_classification.ipynb)
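As a first illustration of these checks (a minimal sketch only, using the Iris dataset and two classifiers chosen arbitrarily; the actual notebooks use their own data and pipelines), one can hold out a test set and compare train and test scores to get a first feeling for overfitting:
``` python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Keep a held-out test set: a large train/test gap hints at overfitting.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# Compare two simple classifiers on the same split.
for clf in (LogisticRegression(max_iter=1000), RandomForestClassifier(n_estimators=100)):
    clf.fit(X_train, y_train)
    print(type(clf).__name__,
          '- train:', round(clf.score(X_train, y_train), 3),
          '- test:', round(clf.score(X_test, y_test), 3))
```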
🍴 Lunch break and posters (12h-14h)
### Atelier 2: Machine learning approaches (14h-18h)
While deep learning is powerful enough to solve a vast majority of problems, machine-learning approaches, when successful, provide more insight into the physics at play. We will investigate and criticize several supervised and unsupervised approaches to classify clouds of Lidar points (scenes) and infer the type of structure visible therein; a small comparison of the two paradigms is sketched after the notebook list below.
- [Label a subset of a lidar point cloud](machine-learning/3_lidar/label.ipynb)
- [Three-dimensional lidar data classification of complex natural scenes with multi-scale features](machine-learning/3_lidar/lidar.ipynb)
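As a toy contrast between the two paradigms (an illustrative sketch on synthetic 3-D points, not the multi-scale Lidar pipeline of the notebooks; the point clouds and parameter choices here are assumptions), a supervised classifier uses the labels while a clustering algorithm has to recover the structure on its own:
``` python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Two synthetic point clouds: a thin, ground-like layer and a thicker, canopy-like one.
rng = np.random.default_rng(0)
ground = rng.normal([0, 0, 0], [5, 5, 0.1], size=(500, 3))
canopy = rng.normal([0, 0, 3], [5, 5, 1.0], size=(500, 3))
X = np.vstack([ground, canopy])
y = np.repeat([0, 1], 500)

# Supervised: learn the two classes from labelled points.
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = RandomForestClassifier().fit(X_train, y_train)
print('Random forest test accuracy:', round(clf.score(X_test, y_test), 3))

# Unsupervised: group the same points without ever seeing the labels.
clusters = KMeans(n_clusters=2, n_init=10).fit_predict(X)
print('K-means cluster sizes:', np.bincount(clusters))
```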
## Friday, June 2<sup>nd</sup>
### Atelier 3: Deep Learning with PyTorch (9h-12h)
When the previous attempts to solve the problem are unsuccessful, or when defining features is a challenge because of the dimensionality of the input data, we can also learn the features that best solve the task. Here we will revisit the Lidar problem with deep-learning approaches and see how to efficiently design a neural network for the classification task; a minimal network definition is sketched after the notebook list below.
- [MNIST classification with a fully connected neural network](deep-learning/session_1a_fcnn.ipynb)
- [MNIST classification with a convolutional neural network](deep-learning/session_1b_cnn.ipynb)
- [Understanding and training PhaseNet on a local dataset](deep-learning/session_2_phasnet.ipynb)
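As a minimal sketch of what designing such a network looks like in PyTorch (the input shape assumes MNIST-like 28x28 grayscale images, and the layer sizes are arbitrary choices, not those of the workshop notebooks):
``` python
import torch
from torch import nn

# A small fully connected classifier for 28x28 grayscale images and 10 classes.
model = nn.Sequential(
    nn.Flatten(),            # (batch, 1, 28, 28) -> (batch, 784)
    nn.Linear(28 * 28, 128),
    nn.ReLU(),
    nn.Linear(128, 10),      # one logit per class
)

# One dummy forward/backward pass to check shapes and gradients.
images = torch.randn(8, 1, 28, 28)
labels = torch.randint(0, 10, (8,))
loss = nn.CrossEntropyLoss()(model(images), labels)
loss.backward()
print('Batch loss:', float(loss))
```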
🍴 Lunch break and posters (12h-14h)
### Atelier 4: Deeper dive into artificial intelligence (14h-15h30)
This last session will be a free, hands-on class on various problems. We invite participants to bring datasets and tasks of interest so that we can sit down together and try machine-learning solutions. We will also dive deeper into the algorithms, try to improve performance, and discuss good practices in machine and deep learning.
<center><img src="https://s3.glicid.fr/nuts/workshop-footer.svg"/></center>
%% Cell type:markdown id:096bcfe7-b208-4567-b9ee-8290b8dc325e tags:
## Explore your environment configuration
### Hardware config
%% Cell type:code id:c0246502-79e3-4f50-9f95-4e4ff0045547 tags:
``` python
from os import environ
from psutil import cpu_count, virtual_memory
```
%% Cell type:code id:f07d76ac-e6d0-4fe5-8130-0fdd36ac815c tags:
``` python
print('Username:   ', environ.get('USER'))
print('SLURM node: ', environ.get('SLURMD_NODENAME'))

print('CPU core:     ', cpu_count(logical=False))
print('CPU threads:  ', cpu_count(logical=True))
print('RAM total:    ', round(virtual_memory().total / 1024 ** 3, 1), 'GB')
print('RAM available:', round(virtual_memory().available / 1024 ** 3, 1), 'GB')
```
%% Output
Username: user-id@domain.fr
SLURM node: budbud020
CPU core: 32
CPU threads: 64
RAM total: 251.5 GB
RAM available: 243.1 GB
%% Cell type:markdown id:ac43551d-8768-4f9b-9d19-41b6ff9f2660 tags:
### Python packages
%% Cell type:code id:47a04f49-75b4-45ca-977e-a27f4a1af324 tags:
``` python
from platform import python_version
import numpy, matplotlib, seaborn, pandas, torch, sklearn, obspy, seisbench
```
%% Cell type:code id:fd3e0810-01cc-4017-a6e4-25542a63bb84 tags:
``` python
print('Python: ', python_version())
print('Numpy: ', numpy.__version__)
print('Matplotlib: ', matplotlib.__version__)
print('Seaborn: ', seaborn.__version__)
print('Pandas: ', pandas.__version__)
print('PyTorch: ', torch.__version__)
print('Scikit Learn: ', sklearn.__version__)
print('Obspy: ', obspy.__version__)
print('Seisbench: ', seisbench.__version__)
```
%% Output
Python: 3.9.16
Numpy: 1.24.3
Matplotlib: 3.7.1
Seaborn: 0.12.2
Pandas: 1.5.3
PyTorch: 1.13.1
Scikit Learn: 1.2.2
Obspy: 1.4.0
Seisbench: 0.4.0
%% Cell type:markdown id:adc1930d-503c-4caf-b79a-c226b39ddf7d tags:
### PyTorch GPU configuration
%% Cell type:code id:7bc5a46e-caac-4ac6-94c8-efd317a9b8d2 tags:
``` python
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
print('Using device:', device)

if device.type == 'cuda':
    print('Cuda version:', torch.version.cuda)

    for i in range(torch.cuda.device_count()):
        print()
        print(torch.cuda.get_device_name(i))
        print('Memory Usage:')
        print('Allocated:', round(torch.cuda.memory_allocated(i) / 1024 ** 3, 1), 'GB')
        print('Cached:   ', round(torch.cuda.memory_reserved(i) / 1024 ** 3, 1), 'GB')
```
%% Output
Using device: cuda
Cuda version: 11.7

NVIDIA A100-PCIE-40GB
Memory Usage:
Allocated: 0.0 GB
Cached: 0.0 GB

NVIDIA A100-PCIE-40GB
Memory Usage:
Allocated: 0.0 GB
Cached: 0.0 GB
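As a quick usage check of the detected device (a minimal sketch; the tensor and its size are arbitrary), a tensor can be allocated directly on it:
``` python
# Allocate a small random tensor on the detected device (GPU if available).
x = torch.randn(3, 3, device=device)
print(x.device)
```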
%% Cell type:markdown id:55e02b91-907b-469a-b7e8-6de4d2c16599 tags:
### Seisbench datasets
%% Cell type:code id:e2c6e92d-c58c-473d-bdd0-5dddc4580c0f tags:
``` python
from seisbench.data import Iquique, ETHZ
```
%% Cell type:code id:af776464-eb23-4308-bf43-8244928d5d6b tags:
``` python
# data = Iquique()
data = ETHZ()
print(data)
data.metadata.head()
```
%% Output
2023-05-30 12:00:44,151 | seisbench | WARNING | Check available storage and memory before downloading and general use of ETHZ dataset. Dataset size: waveforms.hdf5 ~22Gb, metadata.csv ~13Mb
2023-05-30 12:00:44,897 | seisbench | WARNING | Data set contains mixed sampling rate, but no sampling rate was specified for the dataset. get_waveforms will return mixed sampling rate waveforms.
ETHZ - 36743 traces
   index   source_id           source_origin_time  \
0      0  2020zmwrjy  2020-12-27T02:46:42.620452Z
1      1  2020zmwrjy  2020-12-27T02:46:42.620452Z
2      2  2020zmwrjy  2020-12-27T02:46:42.620452Z
3      3  2020zmwrjy  2020-12-27T02:46:42.620452Z
4      4  2020zmwrjy  2020-12-27T02:46:42.620452Z
   source_origin_uncertainty_sec  source_latitude_deg  \
0                            NaN             47.147641
1                            NaN             47.147641
2                            NaN             47.147641
3                            NaN             47.147641
4                            NaN             47.147641
   source_latitude_uncertainty_km  source_longitude_deg  \
0                        0.620493               6.371343
1                        0.620493               6.371343
2                        0.620493               6.371343
3                        0.620493               6.371343
4                        0.620493               6.371343
   source_longitude_uncertainty_km  source_depth_km  \
0                          0.927755        10.965625
1                          0.927755        10.965625
2                          0.927755        10.965625
3                          0.927755        10.965625
4                          0.927755        10.965625
   source_depth_uncertainty_km  ...  trace_Pn_status  trace_Pn_polarity  \
0                      2.116958  ...              NaN                NaN
1                      2.116958  ...              NaN                NaN
2                      2.116958  ...              NaN                NaN
3                      2.116958  ...              NaN                NaN
4                      2.116958  ...              NaN                NaN
   trace_P_arrival_sample  trace_P_status  trace_P_polarity  \
0                     NaN             NaN               NaN
1                     NaN             NaN               NaN
2                     NaN             NaN               NaN
3                     NaN             NaN               NaN
4                     NaN             NaN               NaN
   trace_Sn_arrival_sample  trace_Sn_status  trace_Sn_polarity  trace_chunk  \
0                      NaN              NaN                NaN
1                      NaN              NaN                NaN
2                      NaN              NaN                NaN
3                      NaN              NaN                NaN
4                      NaN              NaN                NaN
   trace_component_order
0                    ZNE
1                    ZNE
2                    ZNE
3                    ZNE
4                    ZNE
[5 rows x 58 columns]
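Individual traces can then be pulled out of the dataset with `get_waveforms`, the method mentioned in the warning above (a minimal sketch; trace index 0 is an arbitrary choice, and the first component is Z given the ZNE order shown in the metadata):
``` python
import matplotlib.pyplot as plt

# Fetch the first trace as a (channels, samples) NumPy array and plot its vertical component.
waveform = data.get_waveforms(0)
print('Waveform shape:', waveform.shape)

plt.plot(waveform[0])
plt.xlabel('Sample')
plt.ylabel('Amplitude')
plt.show()
```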
%% Cell type:markdown id:bbd05dfd-14a5-4d13-bd86-bf74b3b7f009 tags:
### ObsPy API / HTTP proxy check
%% Cell type:code id:bafe58a4-396c-42da-9754-e60d31f382fa tags:
``` python
from obspy.clients.fdsn import Client
```
%% Cell type:code id:d4ff92bf-8137-4389-af95-48ff8a958e1f tags:
``` python
Client('ETH')
```
%% Output
FDSN Webservice Client (base url: http://eida.ethz.ch)
Available Services: 'dataselect' (v1.1.1), 'event' (v1.2.4), 'station' (v1.1.4), 'available_event_catalogs', 'available_event_contributors', 'eida-auth'
Use e.g. client.help('dataselect') for the
parameter description of the individual services
or client.help() for parameter description of
all webservices.
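If the client resolves as above, the proxy configuration can be exercised further with a small data request (a minimal sketch; the station `DAVOX`, channel `HHZ`, and time window are arbitrary examples, not part of the original notebook):
``` python
from obspy import UTCDateTime
from obspy.clients.fdsn import Client

# Request one minute of vertical-component data from the ETH FDSN web service.
client = Client('ETH')
t0 = UTCDateTime('2020-12-27T02:46:00')
stream = client.get_waveforms(network='CH', station='DAVOX', location='*',
                              channel='HHZ', starttime=t0, endtime=t0 + 60)
print(stream)
```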
%% Cell type:markdown id:b4f41f45-c6b5-4ae1-9506-c6981f91343d tags:
<center><img src="https://s3.glicid.fr/nuts/workshop-footer.svg"/></center>