Skip to content

wenceslasdk/data-science-2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science 2 – NMFP436

Lectures: Václav Kozmík, Ondřej Týbl, Karel Kozmík

Practicals: Karel Kozmík, Ondřej Týbl

Contact:

This repository contains materials to the Data Science 2 - NMFP436 course.

Lectures plan

Date Topic Lecturer
18.2.2025 Intro Václav/Karel
25.2.2025 Decision Trees I Václav/Karel
4.3.2025 Decision Trees II Václav/Karel
11.3.2025 Decision Trees III Václav/Karel
18.3.2025 Decision Trees IV Václav/Karel
25.3.2025 Neural Networks I Ondřej
1.4.2025 Neural Networks II Ondřej
8.4.2025 Neural Networks III Ondřej
15.4.2025 Neural Networks IV Ondřej
22.4.2025 Neural Networks V Ondřej
29.4.2025 Neural Networks VI Ondřej
6.5.2025 Neural Networks VII Ondřej
13.5.2025 University Holiday no lecture
20.5.2025 Neural Networks VIII Ondřej

Note: We plan to dedicate one of the lectures in the Neural Networks block to an invited guest. We will share more details during the semester.

Practicals plan

Date Topic Lecturer
18.2.2025 Intro + Environment setup Karel
25.2.2025 Environment setup + Python Intro Karel
4.3.2025 Data Science Basics I Karel
11.3.2025 Data Science Basics II Karel
18.3.2025 Decision Trees I Karel
25.3.2025 Decision Trees II Karel
1.4.2025 Decision Trees III Karel
8.4.2025 Neural Networks I Ondřej
15.4.2025 Neural Networks II Ondřej
22.4.2025 Neural Networks III Ondřej
29.4.2025 Neural Networks IV Ondřej
6.5.2025 Neural Networks V Ondřej
13.5.2025 University Holiday no practicals
20.5.2025 Hyperparameters Optimization Ondřej

To receive the course credit, students must successfully work out two home assignments, one will be focused on decision trees and the other one on neural networks. There are only two assignments, but they will be complex and require considerable amount of work. Details will be published later in the semester. The course credit is a necessary requirement to take the final exam.

We use git to store all course files

How to set-up your python environment

The following instructions will guide you to set-up everything needed to run the course code. Long story short, we use virtual environment managed by poetry running on python3.10 to install all dependencies from the lock file provided.

1) Install PyCharm Professional and git

Note: it is not mandatory to use the Professional version, Community edition is also fine. The Professional version has some nice features like integrated Jupyter or automatic Python download.

2) Set up GitHub

3) Get the course files - several options ordered by level of recommendation

3.1) Use PyCharm integrated version control

  • PyCharm has an integration to GitHub
  • Click File -> Project from Version Control (or in the startup window you will see something like "Clone repository" or "VCS")
  • Clone your repository by selecting it in the version control and create a project from this folder
  • Do not clone the repository into a cloud folder! Use 'users//repos/data-science-2' foe example
  • This will create a project without any Interpreter
  • In the bottom right, click on the , select "Add New Interpreter" -> "Add Local Interpreter"
  • Create a new virtual environment with Python 3.10 (if not present, it will be downloaded)

3.2) Manually

  • Using the command line, clone your forked repository
cd C:\Users\tyblondr
git clone https://github.com/wenceslasdk/data-science-2.git
  • in my example the course repository would be C:\Users\tyblondr\data-science-2

  • now to create the Project

  • open PyCharm and click 'New Project'

  • set 'Name' to 'data-science-2' and 'Location' to the parent directory of your cloned repository from 2), i.e. in the example above we would set 'Location' to '\Users\tyblondr'

  • choose 'Python version' as 'Python 3.10' (will be downloaded and installed if it does not exist yet on your computer)

  • click 'Create' and choose 'Create from Existing Resources' if you are asked

3.3) Download the repository .zip from GitHub

  • if everything fails, you can always just download the files
  • click on the button <> Code, then Download ZIP

4) Install dependencies

  • open Terminal in PyCharm (one of the icons in the left-down corner)
  • make sure there is (.venv) at the beginning of your command line, denoting you are now in the activated virtual environment, that you created in the previous steps
  • install poetry by typing
pip install poetry
  • install remaining packages using poetry (we use --no-root to indicate that the environment itself already exists)
poetry install --no-root

Note for Conda Users: If you are using conda on your system, it will probably happen that your PyCharm terminal will be always opened with both base and the project virtual env being activated. This should not be a problem but we advice to double-check that 'which python' points to the python within the project and not the one in your conda installation.

5) Run Jupyter

  • You can run Jupyter directly inside PyCharm
  • Alternatively you can run Jupyter Lab by typing this into the terminal:
jupyter lab

Keeping the fork updated

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •