Skip to content

Commit

Permalink
Added questions for 0. and 1.
Browse files Browse the repository at this point in the history
  • Loading branch information
fmind committed Mar 22, 2024
1 parent c1ba2aa commit 9173a39
Show file tree
Hide file tree
Showing 16 changed files with 120 additions and 36 deletions.
16 changes: 6 additions & 10 deletions docs/0. Overview/0.0. Course.md
Original file line number Diff line number Diff line change
@@ -1,17 +1,13 @@
# 0.0. Course

## In few words
## What is this course about?

## How much does this course cost?

## Intended Audience
## Why should I pursue this course?

## Prerequisites knowledge
## What are the course prerequisites?

## What will I learn with this course?

## How to read ?


## Technology


Copyright
## How much time does this course take to complete?
10 changes: 9 additions & 1 deletion docs/0. Overview/0.1. Projects.md
Original file line number Diff line number Diff line change
@@ -1 +1,9 @@
# 0.1. Projects
# 0.1. Projects

## What is the course default project?

## Can I use my own project instead?

## Where can I get inspiration?

## Can I do an LLM project?
12 changes: 7 additions & 5 deletions docs/0. Overview/0.2. Datasets.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,11 @@

Data is often referred to as the fuel for Machine Learning, and although this course focuses on MLOps, it's crucial to have access to data to fully grasp the various concepts and technologies involved.

## Types of Data
## What is a dataset?

## Why do I need a dataset?

## What are the types of dataset?

When mentionning data, the first point is perhaps what are we talking about. When exploiting the model, data will be required at every step and will take many forms, be stored on different supports and will have different properties.

Expand All @@ -29,8 +33,8 @@ Structured data adheres to a predefined model, making it easier to search and or
* useful to model complex networks and many real workd syste,s
* graph can be directed, undirected , weighted, multiple, cyclic, acyclic


### Unstructured Data

Unstructured data does not follow a predefined model, making it more complex to process.

* *Text*:
Expand All @@ -46,9 +50,7 @@ Unstructured data does not follow a predefined model, making it more complex to
Data that does not conform to a rigid data model like structured data, but it does contain tags or other markers to separate semantic elements and enforce hierarchies of records and fields, making it easier to parse than unstructured data.
Examples are XML and JSON files.



## Which data should I use?
## Which dataset should I use?

The question of which dataset to use is common, and honestly, the best dataset is the one you're most familiar with.
While the vast array of data types and their diverse applications might seem overwhelming, it's important to remember that many MLOps concepts are universal and can be applied across different domains.
Expand Down
16 changes: 14 additions & 2 deletions docs/0. Overview/0.3. Platforms.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,19 @@
# 0.3. Platforms

Databricks, metaflow ...
## What is an MLOps platform?

## Which MLOps platform is the best?

## Why you did not choose an MLOps platform?

Already plently on content, focus on common ground

=> We want something neutral that can be used on every platform

Other courses will introduce these tools.
Other courses will introduce these tools.

## Does this course require an MLOps platform?

## How this course will be beneficial for my MLOps platform?

Databricks, metaflow ...
12 changes: 11 additions & 1 deletion docs/0. Overview/0.4. Mentoring.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,13 @@
# 0.4. Mentoring

Mentoring
## Is it possible to receive mentoring?

Yes

## How this project mentoring will cost?

## Is it possible to get a company training?

## Can I do mentoring services with this course?

License, attribution
12 changes: 11 additions & 1 deletion docs/0. Overview/0.5. Assistants.md
Original file line number Diff line number Diff line change
@@ -1 +1,11 @@
# 0.5. Assistants
# 0.5. Assistants

## What is the course assistant?

Automated agent

## How does this assistant work?

## How should I use the assistant?

## Can I fully trust the assistant?
8 changes: 7 additions & 1 deletion docs/0. Overview/0.6. Resources.md
Original file line number Diff line number Diff line change
@@ -1 +1,7 @@
# 0.6. Resources
# 0.6. Resources

## Is there additional project resources?

## Can I suggest a new project resource?

## Can I contribute to the resources?
6 changes: 5 additions & 1 deletion docs/0. Overview/index.md
Original file line number Diff line number Diff line change
@@ -1 +1,5 @@
# 0. Overview
# 0. Overview

This chapter will introduce the course and its content.

TODO
12 changes: 11 additions & 1 deletion docs/1. Initializing/1.0. System.md
Original file line number Diff line number Diff line change
@@ -1 +1,11 @@
# 1.0. System
# 1.0. System

## Which system do I need for the course?

Hardware, OS

## Do I need to install other software packages?

## Is there specific requirements for MLOps projects?

## Can I use a cloud system (e.g., Cloud Workstation, Google Colab, ...)?
13 changes: 13 additions & 0 deletions docs/1. Initializing/1.1. Python.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# 1.1. Python

## What is the Python language?

## Why most AI/ML projects use Python?

## Is Python a good or a bad language for MLOps?

## How easy is it to use another language for AI/ML?

## Which version of Python should I use to complete the course?

## How should I install Python on my system to complete the course?
1 change: 0 additions & 1 deletion docs/1. Initializing/1.2. Python.md

This file was deleted.

Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# 1.1. pyenv
# 1.2. pyenv

## What is pyenv?

Expand All @@ -15,11 +15,9 @@ The use of pyenv offers several advantages:

For a comprehensive installation guide, visit [Pyenv GitHub Page](https://github.com/pyenv/pyenv?tab=readme-ov-file#installation). It provides detailed instructions tailored for various operating systems.

## Which version of Python should I use for my project?
## Is there a specific setup for MLOps projects?

While using the latest version of Python is generally recommended, for this project, the specified version range is Python 3.12 to 4.0. Ensure compatibility within this range for optimal performance.

## How to install the version of Python required for my project?
## How to install the required version of Python for my project?

To install a specific Python version (e.g., 3.12):
```bash
Expand Down
4 changes: 1 addition & 3 deletions docs/1. Initializing/1.3. Poetry.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,5 @@
# 1.3. Poetry

# 1.3. poetry

## What is poetry?

[Poetry](https://python-poetry.org/) is an innovative package manager for Python, streamlining dependency management and package distribution. It simplifies defining project dependencies and building packages, making it a cornerstone tool for Python project management.
Expand All @@ -17,7 +15,7 @@ Poetry brings several advantages to your project:
- **Package Building and Distribution**: Easily build and share your Python packages with others.
- **Project Metadata Definition**: Define essential metadata like authors, URLs, and more in a structured format.

## How can I use poetry for my project?
## How can I use poetry for my MLOps project?

To get started with Poetry:
- Create a project directory and navigate into it.
Expand Down
6 changes: 5 additions & 1 deletion docs/1. Initializing/1.4. git.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,4 +57,8 @@ poetry.lock
__pycache__/
```

Each line in `.gitignore` specifies a file, folder, or pattern for Git to exclude.
Each line in `.gitignore` specifies a file, folder, or pattern for Git to exclude.

## Is there specific tips or tricks for MLOps projects?

Large files
8 changes: 6 additions & 2 deletions docs/1. Initializing/1.5. GitHub.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,9 @@ GitHub is a versatile choice for hosting both private and open-source projects.
- [GitLab](https://about.gitlab.com/): Offers both hosted and self-hosted options with integrated CI/CD features.
- [Cloud Source Repositories](https://cloud.google.com/source-repositories/docs): A cloud-based private Git repository service by Google Cloud.

## How to configure GitHub for my project?
## Which services are proposed by Github?

## How to configure GitHub for my MLOps project?

To get started:
1. [Create a GitHub account](https://github.com/join) if you haven't already.
Expand All @@ -28,4 +30,6 @@ git push origin main

Collaboration on GitHub depends on the repository's visibility:
- **Public Repositories**: Others can [clone](https://docs.github.com/en/repositories/creating-and-managing-repositories/cloning-a-repository) or [fork](https://docs.github.com/en/get-started/quickstart/fork-a-repo) your project without any special permissions.
- **Private Repositories**: You need to [grant access](https://docs.github.com/en/get-started/learning-about-github/access-permissions-on-github) to collaborators using their GitHub username or email.
- **Private Repositories**: You need to [grant access](https://docs.github.com/en/get-started/learning-about-github/access-permissions-on-github) to collaborators using their GitHub username or email.

See final section
12 changes: 11 additions & 1 deletion docs/1. Initializing/1.6. VS Code.md
Original file line number Diff line number Diff line change
@@ -1 +1,11 @@
# 1.6. Visual Studio Code
# 1.6. Visual Studio Code

## What is VS Code?

## Why should I use VS Code?

## Where can I learn to use VS Code?

## How to configure VS Code for MLOps projects?

My blog article

0 comments on commit 9173a39

Please sign in to comment.