Skip to content

Commit

Permalink
Merge branch 'main' into exercise-rework
Browse files Browse the repository at this point in the history
  • Loading branch information
dominik-probst committed May 14, 2024
2 parents 6bd98d9 + 2617d04 commit f388349
Show file tree
Hide file tree
Showing 5 changed files with 50 additions and 41 deletions.
18 changes: 0 additions & 18 deletions README.org
Original file line number Diff line number Diff line change
Expand Up @@ -37,24 +37,6 @@
- Tuesday, 21 May 2024
- Friday, 31 May 2024

| *Calendar Week* | *Lecture Topic* | *Lecturer* | *Exercise* | *Submission* |
|---------------+----------------------------------------+----------------------------------+------------------------------------+------------------------------------|
| 16 | Course Introduction + KDD Introduction | Dominik Probst | - | - |
| 17 | Data | Dominik Probst | Introduction to Python & Pandas | - |
| 18 | Preprocessing, Part 1 | Dominik Probst | Data analysis & data preprocessing (only Group 1 & 2) | - |
| 19 | Preprocessing, Part 2 | Dominik Probst | Data analysis & data preprocessing | - |
| 20 | OLAP | Dominik Probst | Data analysis & data preprocessing | - |
| 21 | - | Dominik Probst | Data analysis & data preprocessing (only Group 3 & 4) | - |
| 22 | Frequent Pattern, Part 1 | Dominik Probst | - | - |
| 23 | Frequent Pattern, Part 2 | Dominik Probst | Frequent Pattern | - |
| 24 | Classification, Part 1 | Dominik Probst | Frequent Pattern | Frequent Pattern |
| 25 | Classification, Part 2 | Dominik Probst | Classification | Frequent Pattern |
| 26 | Cluster Analysis, Part 1 | Dominik Probst | Classification | Classification |
| 27 | Cluster Analysis, Part 2 | Dominik Probst | Classification | Classification |
| 28 | Outlier Analysis, Part 1 | Dominik Probst | Clustering | Clustering |
| 29 | Outlier Analysis, Part 2 & Exam QnA (if there is time) | Dominik Probst | Clustering | Clustering |


* Setup for Building Lecture Slides Locally
To build these lecture slides locally on your machine you'll need an up-to-date
version of LaTeX such as [[https://www.tug.org/texlive/][texlive]] or [[https://miktex.org/][MikTex]].
Expand Down
2 changes: 1 addition & 1 deletion exercise/1-Introduction-Python-Pandas.tex
Original file line number Diff line number Diff line change
Expand Up @@ -65,7 +65,7 @@ \section*{Exercise 1: Getting started}

\begin{itemize}
\item Run the command \texttt{jupyter notebook} within the extracted additional files
folder to start the Jupyter Notebook server.
folder to start the Jupyter Notebook server\footnote{If you have problems starting the Jupyter Notebook server, try \texttt{python -m notebook} as an alternative command.}.
\item There should be a new tab in your browser with the Jupyter Notebook interface.
\end{itemize}
\end{enumerate}
Expand Down
30 changes: 15 additions & 15 deletions lecture/1-prologue/1-prologue.tex
Original file line number Diff line number Diff line change
Expand Up @@ -233,21 +233,21 @@
\centering
\begin{tabular}{|p{4em}|p{15em}|p{9.5em}|p{9.5em}|p{10em}|}
\hline
\rowcolor{faugray!62}\textbf{Calendar Week} & \textbf{Lecture} & \multicolumn{2}{|l|}{\textbf{Exercise}} & \textbf{Submission} \\ \hhline{*{5}{:=}:}
\cellcolor{faugray!25}16 & Prologue + Introduction & \multicolumn{2}{|l|}{\cellcolor{gray!50}} & \cellcolor{gray!50} \\ \hhline{|-|-|-|-|-|}
\cellcolor{faugray!25}17 & Data & \multicolumn{2}{|l|}{Introduction to Python \& pandas {\color{gray}(optional)}} & \cellcolor{gray!50} \\ \hhline{|-|-|-|-|-|}
\cellcolor{faugray!25}18 & \multirow{2}{*}{Preprocessing} & \centering\tiny\textcolor{gray}{Group 1 \& 2} & \cellcolor{gray!50} & \cellcolor{gray!50} \\ \hhline{|-|~|~|-|-|}
\cellcolor{faugray!25}19 & & \multicolumn{2}{|l|}{\multirow{2}{*}[-0.6em]{Data Analysis \& Data Preprocessing}} & \cellcolor{gray!50} \\ \hhline{|--|~|~|-|}
\cellcolor{faugray!25}20 & \textcolor{gray}{Guest lecture} +\newline Data Warehousing & \multicolumn{2}{|l|}{} & \cellcolor{gray!50} \\ \hhline{|-|-|-|~|-|}
\cellcolor{faugray!25}21 & \cellcolor{gray!50} & \cellcolor{gray!50} & \centering\tiny\textcolor{gray}{Group 3 \& 4} & \cellcolor{gray!50} \\ \hhline{|-|-|-|-|-|}
\cellcolor{faugray!25}22 & \multirow{2}{*}{Frequent Pattern} & \multicolumn{2}{|l|}{\cellcolor{gray!50}} & \cellcolor{gray!50} \\ \hhline{|-|~|-|-|-|}
\cellcolor{faugray!25}23 & & \multicolumn{2}{|l|}{\multirow{2}{*}{Frequent Pattern}} & \cellcolor{gray!50} \\ \hhline{|-|-|~|~|-|}
\cellcolor{faugray!25}24 & \multirow{2}{*}{Classification} & \multicolumn{2}{|l|}{} & \multirow{2}{*}{\cellcolor{white}Frequent Pattern} \\ \hhline{|-|~|-|-|~|}
\cellcolor{faugray!25}25 & & \multicolumn{2}{|l|}{\multirow{3}{*}{Classification}} & \\ \hhline{|-|-|~|~|-|}
\cellcolor{faugray!25}26 & \multirow{2}{*}{Cluster Analysis} & \multicolumn{2}{|l|}{} & \multirow{2}{*}{\cellcolor{white}Classification} \\ \hhline{|-|~|~|~|~|}
\cellcolor{faugray!25}27 & & \multicolumn{2}{|l|}{} & \\ \hline
\cellcolor{faugray!25}28 & \multirow{2}{*}{Outlier Analysis \textcolor{gray}{+ Exam Q\&A }} & \multicolumn{2}{|l|}{\multirow{2}{*}{Clustering}} & \multirow{2}{*}{\cellcolor{white}Clustering} \\ \hhline{|-|~|~|~|~|}
\cellcolor{faugray!25}29 & & \multicolumn{2}{|l|}{} & \\ \hline
\rowcolor{faugray!62}\textbf{Calendar Week} & \textbf{Lecture} & \multicolumn{2}{|l|}{\textbf{Exercise}} & \textbf{Submission} \\ \hhline{*{5}{:=}:}
\cellcolor{faugray!25}16 & Prologue + Introduction & \multicolumn{2}{|l|}{\cellcolor{gray!50}} & \cellcolor{gray!50} \\ \hhline{|-|-|-|-|-|}
\cellcolor{faugray!25}17 & Data & \multicolumn{2}{|l|}{Introduction to Python \& pandas {\color{gray}(optional)}} & \cellcolor{gray!50} \\ \hhline{|-|-|-|-|-|}
\cellcolor{faugray!25}18 & \multicolumn{4}{|l|}{\cellcolor{gray!50}} \\ \hhline{|-|-|-|-|-|}
\cellcolor{faugray!25}19 & \multirow{2}{*}{Preprocessing \textcolor{gray}{ + Guest lecture}} & \centering\tiny\textcolor{gray}{Group 1 \& 2} & \centering\tiny\textcolor{gray}{Group 3 \& 4} & \cellcolor{gray!50} \\ \hhline{|-|~|~~|-|}
\cellcolor{faugray!25}20 & & \multicolumn{2}{|c|}{Data Analysis \& Data Preprocessing} & \cellcolor{gray!50} \\ \hhline{|--|~|~|-|}
\cellcolor{faugray!25}21 & Data Warehousing \textcolor{gray}{\tiny Fri 08:15-09:45} & \cellcolor{gray!50} & & \cellcolor{gray!50} \\ \hhline{|-|-|-|-|-|}
\cellcolor{faugray!25}22 & \multirow{2}{*}{Frequent Pattern} & \multicolumn{1}{|c|}{\textcolor{gray}{\tiny continued}} & \cellcolor{gray!50} & \cellcolor{gray!50} \\ \hhline{|-|~|-|-|-|}
\cellcolor{faugray!25}23 & & \multicolumn{2}{|l|}{\multirow{2}{*}{Frequent Pattern}} & \cellcolor{gray!50} \\ \hhline{|-|-|~|~|-|}
\cellcolor{faugray!25}24 & \multirow{2}{*}{Classification} & \multicolumn{2}{|l|}{} & \multirow{2}{*}{\cellcolor{white}Frequent Pattern} \\ \hhline{|-|~|-|-|~|}
\cellcolor{faugray!25}25 & & \multicolumn{2}{|l|}{\multirow{3}{*}{Classification}} & \\ \hhline{|-|-|~|~|-|}
\cellcolor{faugray!25}26 & \multirow{2}{*}{Cluster Analysis} & \multicolumn{2}{|l|}{} & \multirow{2}{*}{\cellcolor{white}Classification} \\ \hhline{|-|~|~|~|~|}
\cellcolor{faugray!25}27 & & \multicolumn{2}{|l|}{} & \\ \hline
\cellcolor{faugray!25}28 & \multirow{2}{*}{Outlier Analysis \textcolor{gray}{+ Exam Q\&A }} & \multicolumn{2}{|l|}{\multirow{2}{*}{Clustering}} & \multirow{2}{*}{\cellcolor{white}Clustering} \\ \hhline{|-|~|~|~|~|}
\cellcolor{faugray!25}29 & & \multicolumn{2}{|l|}{} & \\ \hline
\end{tabular}

\end{frame}
Expand Down
14 changes: 7 additions & 7 deletions lecture/3-data/2-statistical-description.tex
Original file line number Diff line number Diff line change
Expand Up @@ -114,12 +114,12 @@ \section{Basic Statistical Descriptors of Data}
\begin{tabularx}{\textwidth}{|c|c|p{4.5em}|X|X|X|}
\hline
\rowcolor{faugray!62}\textbf{Class $i$} & \textbf{Age $x_i$} ($x_i^l$ - $x_i^u$) & \textbf{Class Width $\Delta_i$} & \textbf{Absolute\newline Frequency $n_i$} & \textbf{Relative\newline Frequency $f_i$} & \textbf{Cumulative rel.\newline Frequency $F_i$} \\ \hline
$1$ & $1-5$ & 4 & $200$ & 0.06262 & 0.06262 \\
$2$ & $6-15$ & 9 & $450$ & 0.14089 & 0.20351 \\
$3$ & $16-20$ & 4 & $300$ & 0.09393 & 0.29743 \\
\rowcolor{fauyellow!62} $4$ & $21-50$ & 29 & $1500$ & 0.46963 & 0.76706 \\
$5$ & $51-80$ & 29 & $700$ & 0.21916 & 0.98622 \\
$6$ & $81-110$ & 29 & $44$ & 0.01378 & 1.00000 \\ \hline
$1$ & $1-5$ & 5 & $200$ & 0.06262 & 0.06262 \\
$2$ & $6-15$ & 10 & $450$ & 0.14089 & 0.20351 \\
$3$ & $16-20$ & 5 & $300$ & 0.09393 & 0.29743 \\
\rowcolor{fauyellow!62} $4$ & $21-50$ & 30 & $1500$ & 0.46963 & 0.76706 \\
$5$ & $51-80$ & 30 & $700$ & 0.21916 & 0.98622 \\
$6$ & $81-110$ & 30 & $44$ & 0.01378 & 1.00000 \\ \hline
$\sum$ & & & 3194 & 1.00000 & \\ \hline
\end{tabularx}
\end{table}
Expand Down Expand Up @@ -165,7 +165,7 @@ \section{Basic Statistical Descriptors of Data}
\item $\Delta_i=x_i^u - x_i^l$ the class width.
\end{itemize}

In our example: $\tilde{x} \approx 21 +\left(\frac{\frac{3194}{2} - 950}{1500}\right)*29 \approx 33.51$, i. e. 33 years and 6 months
In our example: $\tilde{x} \approx 21 +\left(\frac{\frac{3194}{2} - 950}{1500}\right)*30 \approx 33.94$, i. e. 33 years and 11 months
\end{frame}

\begin{frame}{Measuring the Central Tendency: Mode}
Expand Down
27 changes: 27 additions & 0 deletions semester-plans.org
Original file line number Diff line number Diff line change
@@ -1,4 +1,31 @@
#+title: Semester Plans

* Summer semester 2023
- Semester duration: 17 April 2023 – 21 July 2023
- No lectures on: Tuesday, 30 May 2023 (CW 22)
- Public holidays:
- Monday, 1 May 2023 (Tag der Arbeit/May Day, CW 18)
- Thursday, 18 May 2023 (Christi Himmelfahrt/Ascension Day, CW 20)
- Monday, 29 May 2022 (Pfingstmontag/Whit Monday, CW 22)
- Thursday, 8 June 2022 (Fronleichnam/Corpus Christi, CW 23)

| *Calendar Week* | *Lecture Topic* | *Lecturer* | *Exercise* |
|---------------+----------------------------------------+----------------------------------+------------------------------------|
| 16 | Course Introduction + KDD Introduction | Dominik Probst | |
| 17 | Data | Melanie B. Sigl | Introduction to Python & Pandas |
| 18 | Preprocessing, Part 1 | Dominik Probst | Data analysis & data preprocessing |
| 19 | Preprocessing, Part 2 | Dominik Probst | Data analysis & data preprocessing |
| 20 | OLAP | Melanie B. Sigl | Data analysis & data preprocessing |
| 21 | Frequent Pattern, Part 1 | Dominik Probst | - |
| 22 | Frequent Pattern, Part 2 | Dominik Probst | Frequent Pattern |
| 23 | Classification, Part 1 | Melanie B. Sigl | Frequent Pattern |
| 24 | Classification, Part 2 | Melanie B. Sigl | Classification |
| 25 | Cluster Analysis, Part 1 | Dominik Probst | Classification |
| 26 | Cluster Analysis, Part 2 | Dominik Probst | Classification |
| 27 | Outlier Analysis, Part 1 | Melanie B. Sigl | Clustering |
| 28 | Outlier Analysis, Part 2 | Melanie B. Sigl | Clustering |
| 29 | Current Research at CS6 and Exam QnA | Melanie B. Sigl + Dominik Probst | |

* Summer semester 2022
Semester duration: 25 April 2022 – 29 July 2022

Expand Down

0 comments on commit f388349

Please sign in to comment.