This course is designed as a 3 day intensive rundown of data visualization principles, techniques, common frameworks, and how to put together some of the material developed throughout the course in a gh-pages website/html-based presentation using markdown.
Throughout these 3 days, we will look at a number of exercises. Most of these demonstrations were developed in R due to its flexibility and community support, although some were coded in Python and Mathematica to allow the comparison of different approaches to data visualization and frameworks. The full list of exercises along with the languages they were developed in is:
- Box Whisker (R)
- Bubble Chart (Python)
- Chord Diagram (R)
- Dygraph (R)
- Factorial (Mathematica)
- FFmpeg (Bash)
- Globe (R)
- Maps (R/Python)
- Network (R)
- Parallel Plot (Python)
- Random Networks (Mathematica)
- Remark (Markdown/HTML)
- Revealjs (HTML)
- Scatter Plot with Histograms (Python)
- Scatter Plot (Mathematica)
- Stacked Area Plot (R/Python)
- Streamgraph (R)
- Transitions (Mathematica)
- Time Series (R/Mathematica)
- Tree Map (R)
- Violin Plots (R)
- Word Cloud (R)
Some of the examples were developed from scratch, whilst some others were coded as variations of online sources (in which case the sources are cited within the code snippets). The used datasets are either public domain or datasets developed as part of my research group that we are allowed to share.
- To show some good practices in data visualization
- To showcase some of the newer frameworks used to generate scientific and engineering plots
- To teach examples of how to choose the correct plot type according to our data, and message we want to transmit
- To teach which graphics formats are the best for engineering/scientific purposes
- To show how our visualizations can be embedded into presentations and shown online using markdown and HTML
- To get some exposure on alternatives to make data visualization more attractive/innovative
Click on the GIF to take a look at my personal website!
This course was designed to try to accommodate the widest set of audiences in engineering and scientific disciplines. As such, it provides guidelines that are common to most disciplines. For this reason, it was developed to make use of the three aforementioned programming platforms (R, Mathematica and Python) although no deep knowledge of the platforms is required.
Installing Mathematica is as simple as following the instructions on the wizard after downloading it from the institute's server.
Install R by downloading its executable packages, and do the same for RStudio.
The most complicated of the three tools is Python with it's required libraries. However, this can be done easily by installing anaconda, and then following the instructions detailed in the python introduction (easy) and/or the dataViz conda environment readme (advanced).
Install the github desktop app. Click on the fork button in the repository's main page:
Once forked, click on the clone or download button, followed by open in desktop: