The aim of this project is to produce open source workshop materials which can be used to teach scientists how to use machine learning as a tool to assist them with their research.
- Introducing machine learning
- Workshop dataset
- Image and feature analysis
- Building a machine learning program
The aim of this workshop is to provide participants with the following skills:
- Reading images into Python
- Basic image processing
- Feature engineering skills
- Organising large datasets
- Splitting your dataset into training and testing sets
- Clustering your data
- Training a support vector machine classification program
- Testing the accuracy of your predictions.
The data used for this workshop come in the form of images of tags on the backs of honeybees which were filmed as part of an experiment into bee behaviour. We are interested in training a machine learning algorithm to automatically classify the different types of tag automatically. More information is available in the workshop dataset section of the workshop materials.
Workshop participants are assumed to have basic Python programming skills. An understanding of basic statistical concepts like regression and significance is also beneficial.
The following software is required:
- Python 3+
- NumPy
- OpenCV
- Matplotlib
- scikit-learn