Dates: 03 June - 05 June 2019
Venue: Addis Ababa University, Addis Ababa
In the tradition of previous Africa Data Science workshops, a summer school on machine learning and data science will be held prior to the main workshop. This summer school will target graduate students, researchers and professionals working with huge amounts of data or unique datasets.
The summer school will focus on introductory and advanced lectures in data science and machine learning as well as moderate to advanced practical and tutorial sessions where participants will get their hands wet wrangling and munging datasets and applying cutting edge machine learning techniques to derive inference from the data. Lectures will be given by distinguished world renown researchers and practitioners including researchers from Sheffield University, Amazon, ARM, Facebook, Google, Pulse Lab Kampala, the AI and Data Science (AIR) lab-Makerere University, and Dedan Kimathi University of Technology (DeKUT), African University of Science and Technology (AUST) among others.
The school will also involve end-to-end tutorial sessions from professionals walking the participants through a real data analytics problem from data acquisition to data presentation. To benefit from this course participants must have some background in programming particularly programming with Python and machine learning. The registration process will include submission of worked examples in Jupyter notebooks for a data science challenge.
School programme outline:
Pre-workshop
Configure a DSA Environment with Anaconda
Conda is an open source package management system and environment management system for installing multiple versions of software packages and their dependencies and switching easily between them. It works on Linux, OS X and Windows, and was created for Python programs but can package and distribute any software.
Overview
Using Anaconda consists of the following:
- Install
miniconda
on your computer - Create a new
conda
environment using this project - Each time you wish to work, activate your
conda
environment
Installation
Download the latest version of miniconda
that matches your system.
Linux | Mac | Windows | |
---|---|---|---|
64-bit | 64-bit (bash installer) | 64-bit (bash installer) | 64-bit (exe installer) |
32-bit | 32-bit (bash installer) | 32-bit (exe installer) |
Install miniconda on your machine. Detailed instructions:
- Linux: http://conda.pydata.org/docs/install/quick.html#linux-miniconda-install
- Mac: http://conda.pydata.org/docs/install/quick.html#os-x-miniconda-install
- Windows: http://conda.pydata.org/docs/install/quick.html#windows-miniconda-install
Setup the dsa2018
environment.
git clone https://github.com/emwebaze/dsa2018materials.git
cd setup
Create dsa2018. Running this command will create a new conda
environment that is provisioned with most of the libraries you need for this summerschool.
conda env create -f dsaenvironment.yml
Verify that the dsa2018 environment was created in your environments:
conda info --envs
Cleanup downloaded libraries (remove tarballs, zip files, etc):
conda clean -tp
Uninstalling
To uninstall the environment:
conda env remove -n dsa2018
Using Anaconda
Now that you have created an environment, in order to use it, you will need to activate the environment. This must be done each time you begin a new working session i.e. open a new terminal window.
Activate the dsa2018
environment:
OS X and Linux
$ source activate dsa2018
Windows
Depending on shell either:
$ source activate dsa2018
or
$ activate dsa2018
That’s it. Now you can fire up your Jupyter Notebook from this terminal and it will load all the necessary libraries.
To exit the environment when you have completed your work session, simply close the terminal window.
Troubleshooting and comments..
Use the comment section below to (a) ask questions that are not already answered (b) help your peers by providing answers to their questions, if you can.
Summer School Day 1
The first day of the data science school will start with an introduction of Machine Learning and Data Science, introduce python, jupyter notebook and pandas and Fundamentals of IoT, Data Visualisation and a tutorial on Mechanism Design. We have practical sessions for python, pandas and jupter and data Visualisation.
Time |
Activity |
Material |
08:00-09:00 |
Arrival and Registration |
|
09:00-09:30 |
Welcome and Opening Speech and Remarks |
|
09:30-10:30 |
Mechanism Design |
|
10:30-11:00 |
Break |
|
11:00-12:00 |
Mechanism Design Tutorial |
|
12:00-13:00 |
Python, Pandas and Jupyter Tutorial |
|
13:00-14:00 |
Lunch |
|
14:00-15:00 |
Introduction to Machine Learning |
|
15:00-16:00 |
Data Visualisation Lecture |
|
16:00-16:30 |
Break |
|
16:30-17:30 |
Data Visualisation Tutorial |
|
17:30-18:30 |
10 Academy |
Summer School Day 2
Time |
Activity |
Material |
---|---|---|
08:30-09:30 |
Classification | |
09:30-10:30 |
Computer Vision and Image Analysis | |
10:30-11:00 |
Break | |
11:00-11:30 |
Fundamentals of IoT | |
11:30-15:00 |
IoT Field Work | |
15:00-16:00 |
Lunch | |
16:00-17:00 |
Fundamentals of Bioinformatics | |
17:00-18:30 |
Deep Learning Fundamentals |
Summer School Day 3
Time |
Activity |
Material |
08:30-09:30 |
Data Science Challenges & Hackathons |
|
09:30-10:30 |
Time Series Analysis |
|
10:30-11:00 |
Break |
|
11:00-12:00 |
Reinforcement Learning |
|
12:00-13:00 |
Spatial Analysis |
|
13:00-14:00 |
Lunch |
|
14:00-15:30 |
Modeling with Gaussian Processes |
|
15:30-16:00 |
Break |
|
16:00-17:00 |
R for Data Science |
|
17:00-18:00 |
DSA Addis Ababa SS feedback and Overview |