Jun 4, 2:00 – 5:00 PM (UTC)
You are starting to become an elder wizard! We've learned that the type of data can limit the sort of analysis we can do (S. S. Stevens, 1946, "On the Theory of Scales of Measurement," Science, Vol. 103, No. 2684). Since then, we've enhanced our data science and machine learning capabilities by diving into exploratory data analysis (EDA), cross-validation, feature engineering, and modeling. Our goal is to tame data and extract valuable insights. Remember, we live in the matrix, but sometimes we love vectors too, and so do our models. You can find advice from Elder Mage 🧙🏾🚧 here.
This time, we are going to focus on data version control as part of the normal data science workflow. This involves meticulously documenting everything we do in our in-silico laboratory (in silico: performed on a computer or via computer simulation). Proper workflow management is crucial, especially in production environments where issues can arise rapidly. Good workflows help us troubleshoot what goes wrong during machine learning training.
We will introduce you to the powerful Makefile, an organizational tool for project workflows. This session will cover all stages from development to production, including continuous integration (CI) and continuous deployment (CD), all orchestrated within one file. Furthermore, we will introduce DVC, a tool to help track changes in data files, plots, machine learning models, and metrics, ensuring reproducibility (ability to get the same results using the same data and code) and shareability of your workflows. This helps tackle the "it worked on my computer" problem to a significant extent.
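To make the idea of tracking changes in data files concrete, here is a toy Python sketch of one common approach: record a hash of a file's contents, then compare it later. This illustrates the general idea only and is not DVC's implementation. The function names and the choice of SHA-256 are assumptions made for the example.

```python
import hashlib
from pathlib import Path


def file_fingerprint(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file's contents, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large data files never sit fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def has_changed(path: Path, recorded_fingerprint: str) -> bool:
    """True when the file no longer matches the fingerprint recorded earlier."""
    return file_fingerprint(path) != recorded_fingerprint
```

Storing the small fingerprint alongside the code, rather than the data file itself, is what lets a teammate check that they are working with exactly the same data.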
The problem we'll tackle will be related to insurance. Using a dataset from Kaggle, a machine learning platform for learning, experimenting, and competing, we will build a predictive model to identify the factors influencing insurance costs. Our workflows will aim to run the entire data science process in single or incremental steps, providing documentation throughout. We are data storytellers, which is part of being a wizard! 🧙🏿📜🗣
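Running a workflow in incremental steps usually means redoing a step only when its inputs have changed. Make decides this by comparing file modification times: a target is rebuilt if it is missing or older than any of its prerequisites. The Python sketch below mimics that rule. The names `is_stale` and `run_step` are hypothetical and exist only for this illustration.

```python
from pathlib import Path
from typing import Callable, Iterable


def is_stale(target: Path, dependencies: Iterable[Path]) -> bool:
    """A target is stale if it is missing or older than any dependency."""
    if not target.exists():
        return True
    target_mtime = target.stat().st_mtime
    return any(dep.stat().st_mtime > target_mtime for dep in dependencies)


def run_step(
    target: Path,
    dependencies: Iterable[Path],
    action: Callable[[], None],
) -> bool:
    """Run `action` only when `target` is stale; return True if it ran."""
    if is_stale(target, list(dependencies)):
        action()
        return True
    return False
```

Chaining several such steps (for example raw data, then cleaned data, then a trained model) gives a pipeline in which editing one input reruns only the steps downstream of it.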
Jargon simplified:
What you'll need:
Join us at this month's meetup to become an Elder Mage 🧙🏿 too. We will be closing registration early, so grab your friends and come learn to be dangerous.
Outline
Gigs:
We would love to reach out to you so that you can build for our customers. Please fill out this form so that we have your details:
GIG/HACK DEVELOPER PORTFOLIO FORM
Join community channels:
Africa's Talking AI/ML Community:
Slack:
Please follow our Twitter handles too:
You can get our videos, recaps, and event interviews on our YouTube channels. Subscribe to get updates:
The Africa's Talking community lets developers learn the skills a modern-day African developer needs. We are language and framework agnostic. All developers are welcome. This is where the Africa's Talking developer community meets to build, learn, and exchange knowledge.
We are helping software developers and businesses bring their ideas to life through easy-to-use APIs.
Would you like to partner with us? Kindly contact the Developer Experience Team.
Tuesday, June 4, 2024
Arrival and Introduction
The Problem
Break
Hands-on with DVC & Makefiles
Break
Q&A and Further Exploration
Closing Remarks and Networking