Background

Supertype Coursebook is an open source project that aims to provide a structured, guided learning path for data professionals. The coursebook covers three main topics: Enterprise Data Management, Data Engineering, and Financial Data Analytics. There are more than 15 lessons in total, each with a mix of theory and practical exercises.

Open-source Learning Path

We’re open-sourcing this project to help other professionals learn valuable data skills. You can use this coursebook as a standalone resource or as part of a larger data professional development program.

Custom Enterprise Training with Supertype

If you’re interested in custom training for your organization, Supertype is a data science and AI consultancy that helps organizations build analytics and AI capabilities. We can help you build a custom training program and provide expert instructors to help your team learn valuable data skills.

Setting up

The first step to becoming a data professional is your tooling and environment setup.

Learning Path

iDateTopicDescription
116 AugSet Up and ToolingGet your programming environment set up
219 AugOverview: Enterprise Data ManagementAn overview of enterprise data management and its historical context
320 AugModern Enterprise Data ManagementAn overview of the key components of enterprise data management
421 AugPython Programming BasicsPython 101
522 AugAPI Programming with PythonEverything to get you started with creating programs using Financial APIs
623 AugData Analysis with Pandas IThe de-facto library for data manipulation and analysis in Python
726 AugData Analysis with Pandas IIGraded Quiz
827 AugLab: Data ManipulationUsing Pandas, perform a series of data manipulation and analysis tasks on Indonesia banking sector data provided by Sectors
928 AugRecap and PracticeTime for a recap of the lessons thus far!
1029 AugIntroduction to RDBMs and ERDDuckDB, Relational Database Management Systems, creating SQL from existing CSVs
1130 AugLab: SQL PracticeSQL Queries in Practice
122 SepData Visualization BasicsAn overview of data visualization history and why it is important in the big data era
Data Visualization FundamentalAn overview of the data visualization fundamental
133 SepData Visualization ToolsA summary of widely-used data visualization tools (+ hands-on into Tableau Public)
Lab: Data VisualizationPractical graded exercise for data visualization and data storytelling
144 SepLab: Data VisualizationPractical graded exercise for data visualization and data storytelling
155 SepIntroduction to NoSQLBasic understanding on NoSQL paradigm and its examples, PostgreSQL, and Redis for caching
166 SepLab (cont.): Redis for caching; Introduction to Apache KafkaLearning fundamental key concepts on Apache Kafka
179 SepLab: Data Streaming Orchestrating a simple data streaming pipeline using Apache Kafka, Spark, Cassandra, MySQL, and Streamlit (optional)
1810 SepLab (cont.) + Final ProjectOrchestrating a simple data streaming pipeline using Apache Kafka, Spark, Cassandra, MySQL, and Streamlit (optional)
1911 SepFinal ProjectWork on the final project for the Cousebook
2012 SepFinal ProjectWork on the final project for the Cousebook
2113 SepFinal ProjectWork on the final project for the Cousebook

Helpful Resources

Beyond Coursebook (this app you’re looking at), there are a number of resources you can use to supplement your learning.