Applied Machine Learning for Health Professionals

About Course

This two-part series is designed to equip clinicians, researchers, and other healthcare professionals with practical machine-learning skills tailored to medical data. By focusing on real-world examples drawn from electronic health records and cohort studies, participants will gain the tools they need to prepare data, develop predictive models, and evaluate their performance in a clinical context. In Bright Health Science, this course is instructed by Ali M. Shabestari and Dr. Motahare Shabestari.

Part I: Python Programming & Data Preprocessing

Objectives:

Introduce Python fundamentals, from variables and control flow to functions and modules.
Master NumPy & pandas libraries for cleaning, transforming, and organizing tabular health datasets.
Learn best practices for handling outlier values, categorical encoding, normalization, and feature engineering in medical data
Apply techniques directly to sample datasets drawn from cohort studies and electronic health records

Part II: Machine-Learning Models & Clinical Implementation

Objectives:

Explore core supervised-learning algorithms: classifiers (e.g., logistic regression, decision trees) and regressors (e.g., linear regression, random forest)
Understand model assumptions, strengths, and ideal use cases in health fields.
Develop skills for training, hyperparameter tuning, and cross-validation on tabular medical datasets.
Learn rigorous evaluation metrics to assess clinical applicability.

Capstone Project

In the final module, participants will apply their new skills to a real medical dataset. Guided through the end-to-end ML workflow, they will:

Prepare and preprocess raw clinical data
Select and train appropriate models
Tune hyperparameters for optimal performance
Evaluate and interpret results with an eye toward clinical deployment

Course Content

Session 01: Introduction to Python
Data Preprocessing

Prerequisites

Session 02: Variables, Input/Output, Data Types, Strings and Operators
Data Preprocessing

Session 03: Control Flow, For Loop, While Loop
Data Preprocessing

Session 04: Functions, Anonymous Function, Exeption Handling
Data Preprocessing

Session 05: Reading & Writing in Pandas, Indexing, Selecting & Assigning, Summary Functions & Map
Data Preprocessing

Session 06: Grouping & Aggregation, Merging and Combining, Data Types and Missing Values
Data Preprocessing

Session 07: Dataset Introduction, Outlier Detection & Handling, Missing Data Imputation, Data Type Preprocessing
Data Preprocessing

Session 08: Introduction to EDA, Primary Steps, Univariate Analysis, and Bivariate Analysis
Data Preprocessing

Session 09: Introduction to NumPy & Arrays, Array Operations & Indexing, Statistical Analysis with NumPy
Data Preprocessing

Session 10: Classification & Regression, Baseline Method, Prediction & Probability Threshold and Evaluation Metrics and Extra
Machine Learning

Session 11: Decision Tree, Bagging, Boosting, Overfitting & Model Selection
Machine Learning

Session 12: Hyperparameters, Hyperparameter Tuning, and Retraining
Machine Learning

Session 13: Importance of Interpretability, Global Feature Importance and SHAP
Machine Learning

Session 14: Feature Engineering, Feature Creation, Feature Selection and Dimensionality Reduction
Machine Learning

Session 15: Saving and Packaging the Model, Environment Reproducibility, Introduction to Model Deployment and Conclusion & Next Steps
Machine Learning

Session 16: Projects

Student Ratings & Reviews

No Review Yet

About Course

What Will You Learn?

Course Content

Session 01: Introduction to Python Data Preprocessing

Prerequisites

Session 02: Variables, Input/Output, Data Types, Strings and Operators Data Preprocessing

Part 1 – Variables

Part 2 – Input/Output

Part 3 – Data Types

Part 4 – Strings

Part 5 – Operators

Session 03: Control Flow, For Loop, While Loop Data Preprocessing

Part 1 – Control Flow (if / else)

Part 2 – For loop

Part 3 – While loop

Session 04: Functions, Anonymous Function, Exeption Handling Data Preprocessing

Part 1 – Functions

Part 2 – Anonymous Functions (lambda)

Part 3 – Exception Handling

Session 05: Reading & Writing in Pandas, Indexing, Selecting & Assigning, Summary Functions & Map Data Preprocessing

Part 1 – Reading & Writing in Pandas

Part 2 – Indexing, Selecting & Assigning

Part 3 – Summary Functions & Map

Session 06: Grouping & Aggregation, Merging and Combining, Data Types and Missing Values Data Preprocessing

Part 1 – Grouping & Aggregation

Part 2 – Merging & Combining

Part 3 – Data Types & Missing Values

Session 07: Dataset Introduction, Outlier Detection & Handling, Missing Data Imputation, Data Type Preprocessing Data Preprocessing

Part 1 – Dataset Introduction

Part 2 – Outlier Detection & Handling

Part 3 – Missing Data Imputation

Part 4 – Data Type Preprocessing

Session 08: Introduction to EDA, Primary Steps, Univariate Analysis, and Bivariate Analysis Data Preprocessing

Part 1 – Introduction to EDA

Part 2 – Primary Steps (review of data preprocessing)

Part 3 – Univariate Analysis

Part 4 – Bivariate Analysis

Session 09: Introduction to NumPy & Arrays, Array Operations & Indexing, Statistical Analysis with NumPy Data Preprocessing

Part 1 – Introduction to NumPy & Arrays

Part 2 – Array Operations & Indexing

Part 3 – Statistical Analysis with NumPy

Session 10: Classification & Regression, Baseline Method, Prediction & Probability Threshold and Evaluation Metrics and Extra Machine Learning

Part 1 – Classification & Regression

Part 1 – Classification & Regression

Part 3 – Prediction & Probability Threshold

Part 4 – Evaluation Metrics

Part 5 – Extra session (Notebook)

Session 11: Decision Tree, Bagging, Boosting, Overfitting & Model Selection Machine Learning

Part 1 – Decision Tree

Part 2 – Bagging (Random Forest)

Part 3 – Boosting (XGBoost)

Part 4 – Overfitting & Model Selection

Session 12: Hyperparameters, Hyperparameter Tuning, and Retraining Machine Learning

Part 1 – Hyperparameters

Part 2 – Hyperparameter Tuning

Part 3 – Retraining

Session 13: Importance of Interpretability, Global Feature Importance and SHAP Machine Learning

Part 1 – Importance of Interpretability

Part 2 – Global Feature Importance

Part 3 – SHAP

Session 14: Feature Engineering, Feature Creation, Feature Selection and Dimensionality Reduction Machine Learning

Part 1 – Feature Engineering

Part 2 – Feature Creation

Part 3 – Feature Selection

Part 4 – Dimensionality Reduction

Session 15: Saving and Packaging the Model, Environment Reproducibility, Introduction to Model Deployment and Conclusion & Next Steps Machine Learning

Part 1 – Saving & Packaging the Model

Part 2 – Environment Reproducibility

Part 3 – Introduction to Model Deployment

Part 4 – Conclusion & Next Steps