Automated retraining pipeline with PSI-based drift detection, model versioning, and champion/challenger promotion for a churn prediction model.
End-to-end ML pipeline with data validation, training, evaluation, and conditional promotion
Production ML models degrade over time as incoming data distributions shift. This pipeline wraps a churn prediction model in six automated stages: data validation, feature engineering, model training, holdout evaluation, champion/challenger comparison, and conditional promotion. PSI monitors every numeric feature for drift, and the pipeline runs automatically via GitHub Actions on every push to main.
Five-page Streamlit application for pipeline monitoring and control
Six-stage pipeline with conditional promotion and drift monitoring
The pipeline monitors data quality and model performance continuously. PSI (Population Stability Index) is computed for every numeric feature by comparing the incoming data distribution against the training baseline. Features with PSI above 0.25 trigger a retraining recommendation. The champion/challenger pattern ensures new models are only promoted when they beat the current production model by a configurable margin on ROC AUC. Every run is logged with full metrics for audit.
Local, Docker, or CI/CD — pick your deployment path