Predict Credit Default (Imbalanced)
Advanced classification project · 21 steps · Precision · recall
About
Binary credit outcomes are imbalanced: headline accuracy can hide poor detection of bad risk. The positive label is bad credit—optimize precision, recall, and threshold on probabilities, not accuracy alone. Uses a stratified train/test split.
Data
CSV at /datasets/german-credit/german.csv (German Credit data, OpenML-style labels).
Task
Complete TODOs in order. Use hints first. Switch to Sample solution when stuck.
Pro project
Unlock guided projects to solve realistic ML tasks end-to-end, run full notebooks in the browser, and build interview-ready project confidence.
Upgrade to ProDataset credit: Statlog (German Credit Data) dataset by H. Hofmann (UCI Machine Learning Repository, DOI: 10.24432/C5NC77), licensed under CC BY 4.0. Dataset page: UCI Statlog German Credit.