Back to Labs

Predict Credit Default (Imbalanced)

Advanced classification project · 21 steps · Precision · recall

About

Binary credit outcomes are imbalanced: headline accuracy can hide poor detection of bad risk. The positive label is bad credit—optimize precision, recall, and threshold on probabilities, not accuracy alone. Uses a stratified train/test split.

Data

CSV at /datasets/german-credit/german.csv (German Credit data, OpenML-style labels).

Task

Complete TODOs in order. Use hints first. Switch to Sample solution when stuck.

Pro project

Unlock guided projects to solve realistic ML tasks end-to-end, run full notebooks in the browser, and build interview-ready project confidence.

Upgrade to Pro

Dataset credit: Statlog (German Credit Data) dataset by H. Hofmann (UCI Machine Learning Repository, DOI: 10.24432/C5NC77), licensed under CC BY 4.0. Dataset page: UCI Statlog German Credit.