Train/test split

Basic train/test evaluation.

0/9

completed

hardStatistical Significance of Performance Differences
8/9

Two models are compared using the same train/test split. Model A achieves 84% test accuracy and Model B achieves 85%. A data scientist concludes Model B is better. What is missing from this analysis?