Which loss function is most appropriate for multi-class classification with mutually exclusive classes?