Compute the mean average precision for multi-class classification NOTE: The current implementation is only suitable when we have a small number of classes and data items.
For every test image, this contains a list of scores for each class
For every test image, this contains list of valid labels. Labels are assumed to be class ids.
An array containing average precision scores for each class