A data loader for the VOC 2007 Dataset. Expects input in a tar file.
Loads a data path given a spark context and labels and returns an RDD[MultiLabeledImage].
A property of the VOC dataset is that images can have multiple labels which we
have to deal with later in the pipeline.
A Spark Context
Path to image tar.
Path to label csv.