PhD student Faiza Khan Khattak and CCLS researcher Ansaf Salleb-Aouissi's paper "Robust Crowd Labeling using Little Expertise" gets accepted at the Sixteenth International Conference on Discovery Science
Crowd-labeling emerged from the need to label large-scale and complex data, a tedious, expensive, and time-consuming task. But the problem of obtaining good-quality labels from a crowd and integrating them remains unresolved. To address this challenge, we propose a new framework that automatically combines and boosts bulk crowd labels, supported by a limited number of “ground truth” labels from experts.
The ground truth labels help to estimate the individual expertise of crowd labelers and the difficulty of each instance, both of which are used to aggregate the labels.
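To make the idea concrete, here is a minimal sketch of expertise-weighted aggregation for binary labels. It is not the algorithm from the paper: it assumes a simple smoothed accuracy estimate on the expert-labeled instances and a log-odds weighted vote, and it leaves out instance difficulty for brevity. The function names, the data layout, and the smoothing parameter are all illustrative assumptions.

```python
import math


def estimate_accuracy(crowd_labels, expert_labels, smoothing=1.0):
    """Estimate each labeler's accuracy on the expert-labeled instances.

    crowd_labels: dict mapping labeler -> {instance_id: label in {0, 1}}
    expert_labels: dict mapping instance_id -> ground truth label in {0, 1}
    The smoothing term (an assumption of this sketch) keeps every
    estimate strictly between 0 and 1.
    """
    accuracy = {}
    for labeler, labels in crowd_labels.items():
        graded = [i for i in labels if i in expert_labels]
        correct = sum(labels[i] == expert_labels[i] for i in graded)
        accuracy[labeler] = (correct + smoothing) / (len(graded) + 2 * smoothing)
    return accuracy


def aggregate(crowd_labels, expert_labels):
    """Combine crowd labels with a log-odds weighted vote.

    Labelers who score below 50% on the expert-labeled instances get a
    negative weight, so their votes count toward the opposite label.
    """
    accuracy = estimate_accuracy(crowd_labels, expert_labels)
    weights = {l: math.log(a / (1.0 - a)) for l, a in accuracy.items()}

    instances = {i for labels in crowd_labels.values() for i in labels}
    result = {}
    for i in instances:
        if i in expert_labels:
            # Trust the expert label wherever one exists.
            result[i] = expert_labels[i]
            continue
        score = 0.0
        for labeler, labels in crowd_labels.items():
            if i in labels:
                score += weights[labeler] * (1 if labels[i] == 1 else -1)
        result[i] = 1 if score > 0 else 0
    return result
```

In this sketch, a crowd dominated by labelers who are consistently wrong can still yield correct aggregated labels, because the expert-labeled instances expose those labelers and their votes are inverted. A plain majority vote would simply follow them.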
We show through extensive experiments that, unlike other state-of-the-art approaches, our method is robust even in the presence of a large proportion of bad labelers in the crowd. We derive a lower bound on the number of expert labels needed to judge the crowd and the dataset, as well as to obtain better-quality labels.