The Problem of Majority Voting in Crowdsourcing with Binary Classes

When there are two classes, a majority vote can always be obtained with three labelers. Researchers can utilize this property to obtain a false sense of confidence in their ground truth labels. We demonstrate such a case with 3000 crowdsourced labels for an online hate dataset. Evaluating with percentage agreement, Gwet’s AC1, and Krippendorff’s alpha, results show that using more raters teases out the hidden nuances in raters’ preferences. We show that full agreement among the raters monotonically decreases from three raters (28.4%) to nine raters (19.5%). Ten raters have a higher agreement than any other number of raters, which supports the idea of increasing the number of raters for subjective labeling tasks. Nevertheless, while beneficial, increasing the number of raters cannot be considered as a fundamental solution to the issue of agreement in subjective crowdsourcing tasks, as even with ten raters, there is a non-negligible number of ties (4.11%). We suggest having a small sample of the data labeled by five or more raters to evaluate the stability of agreement among the raters.
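
The point about three labelers can be illustrated with a minimal sketch. It is not the authors' code: the function name `summarize_votes` and the vote list are hypothetical, chosen only to show that three binary votes always yield a majority, while ten votes on the same item can end in a tie.

```python
from collections import Counter


def summarize_votes(votes):
    """Return (majority_label, is_tie, is_full_agreement) for one item's votes."""
    counts = Counter(votes)
    ranked = counts.most_common()
    # A tie occurs when the two most frequent labels have the same count.
    is_tie = len(ranked) > 1 and ranked[0][1] == ranked[1][1]
    majority = None if is_tie else ranked[0][0]
    # Full agreement means every rater chose the same label.
    return majority, is_tie, len(counts) == 1


# Hypothetical votes for a single item (1 = hateful, 0 = not hateful).
votes = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

first_three = summarize_votes(votes[:3])  # three raters: a majority always exists
all_ten = summarize_votes(votes)          # ten raters: here the vote splits 5 to 5
print(first_three, all_ten)
```

In this made-up case the first three raters produce a clear majority label, yet the full panel of ten is evenly split, which is the kind of hidden disagreement the abstract describes.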

Salminen, Joni; Kamel, Ahmed Mohamed; Jung, Soon-Gyo; Jansen, Bernard (2021): The Problem of Majority Voting in Crowdsourcing with Binary Classes. Proceedings of 19th European Conference on Computer-Supported Cooperative Work. DOI: 10.18420/ecscw2021_n12. European Society for Socially Embedded Technologies (EUSSET). PISSN: 2510-2591. Notes. Zurich, Switzerland. 7-11 June 2021

DOI

10.18420/ecscw2021_n12

URI

https://dl.eusset.eu/handle/20.500.12015/4163

Collections

ECSCW 2021 Exploratory Papers and Notes
