Measuring self-focus bias in community-maintained knowledge repositories
Loading...
Fulltext URI
Document type
Text
Files
Additional Information
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
ACM Press
Abstract
Self-focus is a novel way of understanding a type of bias in community-maintained Web 2.0 graph structures. It goes beyond previous measures of topical coverage bias by encapsulating both node- and edge-hosted biases in a single holistic measure of an entire community-maintained graph. We outline two methods to quantify self-focus, one of which is very computationally inexpensive, and present empirical evidence for the existence of self-focus using a "hyperlingual" approach that examines 15 different language editions of Wikipedia. We suggest applications of our methods and discuss the risks of ignoring self-focus bias in technological applications.
Description
Keywords
Citation
URI
URI
Endorsement
Review
Supplemented By
Referenced By
Number of citations to item: 65
- Morten Warncke-Wang, Rita Ho, Marshall Miller, Isaac Johnson (2023): Increasing Participation in Peer Production Communities with the Newcomer Homepage, In: Proceedings of the ACM on Human-Computer Interaction CSCW2(7), doi:10.1145/3610071
- Taha Yasseri, Anselm Spoerri, Mark Graham, Janos Kertesz (2013): The Most Controversial Topics in Wikipedia: A Multilingual and Geographical Analysis, In: SSRN Electronic Journal, doi:10.2139/ssrn.2269392
- Patrick Gildersleve, Renaud Lambiotte, Taha Yasseri (2023): Between news and history: identifying networked topics of collective attention on Wikipedia, In: Journal of Computational Social Science 2(6), doi:10.1007/s42001-023-00215-w
- Andrew Hall, Sarah McRoberts, Jacob Thebault-Spieker, Yilun Lin, Shilad Sen, Brent Hecht, Loren Terveen (2017): Freedom versus Standardization, In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3025453.3025940
- Nicholas Vincent, Isaac Johnson, Brent Hecht (2018): Examining Wikipedia With a Broader Lens, In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3173574.3174140
- Emily Porter, P. M. Krafft, Brian Keegan (2020): Visual Narratives and Collective Memory across Peer-Produced Accounts of Contested Sociopolitical Events, In: ACM Transactions on Social Computing 1(3), doi:10.1145/3373147
- Cailean Osborne, Mark Graham, Martin Dittus (2020): Edit Wars in a Contested Digital City: Mapping Wikipedia’s Uneven Augmentations of Berlin, In: The Professional Geographer 1(73), doi:10.1080/00330124.2020.1800493
- Aileen Oeberst, Till Ridderbecks (2024): How article category in Wikipedia determines the heterogeneity of its editors, In: Scientific Reports 1(14), doi:10.1038/s41598-023-50448-y
- Marc Miquel-Ribé, David Laniado (2021): The Wikipedia Diversity Observatory: helping communities to bridge content gaps through interactive interfaces, In: Journal of Internet Services and Applications 1(12), doi:10.1186/s13174-021-00141-y
- Alexander Mehler, Wahed Hemati, Pascal Welke, Maxim Konca, Tolga Uslu (2020): Multiple Texts as a Limiting Factor in Online Learning: Quantifying (Dis-)similarities of Knowledge Networks, In: Frontiers in Education, doi:10.3389/feduc.2020.562670
- Shilad W. Sen, Heather Ford, David R. Musicant, Mark Graham, Os Keyes, Brent Hecht (2015): Barriers to the Localness of Volunteered Geographic Information, In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, doi:10.1145/2702123.2702170
- Brian C. Keegan, Jed R. Brubaker (2015): 'Is' to 'Was', In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, doi:10.1145/2675133.2675238
- Toby Jia-Jun Li, Shilad Sen, Brent Hecht (2014): Leveraging advances in natural language processing to better understand Tobler's first law of geography, In: Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, doi:10.1145/2666310.2666493
- Esther Weltevrede, Erik Borra (2016): Platform affordances and data practices: The value of dispute on Wikipedia, In: Big Data & Society 1(3), doi:10.1177/2053951716653418
- Naveena Karusala (2019): How technology converses with local languages, In: XRDS: Crossroads, The ACM Magazine for Students 2(26), doi:10.1145/3368068
- Brent Hecht, Emily Moxley (2009): Terabytes of Tobler: Evaluating the First Law in a Massive, Domain-Neutral Representation of World Knowledge, In: Lecture Notes in Computer Science, doi:10.1007/978-3-642-03832-7_6
- Isaac L. Johnson, Yilun Lin, Toby Jia-Jun Li, Andrew Hall, Aaron Halfaker, Johannes Schöning, Brent Hecht (2016): Not at Home on the Range, In: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, doi:10.1145/2858036.2858123
- Vahid Ashrafimoghari (2023): Detecting Cross-Lingual Information Gaps in Wikipedia, In: Companion Proceedings of the ACM Web Conference 2023, doi:10.1145/3543873.3587539
- Eduardo Graells-Garrido, Mounia Lalmas, Filippo Menczer (2015): First Women, Second Sex, In: Proceedings of the 26th ACM Conference on Hypertext & Social Media - HT '15, doi:10.1145/2700171.2791036
- David Abián, Albert Meroño-Peñuela, Elena Simperl (2022): An Analysis of Content Gaps Versus User Needs in the Wikidata Knowledge Graph, In: Lecture Notes in Computer Science, doi:10.1007/978-3-031-19433-7_21
- Charles Chuankai Zhang, Loren Terveen (2021): Quantifying the Gap: A Case Study of Wikidata Gender Disparities, In: 17th International Symposium on Open Collaboration, doi:10.1145/3479986.3479992
- David Laniado, Michele Mauri, Erik Borra (2024): Chapter 6. Exploring the evolution of Wikipedia articles through Contropedia, In: Studies in Corpus Linguistics, doi:10.1075/scl.121.06lan
- Marc Miquel Ribé, David Laniado, Andreas Kaltenbrunner (2021): The Role of Local Content in Wikipedia: A Study on Reader and Editor Engagement, In: Área Abierta 2(21), doi:10.5209/arab.72801
- Claudia Wagner, Eduardo Graells-Garrido, David Garcia, Filippo Menczer (2016): Women through the glass ceiling: gender asymmetries in Wikipedia, In: EPJ Data Science 1(5), doi:10.1140/epjds/s13688-016-0066-4
- Tim Gregory (2018): Colonising antinormative sex: The flexibility of post-porn heterosex in random webcam sex, In: Sexualities 4(21), doi:10.1177/1363460717737772
- Aaron Halfaker, Brian Keegan, Andrea Forte, R. Stuart Geiger, Dario Taraborelli, Maryana Pinchuk, Mikhil Masli (2012): What aren't we measuring?, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462983
- Tiziano Piccardi, Robert West (2021): Crosslingual Topic Modeling with WikiPDA, In: Proceedings of the Web Conference 2021, doi:10.1145/3442381.3449805
- Aaron Halfaker (2017): Interpolating Quality Dynamics in Wikipedia and Demonstrating the Keilana Effect, In: Proceedings of the 13th International Symposium on Open Collaboration, doi:10.1145/3125433.3125475
- Oleksiy Gnatiuk, Victoria Glybovets, (2021): Uneven geographies in the various language editions of Wikipedia: the case of Ukrainian cities, In: Hungarian Geographical Bulletin 3(70), doi:10.15201/hungeobull.70.3.4
- Shilad Sen, Toby Jia-Jun Li, WikiBrain Team, Brent Hecht (2014): WikiBrain, In: Proceedings of The International Symposium on Open Collaboration, doi:10.1145/2641580.2641615
- Marc Miquel-Ribé, David Laniado (2016): Cultural Identities in Wikipedias, In: Proceedings of the 7th 2016 International Conference on Social Media & Society - SMSociety '16, doi:10.1145/2930971.2930996
- Daniel Rinser, Dustin Lange, Felix Naumann (2013): Cross-lingual entity matching and infobox alignment in Wikipedia, In: Information Systems 6(38), doi:10.1016/j.is.2012.10.003
- Ewa S. Callahan, Susan C. Herring (2011): Cultural bias in Wikipedia content on famous persons, In: Journal of the American Society for Information Science and Technology 10(62), doi:10.1002/asi.21577
- Brent Hecht, Darren Gergle (2000): A Beginner’s Guide to Geographic Virtual Communities Research, In: Handbook of Research on Methods and Techniques for Studying Virtual Communities, doi:10.4018/978-1-60960-040-2.ch019
- Afra Mashhadi, Giovanni Quattrone, Licia Capra (2013): Putting ubiquitous crowd-sourcing into context, In: Proceedings of the 2013 conference on Computer supported cooperative work, doi:10.1145/2441776.2441845
- Taryn Bipat, Negin Alimohammadi, Yihan Yu, David W. McDonald, Mark Zachry (2021): Wikipedia Beyond the English Language Edition, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(5), doi:10.1145/3449129
- Brent Hecht, Darren Gergle (2010): The tower of Babel meets web 2.0, In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, doi:10.1145/1753326.1753370
- Florian Lemmerich, Diego Sáez-Trumper, Robert West, Leila Zia (2019): Why the World Reads Wikipedia, In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, doi:10.1145/3289600.3291021
- Scott Hale (2012): Impact of platform design on cross-language information exchange, In: CHI '12 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/2212776.2212456
- Mo Houtti, Isaac Johnson, Joel Cepeda, Soumya Khandelwal, Aviral Bhatnagar, Loren Terveen (2022): "We Need a Woman in Music": Exploring Wikipedia's Values on Article Priority, In: Proceedings of the ACM on Human-Computer Interaction CSCW2(6), doi:10.1145/3555156
- Morten Warncke-Wang, Anuradha Uduwage, Zhenhua Dong, John Riedl (2012): In search of the ur-Wikipedia, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462959
- Paul Laufer, Claudia Wagner, Fabian Flöck, Markus Strohmaier (2015): Mining cross-cultural relations from Wikipedia, In: Proceedings of the ACM Web Science Conference, doi:10.1145/2786451.2786452
- Brent J. Hecht, Darren Gergle (2010): On the "localness" of user-generated content, In: Proceedings of the 2010 ACM conference on Computer supported cooperative work, doi:10.1145/1718918.1718962
- Ofer Arazy, Keren Kaplan-Mintz, Dan Malkinson, Yiftach Nagar (2024): A local community on a global collective intelligence platform: A case study of individual preferences and collective bias in ecological citizen science, In: PLOS ONE 8(19), doi:10.1371/journal.pone.0308552
- Cheong-Iao Pang, Robert P. Biuk-Aghai (2011): Wikipedia world map, In: Proceedings of the 7th International Symposium on Wikis and Open Collaboration, doi:10.1145/2038558.2038579
- Dwaipayan Roy, Sumit Bhatia, Prateek Jain (2021): Information asymmetry in Wikipedia across different languages: A statistical analysis, In: Journal of the Association for Information Science and Technology 3(73), doi:10.1002/asi.24553
- Scott A. Hale (2014): Multilinguals and Wikipedia editing, In: Proceedings of the 2014 ACM conference on Web science, doi:10.1145/2615569.2615684
- Taryn Bipat, David W. McDonald, Mark Zachry (2018): Do We All Talk Before We Type?, In: Proceedings of the 14th International Symposium on Open Collaboration, doi:10.1145/3233391.3233542
- Molly G. Hickman, Viral Pasad, Harsh Kamalesh Sanghavi, Jacob Thebault-Spieker, Sang Won Lee (2021): Understanding Wikipedia Practices Through Hindi, Urdu, and English Takes on an Evolving Regional Conflict, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(5), doi:10.1145/3449108
- Scott A. Hale (2014): Okinawa in Japanese and English wikipedia, In: CHI '14 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/2559206.2579413
- Maitraye Das, Brent Hecht, Darren Gergle (2019): The Gendered Geography of Contributions to OpenStreetMap, In: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3290605.3300793
- Molly G. Hickman, Viral Pasad, Harsh Sanghavi, Jacob Thebault-Spieker, Sang Won Lee (2020): Wiki HUEs, In: Proceedings of the 2020 International Conference on Information and Communication Technologies and Development, doi:10.1145/3392561.3397586
- Mark Graham, Stefano De Sabbata (2015): Mapping information wealth and poverty: the geography of gazetteers, In: Environment and Planning A: Economy and Space 6(47), doi:10.1177/0308518x15594899
- Anna Samoilenko, Fariba Karimi, Daniel Edler, Jérôme Kunegis, Markus Strohmaier (2016): Linguistic neighbourhoods: explaining cultural borders on Wikipedia through multilingual co-editing activity, In: EPJ Data Science 1(5), doi:10.1140/epjds/s13688-016-0070-8
- Laura K. Nelson, Rebekah Getman, Syed Arefinul Haque (2021): And the Rest is History: Measuring the Scope and Recall of Wikipedia’s Coverage of Three Women’s Movement Subgroups, In: Sociological Methods & Research 4(51), doi:10.1177/00491241211067514
- Brent Hecht, Johannes Schöning, Thomas Erickson, Reid Priedhorsky (2011): Geographic human-computer interaction, In: CHI '11 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/1979742.1979532
- Erik Borra, Esther Weltevrede, Paolo Ciuccarelli, Andreas Kaltenbrunner, David Laniado, Giovanni Magni, Michele Mauri, Richard Rogers, Tommaso Venturini (2015): Societal Controversies in Wikipedia Articles, In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, doi:10.1145/2702123.2702436
- Paolo Massa, Maurizio Napolitano, Federico Scrinzi, Michela Ferron (2012): WikiTrip, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462980
- Patti Bao, Brent Hecht, Samuel Carton, Mahmood Quaderi, Michael Horn, Darren Gergle (2012): Omnipedia, In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, doi:10.1145/2207676.2208553
- Paolo Massa, Federico Scrinzi (2012): Manypedia, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462960
- Aileen Oeberst, Ina von der Beck, Mitja D. Back, Ulrike Cress, Steffen Nestler (2017): Biases in the production and reception of collective knowledge: the case of hindsight bias in Wikipedia, In: Psychological Research 5(82), doi:10.1007/s00426-017-0865-7
- Jacob Thebault-Spieker, Aaron Halfaker, Loren G. Terveen, Brent Hecht (2018): Distance and Attraction, In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3173574.3173722
- Yaxuan Yin, Longjie Guo, Jacob Thebault-Spieker (2024): Productivity or Equity? Tradeoffs in Volunteer Microtasking in Humanitarian OpenStreetMap, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(8), doi:10.1145/3637390
- Marc Miquel-Ribé, David Laniado (2018): Wikipedia Culture Gap: Quantifying Content Imbalances Across 40 Language Editions, In: Frontiers in Physics, doi:10.3389/fphy.2018.00054