Measuring self-focus bias in community-maintained knowledge repositories

Document type

Text

Publisher

ACM Press

Abstract

Self-focus is a novel way of understanding a type of bias in community-maintained Web 2.0 graph structures. It goes beyond previous measures of topical coverage bias by encapsulating both node- and edge-hosted biases in a single holistic measure of an entire community-maintained graph. We outline two methods to quantify self-focus, one of which is very computationally inexpensive, and present empirical evidence for the existence of self-focus using a "hyperlingual" approach that examines 15 different language editions of Wikipedia. We suggest applications of our methods and discuss the risks of ignoring self-focus bias in technological applications.

Description

Hecht, Brent; Gergle, Darren (2009): Measuring self-focus bias in community-maintained knowledge repositories. In: Communities and Technologies 2009: Proceedings of the Fourth Communities and Technologies Conference (Full Papers). ACM Press, pp. 11-20. DOI: 10.1145/1556460.1556463

Referenced By


Number of citations to item: 65

  • Morten Warncke-Wang, Rita Ho, Marshall Miller, Isaac Johnson (2023): Increasing Participation in Peer Production Communities with the Newcomer Homepage, In: Proceedings of the ACM on Human-Computer Interaction CSCW2(7), doi:10.1145/3610071
  • Taha Yasseri, Anselm Spoerri, Mark Graham, Janos Kertesz (2013): The Most Controversial Topics in Wikipedia: A Multilingual and Geographical Analysis, In: SSRN Electronic Journal, doi:10.2139/ssrn.2269392
  • Patrick Gildersleve, Renaud Lambiotte, Taha Yasseri (2023): Between news and history: identifying networked topics of collective attention on Wikipedia, In: Journal of Computational Social Science 2(6), doi:10.1007/s42001-023-00215-w
  • Andrew Hall, Sarah McRoberts, Jacob Thebault-Spieker, Yilun Lin, Shilad Sen, Brent Hecht, Loren Terveen (2017): Freedom versus Standardization, In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3025453.3025940
  • Nicholas Vincent, Isaac Johnson, Brent Hecht (2018): Examining Wikipedia With a Broader Lens, In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3173574.3174140
  • Emily Porter, P. M. Krafft, Brian Keegan (2020): Visual Narratives and Collective Memory across Peer-Produced Accounts of Contested Sociopolitical Events, In: ACM Transactions on Social Computing 1(3), doi:10.1145/3373147
  • Cailean Osborne, Mark Graham, Martin Dittus (2020): Edit Wars in a Contested Digital City: Mapping Wikipedia’s Uneven Augmentations of Berlin, In: The Professional Geographer 1(73), doi:10.1080/00330124.2020.1800493
  • Aileen Oeberst, Till Ridderbecks (2024): How article category in Wikipedia determines the heterogeneity of its editors, In: Scientific Reports 1(14), doi:10.1038/s41598-023-50448-y
  • Marc Miquel-Ribé, David Laniado (2021): The Wikipedia Diversity Observatory: helping communities to bridge content gaps through interactive interfaces, In: Journal of Internet Services and Applications 1(12), doi:10.1186/s13174-021-00141-y
  • Alexander Mehler, Wahed Hemati, Pascal Welke, Maxim Konca, Tolga Uslu (2020): Multiple Texts as a Limiting Factor in Online Learning: Quantifying (Dis-)similarities of Knowledge Networks, In: Frontiers in Education, doi:10.3389/feduc.2020.562670
  • Shilad W. Sen, Heather Ford, David R. Musicant, Mark Graham, Os Keyes, Brent Hecht (2015): Barriers to the Localness of Volunteered Geographic Information, In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, doi:10.1145/2702123.2702170
  • Brian C. Keegan, Jed R. Brubaker (2015): 'Is' to 'Was', In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, doi:10.1145/2675133.2675238
  • Toby Jia-Jun Li, Shilad Sen, Brent Hecht (2014): Leveraging advances in natural language processing to better understand Tobler's first law of geography, In: Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, doi:10.1145/2666310.2666493
  • Esther Weltevrede, Erik Borra (2016): Platform affordances and data practices: The value of dispute on Wikipedia, In: Big Data & Society 1(3), doi:10.1177/2053951716653418
  • Naveena Karusala (2019): How technology converses with local languages, In: XRDS: Crossroads, The ACM Magazine for Students 2(26), doi:10.1145/3368068
  • Brent Hecht, Emily Moxley (2009): Terabytes of Tobler: Evaluating the First Law in a Massive, Domain-Neutral Representation of World Knowledge, In: Lecture Notes in Computer Science, doi:10.1007/978-3-642-03832-7_6
  • Isaac L. Johnson, Yilun Lin, Toby Jia-Jun Li, Andrew Hall, Aaron Halfaker, Johannes Schöning, Brent Hecht (2016): Not at Home on the Range, In: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, doi:10.1145/2858036.2858123
  • Vahid Ashrafimoghari (2023): Detecting Cross-Lingual Information Gaps in Wikipedia, In: Companion Proceedings of the ACM Web Conference 2023, doi:10.1145/3543873.3587539
  • Eduardo Graells-Garrido, Mounia Lalmas, Filippo Menczer (2015): First Women, Second Sex, In: Proceedings of the 26th ACM Conference on Hypertext & Social Media - HT '15, doi:10.1145/2700171.2791036
  • David Abián, Albert Meroño-Peñuela, Elena Simperl (2022): An Analysis of Content Gaps Versus User Needs in the Wikidata Knowledge Graph, In: Lecture Notes in Computer Science, doi:10.1007/978-3-031-19433-7_21
  • Charles Chuankai Zhang, Loren Terveen (2021): Quantifying the Gap: A Case Study of Wikidata Gender Disparities, In: 17th International Symposium on Open Collaboration, doi:10.1145/3479986.3479992
  • David Laniado, Michele Mauri, Erik Borra (2024): Chapter 6. Exploring the evolution of Wikipedia articles through Contropedia, In: Studies in Corpus Linguistics, doi:10.1075/scl.121.06lan
  • Marc Miquel Ribé, David Laniado, Andreas Kaltenbrunner (2021): The Role of Local Content in Wikipedia: A Study on Reader and Editor Engagement, In: Área Abierta 2(21), doi:10.5209/arab.72801
  • Claudia Wagner, Eduardo Graells-Garrido, David Garcia, Filippo Menczer (2016): Women through the glass ceiling: gender asymmetries in Wikipedia, In: EPJ Data Science 1(5), doi:10.1140/epjds/s13688-016-0066-4
  • Tim Gregory (2018): Colonising antinormative sex: The flexibility of post-porn heterosex in random webcam sex, In: Sexualities 4(21), doi:10.1177/1363460717737772
  • Aaron Halfaker, Brian Keegan, Andrea Forte, R. Stuart Geiger, Dario Taraborelli, Maryana Pinchuk, Mikhil Masli (2012): What aren't we measuring?, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462983
  • Tiziano Piccardi, Robert West (2021): Crosslingual Topic Modeling with WikiPDA, In: Proceedings of the Web Conference 2021, doi:10.1145/3442381.3449805
  • Aaron Halfaker (2017): Interpolating Quality Dynamics in Wikipedia and Demonstrating the Keilana Effect, In: Proceedings of the 13th International Symposium on Open Collaboration, doi:10.1145/3125433.3125475
  • Oleksiy Gnatiuk, Victoria Glybovets (2021): Uneven geographies in the various language editions of Wikipedia: the case of Ukrainian cities, In: Hungarian Geographical Bulletin 3(70), doi:10.15201/hungeobull.70.3.4
  • Shilad Sen, Toby Jia-Jun Li, WikiBrain Team, Brent Hecht (2014): WikiBrain, In: Proceedings of The International Symposium on Open Collaboration, doi:10.1145/2641580.2641615
  • Marc Miquel-Ribé, David Laniado (2016): Cultural Identities in Wikipedias, In: Proceedings of the 7th 2016 International Conference on Social Media & Society - SMSociety '16, doi:10.1145/2930971.2930996
  • Daniel Rinser, Dustin Lange, Felix Naumann (2013): Cross-lingual entity matching and infobox alignment in Wikipedia, In: Information Systems 6(38), doi:10.1016/j.is.2012.10.003
  • Ewa S. Callahan, Susan C. Herring (2011): Cultural bias in Wikipedia content on famous persons, In: Journal of the American Society for Information Science and Technology 10(62), doi:10.1002/asi.21577
  • Brent Hecht, Darren Gergle (2000): A Beginner’s Guide to Geographic Virtual Communities Research, In: Handbook of Research on Methods and Techniques for Studying Virtual Communities, doi:10.4018/978-1-60960-040-2.ch019
  • Afra Mashhadi, Giovanni Quattrone, Licia Capra (2013): Putting ubiquitous crowd-sourcing into context, In: Proceedings of the 2013 conference on Computer supported cooperative work, doi:10.1145/2441776.2441845
  • Taryn Bipat, Negin Alimohammadi, Yihan Yu, David W. McDonald, Mark Zachry (2021): Wikipedia Beyond the English Language Edition, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(5), doi:10.1145/3449129
  • Brent Hecht, Darren Gergle (2010): The tower of Babel meets web 2.0, In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, doi:10.1145/1753326.1753370
  • Florian Lemmerich, Diego Sáez-Trumper, Robert West, Leila Zia (2019): Why the World Reads Wikipedia, In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, doi:10.1145/3289600.3291021
  • Scott Hale (2012): Impact of platform design on cross-language information exchange, In: CHI '12 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/2212776.2212456
  • Mo Houtti, Isaac Johnson, Joel Cepeda, Soumya Khandelwal, Aviral Bhatnagar, Loren Terveen (2022): "We Need a Woman in Music": Exploring Wikipedia's Values on Article Priority, In: Proceedings of the ACM on Human-Computer Interaction CSCW2(6), doi:10.1145/3555156
  • Morten Warncke-Wang, Anuradha Uduwage, Zhenhua Dong, John Riedl (2012): In search of the ur-Wikipedia, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462959
  • Paul Laufer, Claudia Wagner, Fabian Flöck, Markus Strohmaier (2015): Mining cross-cultural relations from Wikipedia, In: Proceedings of the ACM Web Science Conference, doi:10.1145/2786451.2786452
  • Brent J. Hecht, Darren Gergle (2010): On the "localness" of user-generated content, In: Proceedings of the 2010 ACM conference on Computer supported cooperative work, doi:10.1145/1718918.1718962
  • Ofer Arazy, Keren Kaplan-Mintz, Dan Malkinson, Yiftach Nagar (2024): A local community on a global collective intelligence platform: A case study of individual preferences and collective bias in ecological citizen science, In: PLOS ONE 8(19), doi:10.1371/journal.pone.0308552
  • Cheong-Iao Pang, Robert P. Biuk-Aghai (2011): Wikipedia world map, In: Proceedings of the 7th International Symposium on Wikis and Open Collaboration, doi:10.1145/2038558.2038579
  • Dwaipayan Roy, Sumit Bhatia, Prateek Jain (2021): Information asymmetry in Wikipedia across different languages: A statistical analysis, In: Journal of the Association for Information Science and Technology 3(73), doi:10.1002/asi.24553
  • Scott A. Hale (2014): Multilinguals and Wikipedia editing, In: Proceedings of the 2014 ACM conference on Web science, doi:10.1145/2615569.2615684
  • Taryn Bipat, David W. McDonald, Mark Zachry (2018): Do We All Talk Before We Type?, In: Proceedings of the 14th International Symposium on Open Collaboration, doi:10.1145/3233391.3233542
  • Molly G. Hickman, Viral Pasad, Harsh Kamalesh Sanghavi, Jacob Thebault-Spieker, Sang Won Lee (2021): Understanding Wikipedia Practices Through Hindi, Urdu, and English Takes on an Evolving Regional Conflict, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(5), doi:10.1145/3449108
  • Scott A. Hale (2014): Okinawa in Japanese and English wikipedia, In: CHI '14 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/2559206.2579413
  • Maitraye Das, Brent Hecht, Darren Gergle (2019): The Gendered Geography of Contributions to OpenStreetMap, In: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3290605.3300793
  • Molly G. Hickman, Viral Pasad, Harsh Sanghavi, Jacob Thebault-Spieker, Sang Won Lee (2020): Wiki HUEs, In: Proceedings of the 2020 International Conference on Information and Communication Technologies and Development, doi:10.1145/3392561.3397586
  • Mark Graham, Stefano De Sabbata (2015): Mapping information wealth and poverty: the geography of gazetteers, In: Environment and Planning A: Economy and Space 6(47), doi:10.1177/0308518x15594899
  • Anna Samoilenko, Fariba Karimi, Daniel Edler, Jérôme Kunegis, Markus Strohmaier (2016): Linguistic neighbourhoods: explaining cultural borders on Wikipedia through multilingual co-editing activity, In: EPJ Data Science 1(5), doi:10.1140/epjds/s13688-016-0070-8
  • Laura K. Nelson, Rebekah Getman, Syed Arefinul Haque (2021): And the Rest is History: Measuring the Scope and Recall of Wikipedia’s Coverage of Three Women’s Movement Subgroups, In: Sociological Methods & Research 4(51), doi:10.1177/00491241211067514
  • Brent Hecht, Johannes Schöning, Thomas Erickson, Reid Priedhorsky (2011): Geographic human-computer interaction, In: CHI '11 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/1979742.1979532
  • Erik Borra, Esther Weltevrede, Paolo Ciuccarelli, Andreas Kaltenbrunner, David Laniado, Giovanni Magni, Michele Mauri, Richard Rogers, Tommaso Venturini (2015): Societal Controversies in Wikipedia Articles, In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, doi:10.1145/2702123.2702436
  • Paolo Massa, Maurizio Napolitano, Federico Scrinzi, Michela Ferron (2012): WikiTrip, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462980
  • Patti Bao, Brent Hecht, Samuel Carton, Mahmood Quaderi, Michael Horn, Darren Gergle (2012): Omnipedia, In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, doi:10.1145/2207676.2208553
  • Paolo Massa, Federico Scrinzi (2012): Manypedia, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462960
  • Aileen Oeberst, Ina von der Beck, Mitja D. Back, Ulrike Cress, Steffen Nestler (2017): Biases in the production and reception of collective knowledge: the case of hindsight bias in Wikipedia, In: Psychological Research 5(82), doi:10.1007/s00426-017-0865-7
  • Jacob Thebault-Spieker, Aaron Halfaker, Loren G. Terveen, Brent Hecht (2018): Distance and Attraction, In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3173574.3173722
  • Yaxuan Yin, Longjie Guo, Jacob Thebault-Spieker (2024): Productivity or Equity? Tradeoffs in Volunteer Microtasking in Humanitarian OpenStreetMap, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(8), doi:10.1145/3637390
  • Marc Miquel-Ribé, David Laniado (2018): Wikipedia Culture Gap: Quantifying Content Imbalances Across 40 Language Editions, In: Frontiers in Physics, doi:10.3389/fphy.2018.00054
Please note: Providing information about citations is only possible thanks to the open metadata APIs provided by crossref.org and opencitations.net. These lists may be incomplete due to unavailable citation data.
Source: opencitations.net, crossref.org