Measuring self-focus bias in community-maintained knowledge repositories

dc.contributor.authorHecht, Brent
dc.contributor.authorGergle, Darren
dc.date.accessioned2017-04-15T12:04:04Z
dc.date.available2017-04-15T12:04:04Z
dc.date.issued2009
dc.description.abstractSelf-focus is a novel way of understanding a type of bias in community-maintained Web 2.0 graph structures. It goes beyond previous measures of topical coverage bias by encapsulating both node- and edge-hosted biases in a single holistic measure of an entire community-maintained graph. We outline two methods to quantify self-focus, one of which is very computationally inexpensive, and present empirical evidence for the existence of self-focus using a "hyperlingual" approach that examines 15 different language editions of Wikipedia. We suggest applications of our methods and discuss the risks of ignoring self-focus bias in technological applications.
dc.identifier.doi10.1145/1556460.1556463
dc.language.isoen
dc.publisherACM Press
dc.relation.ispartofCommunities and Technologies 2009: Proceedings of the Fourth Communities and Technologies Conference
dc.relation.ispartofseriesCommunities and Technologies
dc.titleMeasuring self-focus bias in community-maintained knowledge repositories
dc.typeText
gi.citation.endPage20
gi.citation.startPage11
gi.citations.count65
gi.citations.elementDavid Abián, Albert Meroño-Peñuela, Elena Simperl (2022): An Analysis of Content Gaps Versus User Needs in the Wikidata Knowledge Graph, In: Lecture Notes in Computer Science, doi:10.1007/978-3-031-19433-7_21
gi.citations.elementCheong-Iao Pang, Robert P. Biuk-Aghai (2011): Wikipedia world map, In: Proceedings of the 7th International Symposium on Wikis and Open Collaboration, doi:10.1145/2038558.2038579
gi.citations.elementShilad W. Sen, Heather Ford, David R. Musicant, Mark Graham, Os Keyes, Brent Hecht (2015): Barriers to the Localness of Volunteered Geographic Information, In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, doi:10.1145/2702123.2702170
gi.citations.elementIsaac L. Johnson, Yilun Lin, Toby Jia-Jun Li, Andrew Hall, Aaron Halfaker, Johannes Schöning, Brent Hecht (2016): Not at Home on the Range, In: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, doi:10.1145/2858036.2858123
gi.citations.elementMolly G. Hickman, Viral Pasad, Harsh Sanghavi, Jacob Thebault-Spieker, Sang Won Lee (2020): Wiki HUEs, In: Proceedings of the 2020 International Conference on Information and Communication Technologies and Development, doi:10.1145/3392561.3397586
gi.citations.elementMorten Warncke-Wang, Anuradha Uduwage, Zhenhua Dong, John Riedl (2012): In search of the ur-Wikipedia, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462959
gi.citations.elementDwaipayan Roy, Sumit Bhatia, Prateek Jain (2021): Information asymmetry in Wikipedia across different languages: A statistical analysis, In: Journal of the Association for Information Science and Technology 3(73), doi:10.1002/asi.24553
gi.citations.elementAaron Halfaker (2017): Interpolating Quality Dynamics in Wikipedia and Demonstrating the Keilana Effect, In: Proceedings of the 13th International Symposium on Open Collaboration, doi:10.1145/3125433.3125475
gi.citations.elementErik Borra, Esther Weltevrede, Paolo Ciuccarelli, Andreas Kaltenbrunner, David Laniado, Giovanni Magni, Michele Mauri, Richard Rogers, Tommaso Venturini (2015): Societal Controversies in Wikipedia Articles, In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, doi:10.1145/2702123.2702436
gi.citations.elementOfer Arazy, Keren Kaplan-Mintz, Dan Malkinson, Yiftach Nagar (2024): A local community on a global collective intelligence platform: A case study of individual preferences and collective bias in ecological citizen science, In: PLOS ONE 8(19), doi:10.1371/journal.pone.0308552
gi.citations.elementBrian C. Keegan, Jed R. Brubaker (2015): 'Is' to 'Was', In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, doi:10.1145/2675133.2675238
gi.citations.elementScott Hale (2012): Impact of platform design on cross-language information exchange, In: CHI '12 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/2212776.2212456
gi.citations.elementMarc Miquel-Ribé, David Laniado (2016): Cultural Identities in Wikipedias, In: Proceedings of the 7th 2016 International Conference on Social Media & Society - SMSociety '16, doi:10.1145/2930971.2930996
gi.citations.elementMarc Miquel-Ribé, David Laniado (2021): The Wikipedia Diversity Observatory: helping communities to bridge content gaps through interactive interfaces, In: Journal of Internet Services and Applications 1(12), doi:10.1186/s13174-021-00141-y
gi.citations.elementAileen Oeberst, Ina von der Beck, Mitja D. Back, Ulrike Cress, Steffen Nestler (2017): Biases in the production and reception of collective knowledge: the case of hindsight bias in Wikipedia, In: Psychological Research 5(82), doi:10.1007/s00426-017-0865-7
gi.citations.elementScott A. Hale (2014): Multilinguals and Wikipedia editing, In: Proceedings of the 2014 ACM conference on Web science, doi:10.1145/2615569.2615684
gi.citations.elementPatti Bao, Brent Hecht, Samuel Carton, Mahmood Quaderi, Michael Horn, Darren Gergle (2012): Omnipedia, In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, doi:10.1145/2207676.2208553
gi.citations.elementMarc Miquel Ribé, David Laniado, Andreas Kaltenbrunner (2021): The Role of Local Content in Wikipedia: A Study on Reader and Editor Engagement, In: Área Abierta 2(21), doi:10.5209/arab.72801
gi.citations.elementPatrick Gildersleve, Renaud Lambiotte, Taha Yasseri (2023): Between news and history: identifying networked topics of collective attention on Wikipedia, In: Journal of Computational Social Science 2(6), doi:10.1007/s42001-023-00215-w
gi.citations.elementAileen Oeberst, Till Ridderbecks (2024): How article category in Wikipedia determines the heterogeneity of its editors, In: Scientific Reports 1(14), doi:10.1038/s41598-023-50448-y
gi.citations.elementBrent Hecht, Darren Gergle (2000): A Beginner’s Guide to Geographic Virtual Communities Research, In: Handbook of Research on Methods and Techniques for Studying Virtual Communities, doi:10.4018/978-1-60960-040-2.ch019
gi.citations.elementPaul Laufer, Claudia Wagner, Fabian Flöck, Markus Strohmaier (2015): Mining cross-cultural relations from Wikipedia, In: Proceedings of the ACM Web Science Conference, doi:10.1145/2786451.2786452
gi.citations.elementAfra Mashhadi, Giovanni Quattrone, Licia Capra (2013): Putting ubiquitous crowd-sourcing into context, In: Proceedings of the 2013 conference on Computer supported cooperative work, doi:10.1145/2441776.2441845
gi.citations.elementMorten Warncke-Wang, Rita Ho, Marshall Miller, Isaac Johnson (2023): Increasing Participation in Peer Production Communities with the Newcomer Homepage, In: Proceedings of the ACM on Human-Computer Interaction CSCW2(7), doi:10.1145/3610071
gi.citations.elementClaudia Wagner, Eduardo Graells-Garrido, David Garcia, Filippo Menczer (2016): Women through the glass ceiling: gender asymmetries in Wikipedia, In: EPJ Data Science 1(5), doi:10.1140/epjds/s13688-016-0066-4
gi.citations.elementMark Graham, Stefano De Sabbata (2015): Mapping information wealth and poverty: the geography of gazetteers, In: Environment and Planning A: Economy and Space 6(47), doi:10.1177/0308518x15594899
gi.citations.elementAaron Halfaker, Brian Keegan, Andrea Forte, R. Stuart Geiger, Dario Taraborelli, Maryana Pinchuk, Mikhil Masli (2012): What aren't we measuring?, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462983
gi.citations.elementScott A. Hale (2014): Okinawa in Japanese and English wikipedia, In: CHI '14 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/2559206.2579413
gi.citations.elementToby Jia-Jun Li, Shilad Sen, Brent Hecht (2014): Leveraging advances in natural language processing to better understand Tobler's first law of geography, In: Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, doi:10.1145/2666310.2666493
gi.citations.elementBrent Hecht, Emily Moxley (2009): Terabytes of Tobler: Evaluating the First Law in a Massive, Domain-Neutral Representation of World Knowledge, In: Lecture Notes in Computer Science, doi:10.1007/978-3-642-03832-7_6
gi.citations.elementJacob Thebault-Spieker, Aaron Halfaker, Loren G. Terveen, Brent Hecht (2018): Distance and Attraction, In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3173574.3173722
gi.citations.elementVahid Ashrafimoghari (2023): Detecting Cross-Lingual Information Gaps in Wikipedia, In: Companion Proceedings of the ACM Web Conference 2023, doi:10.1145/3543873.3587539
gi.citations.elementPaolo Massa, Federico Scrinzi (2012): Manypedia, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462960
gi.citations.elementEmily Porter, P. M. Krafft, Brian Keegan (2020): Visual Narratives and Collective Memory across Peer-Produced Accounts of Contested Sociopolitical Events, In: ACM Transactions on Social Computing 1(3), doi:10.1145/3373147
gi.citations.elementCharles Chuankai Zhang, Loren Terveen (2021): Quantifying the Gap: A Case Study of Wikidata Gender Disparities, In: 17th International Symposium on Open Collaboration, doi:10.1145/3479986.3479992
gi.citations.elementMo Houtti, Isaac Johnson, Joel Cepeda, Soumya Khandelwal, Aviral Bhatnagar, Loren Terveen (2022): "We Need a Woman in Music": Exploring Wikipedia's Values on Article Priority, In: Proceedings of the ACM on Human-Computer Interaction CSCW2(6), doi:10.1145/3555156
gi.citations.elementTaha Yasseri, Anselm Spoerri, Mark Graham, Janos Kertesz (2013): The Most Controversial Topics in Wikipedia: A Multilingual and Geographical Analysis, In: SSRN Electronic Journal, doi:10.2139/ssrn.2269392
gi.citations.elementShilad Sen, Toby Jia-Jun Li, WikiBrain Team, Brent Hecht (2014): WikiBrain, In: Proceedings of The International Symposium on Open Collaboration, doi:10.1145/2641580.2641615
gi.citations.elementLaura K. Nelson, Rebekah Getman, Syed Arefinul Haque (2021): And the Rest is History: Measuring the Scope and Recall of Wikipedia’s Coverage of Three Women’s Movement Subgroups, In: Sociological Methods & Research 4(51), doi:10.1177/00491241211067514
gi.citations.elementAndrew Hall, Sarah McRoberts, Jacob Thebault-Spieker, Yilun Lin, Shilad Sen, Brent Hecht, Loren Terveen (2017): Freedom versus Standardization, In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3025453.3025940
gi.citations.elementAlexander Mehler, Wahed Hemati, Pascal Welke, Maxim Konca, Tolga Uslu (2020): Multiple Texts as a Limiting Factor in Online Learning: Quantifying (Dis-)similarities of Knowledge Networks, In: Frontiers in Education, doi:10.3389/feduc.2020.562670
gi.citations.elementNaveena Karusala (2019): How technology converses with local languages, In: XRDS: Crossroads, The ACM Magazine for Students 2(26), doi:10.1145/3368068
gi.citations.elementEwa S. Callahan, Susan C. Herring (2011): Cultural bias in Wikipedia content on famous persons, In: Journal of the American Society for Information Science and Technology 10(62), doi:10.1002/asi.21577
gi.citations.elementNicholas Vincent, Isaac Johnson, Brent Hecht (2018): Examining Wikipedia With a Broader Lens, In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3173574.3174140
gi.citations.elementFlorian Lemmerich, Diego Sáez-Trumper, Robert West, Leila Zia (2019): Why the World Reads Wikipedia, In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, doi:10.1145/3289600.3291021
gi.citations.elementAnna Samoilenko, Fariba Karimi, Daniel Edler, Jérôme Kunegis, Markus Strohmaier (2016): Linguistic neighbourhoods: explaining cultural borders on Wikipedia through multilingual co-editing activity, In: EPJ Data Science 1(5), doi:10.1140/epjds/s13688-016-0070-8
gi.citations.elementTiziano Piccardi, Robert West (2021): Crosslingual Topic Modeling with WikiPDA, In: Proceedings of the Web Conference 2021, doi:10.1145/3442381.3449805
gi.citations.elementBrent Hecht, Johannes Schöning, Thomas Erickson, Reid Priedhorsky (2011): Geographic human-computer interaction, In: CHI '11 Extended Abstracts on Human Factors in Computing Systems, doi:10.1145/1979742.1979532
gi.citations.elementDavid Laniado, Michele Mauri, Erik Borra (2024): Chapter 6. Exploring the evolution of Wikipedia articles through Contropedia, In: Studies in Corpus Linguistics, doi:10.1075/scl.121.06lan
gi.citations.elementPaolo Massa, Maurizio Napolitano, Federico Scrinzi, Michela Ferron (2012): WikiTrip, In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, doi:10.1145/2462932.2462980
gi.citations.elementTim Gregory (2018): Colonising antinormative sex: The flexibility of post-porn heterosex in random webcam sex, In: Sexualities 4(21), doi:10.1177/1363460717737772
gi.citations.elementYaxuan Yin, Longjie Guo, Jacob Thebault-Spieker (2024): Productivity or Equity? Tradeoffs in Volunteer Microtasking in Humanitarian OpenStreetMap, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(8), doi:10.1145/3637390
gi.citations.elementMaitraye Das, Brent Hecht, Darren Gergle (2019): The Gendered Geography of Contributions to OpenStreetMap, In: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, doi:10.1145/3290605.3300793
gi.citations.elementEduardo Graells-Garrido, Mounia Lalmas, Filippo Menczer (2015): First Women, Second Sex, In: Proceedings of the 26th ACM Conference on Hypertext & Social Media - HT '15, doi:10.1145/2700171.2791036
gi.citations.elementTaryn Bipat, Negin Alimohammadi, Yihan Yu, David W. McDonald, Mark Zachry (2021): Wikipedia Beyond the English Language Edition, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(5), doi:10.1145/3449129
gi.citations.elementCailean Osborne, Mark Graham, Martin Dittus (2020): Edit Wars in a Contested Digital City: Mapping Wikipedia’s Uneven Augmentations of Berlin, In: The Professional Geographer 1(73), doi:10.1080/00330124.2020.1800493
gi.citations.elementBrent J. Hecht, Darren Gergle (2010): On the "localness" of user-generated content, In: Proceedings of the 2010 ACM conference on Computer supported cooperative work, doi:10.1145/1718918.1718962
gi.citations.elementDaniel Rinser, Dustin Lange, Felix Naumann (2013): Cross-lingual entity matching and infobox alignment in Wikipedia, In: Information Systems 6(38), doi:10.1016/j.is.2012.10.003
gi.citations.elementEsther Weltevrede, Erik Borra (2016): Platform affordances and data practices: The value of dispute on Wikipedia, In: Big Data & Society 1(3), doi:10.1177/2053951716653418
gi.citations.elementOleksiy Gnatiuk, Victoria Glybovets, (2021): Uneven geographies in the various language editions of Wikipedia: the case of Ukrainian cities, In: Hungarian Geographical Bulletin 3(70), doi:10.15201/hungeobull.70.3.4
gi.citations.elementTaryn Bipat, David W. McDonald, Mark Zachry (2018): Do We All Talk Before We Type?, In: Proceedings of the 14th International Symposium on Open Collaboration, doi:10.1145/3233391.3233542
gi.citations.elementBrent Hecht, Darren Gergle (2010): The tower of Babel meets web 2.0, In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, doi:10.1145/1753326.1753370
gi.citations.elementMolly G. Hickman, Viral Pasad, Harsh Kamalesh Sanghavi, Jacob Thebault-Spieker, Sang Won Lee (2021): Understanding Wikipedia Practices Through Hindi, Urdu, and English Takes on an Evolving Regional Conflict, In: Proceedings of the ACM on Human-Computer Interaction CSCW1(5), doi:10.1145/3449108
gi.citations.elementMarc Miquel-Ribé, David Laniado (2018): Wikipedia Culture Gap: Quantifying Content Imbalances Across 40 Language Editions, In: Frontiers in Physics, doi:10.3389/fphy.2018.00054
gi.conference.sessiontitleFull Papers

Files

Original bundle

1 - 1 of 1
Loading...
Thumbnail Image
Name:
00401.pdf
Size:
2.52 MB
Format:
Adobe Portable Document Format

License bundle

1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: