GitHubInclusifier: Finding and fixing non-inclusive language in GitHub Repositories

Liam Todd, John Grundy, Christoph Treude

Research output: Chapter in Book/Report/Conference proceeding > Conference Paper > Other > peer-review

Abstract

Non-inclusive language in software artefacts has been recognised as a serious problem. We describe a tool to find and fix non-inclusive language in a variety of GitHub repository artefacts. These include various README files, PDFs, code comments, and code. A wide variety of non-inclusive language, including racist, ageist, ableist, violent and other terms, is located, and issues are created that tag the artefacts for checking. Suggested fixes can be generated using third-party LLM APIs, and approved changes, including code refactorings, can be made to documents and committed to the repository. The tool and evaluation data are available from: https://github.com/LiamTodd/github-inclusifier. The demo video is available at: https://www.youtube.com/watch?v=1z1QKdQg-nM.
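The "find" step described in the abstract can be pictured with a minimal sketch. This is not the tool's implementation: the term list, the `Finding` record, and the `scan` function are hypothetical names chosen for illustration, and the sketch assumes a simple dictionary lookup with whole-word, case-insensitive matching on plain text.

```python
import re
from dataclasses import dataclass

# Hypothetical term list for illustration only; NOT the tool's actual dictionary.
TERMS = {
    "whitelist": "allowlist",
    "blacklist": "denylist",
    "sanity check": "confidence check",
}


@dataclass
class Finding:
    line_no: int      # 1-based line number in the scanned text
    term: str         # the flagged term, as listed in TERMS
    suggestion: str   # a candidate replacement for a reviewer to approve


def scan(text: str) -> list[Finding]:
    """Return one Finding per (line, term) pair where the term occurs."""
    findings = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for term, suggestion in TERMS.items():
            # Whole-word, case-insensitive match.
            if re.search(rf"\b{re.escape(term)}\b", line, flags=re.IGNORECASE):
                findings.append(Finding(line_no, term, suggestion))
    return findings
```

Each finding could then be turned into an issue for human review. In this sketch the suggestions come from a fixed table, whereas the abstract describes generating suggested fixes through third-party LLM APIs.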

Original language: English
Title of host publication: 2024 ACM/IEEE 46th International Conference on Software Engineering: Companion Proceedings, ICSE-Companion 2024
Editors: Ana C.R. Paiva, Rui Abreu
Place of Publication: New York NY USA
Publisher: IEEE, Institute of Electrical and Electronics Engineers
Pages: 89-93
Number of pages: 5
ISBN (Electronic): 9798400705021
Publication status: Published - 2024
Event: International Conference on Software Engineering 2024 - Lisbon, Portugal
Duration: 14 Apr 2024 to 20 Apr 2024
Conference number: 46th
https://dl.acm.org/doi/proceedings/10.1145/3597503 (Proceedings)
https://conf.researchr.org/home/icse-2024 (Website)

Conference

Conference: International Conference on Software Engineering 2024
Abbreviated title: ICSE 2024
Country/Territory: Portugal
City: Lisbon
Period: 14/04/24 to 20/04/24

Keywords

  • biased language
  • inappropriate language
  • software documentation
  • inclusive language
  • refactoring
  • software maintenance tools
