Previously submitted to: JMIR AI (no longer under consideration since Jul 28, 2024)
Date Submitted: Aug 18, 2023
Open Peer Review Period: Aug 17, 2023 - Oct 12, 2023
Warning: This is an unreviewed preprint. Readers are warned that the document has not been peer-reviewed by expert/patient reviewers or an academic editor, may contain misleading claims, and is likely to undergo changes before final publication, if accepted, or may have been rejected or withdrawn (a note "no longer under consideration" will appear above).
Citation: Please cite this preprint only for review purposes or for grant applications and CVs (if you are the author).
Final version: If our system detects a final peer-reviewed "version of record" (VoR) published in any journal, a link to that VoR will appear below. Readers are then encouraged to cite the VoR instead of this preprint.
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Auditing Natural Language Processing for Gender Equality in Sub-Saharan African Healthcare Systems: Framework Development and Evaluation
ABSTRACT
Background:
Natural Language Processing (NLP) models are in wide and growing use in clinical and healthcare domains. Such applications enable scalable, efficient delivery of health information, but their effectiveness is prone to equity challenges across demographics and contexts. These models are only as good as the data they are trained on, the training procedure, and the model parameters. Moreover, they are highly sensitive to latent demographic signals such as gender, age, nationality, and native language. Applications built from biased components produce inequitable outcomes, and these equity and accessibility challenges are more prevalent in rural regions of the world.
Objective:
This paper describes and evaluates a novel active learning approach for incrementally improving the accuracy of an NLP model while optimizing for gender-equitable outcomes in healthcare systems. The approach employs an iterative cyclic model, incorporating data annotation using NLP, human auditing to improve annotation accuracy, especially for data with demographic segmentation, testing on new data (with intentional bias favoring underperforming demographics), and a loopback system for retraining the model and applying it to new data.
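To make the cycle concrete, the following minimal Python sketch outlines one audit-and-retrain increment. It is an illustration under stated assumptions, not the implementation used in the study: the scikit-learn pipeline, the audit_fn callback, and the two-fold oversampling factor are all placeholders.

# Hypothetical sketch of one active-learning increment: model-assisted
# annotation, human audit, oversampling of the under-performing group,
# and retraining. Helper names and the oversampling factor are assumptions.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

def audit_and_retrain(texts, groups, audit_fn, model=None, boost_group=None):
    # audit_fn(text, predicted_label) -> corrected label (human-in-the-loop).
    if model is None:
        # First increment: no model yet, so auditors label from scratch.
        model = make_pipeline(TfidfVectorizer(), LogisticRegression(max_iter=1000))
        predictions = [None] * len(texts)
    else:
        # Later increments: the current model proposes annotations.
        predictions = list(model.predict(texts))

    # Human auditors correct (or supply) the labels.
    labels = [audit_fn(t, p) for t, p in zip(texts, predictions)]

    # Intentionally bias the training data toward the under-performing group
    # by duplicating its examples (a simple stand-in for the paper's strategy).
    weighted = [(t, y) for t, y, g in zip(texts, labels, groups)
                for _ in range(2 if g == boost_group else 1)]
    X, y = zip(*weighted)

    model.fit(list(X), list(y))   # loop back: retrain before the next batch
    return model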
Methods:
We describe the experimental integration of an audit tool and workflow with distinct NLP tasks in two separate contexts: (1) annotation of medical symptoms collected in Hausa and English, based on responses to a research questionnaire about health access in Northern Nigeria; and (2) message intent classification in English and Swahili, based on spontaneous user messages to a health guide chatbot in Nigeria and Kenya.
Results:
Baseline results showed an equity gap in both precision (P) and recall (R): P=.725 and R=.676 for the over-represented class versus P=.669 and R=.651 for the under-represented class. Application of the active learning tool and workflow mitigated this gap after three increments of auditing and retraining (P=.721 and R=.760 for the under-represented class).
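For reference, per-group precision and recall of the kind reported above can be computed as in the sketch below; the grouping scheme, variable names, and weighted averaging are illustrative assumptions and do not come from the study data.

# Illustrative computation of the per-group precision/recall used to
# quantify the equity gap; group labels here are placeholders.
from sklearn.metrics import precision_score, recall_score

def per_group_scores(y_true, y_pred, groups):
    scores = {}
    for g in sorted(set(groups)):
        idx = [i for i, grp in enumerate(groups) if grp == g]
        yt = [y_true[i] for i in idx]
        yp = [y_pred[i] for i in idx]
        scores[g] = (
            precision_score(yt, yp, average="weighted", zero_division=0),
            recall_score(yt, yp, average="weighted", zero_division=0),
        )
    return scores

# The equity gap is then the difference between the two groups' scores,
# e.g. per_group_scores(y_true, y_pred, groups)["under_represented"].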
Conclusions:
Our findings indicate that this gender-aware audit workflow is language agnostic and capable of mitigating demographic inequity while improving overall system accuracy.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). The authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage the authors and publisher expressly prohibit redistribution of this draft other than for review purposes.