Shunit Agmon

Bar-Ilan University
Bridging the Gender Gap in Clinical AI: Temporal Adaptation with TeDi-BERT
Shunit Agmon

Abstract

Clinical research as a corpus contains years’ worth of biases, one of the most prominent being the longstanding underrepresentation of women in clinical trials. Even in obstetrics and gynecology, where the research subjects are only women, there is a bias in the topics chosen for research: they focus more on the reproductive role of women, the successful completion of pregnancy and fetus health, rather than on women’s health.

Machine-learning models trained on clinical trial abstracts may capture and amplify biases in the data. Specifically, word embeddings are models that enable representing words as vectors and are the building block of most natural language processing systems. If word embeddings are trained on clinical trial abstracts, predictive models that use the embeddings will exhibit gender performance gaps.

However, positive trends exist in clinical research, in the increase of women inclusion and in the general quality and practices used. I will present TeDi-BERT, a method to capture these temporal trends through temporal distribution matching on contextual word embeddings (specifically, BERT) and explore its effect on the bias manifested in downstream tasks.
Temporal distribution matching is implemented through an adversarial classifier, trying to distinguish old from new clinical trial abstracts based on their embeddings. The temporal distribution matching acts as a form of domain adaptation from older to more recent clinical trials. I will show an evaluation of our model on two clinical tasks: prediction of unplanned readmission to the intensive care unit, and hospital length of stay prediction.

In both clinical tasks, TeDi-BERT improved performance for female patients, as expected; but it also improved performance for male patients. Our results show that accuracy for one gender does not need to be exchanged for bias reduction, but rather that good science improves clinical results for all. Contextual word embedding models trained to capture temporal trends can help mitigate the effects of bias that changes over time in the training data.

Bio

Shunit Agmon is a PhD candidate in the Computer Science Department at the Technion – Israel Institute of Technology, under the supervision of Dr. Kira Radinsky and Professor Benny Kimelfeld. Her research focuses on mitigating the effects of social biases on machine learning algorithms. During her doctorate studies she also worked as a research intern at Amazon. Prior to that, she completed her BSc (summa cum laude) and MSc at the Technion as well. Shunit is a two-time recipient of the student research prize for cross-discipline collaboration in data science, funded by the Israeli planning and budgeting committee.

Agenda

8:45 Reception
9:30 Opening remarks by WiDS TLV ambassadors
9:45 Dr. Mor Geva , Tel Aviv University: “MRI for Large Language Models: Mechanistic Interpretability from Neurons to Attention Heads”
10:15 Panel: “Pioneering Progress: a strategic look at the GenAI revolution and the new role of data scientists“
Shani Gershtein, Melingo
Mirit Elyada Bar, Intuit
Dr. Asi Messica, Lightricks
Moderated by Nitzan Gado, Intuit
10:45 Poster pitches
10:55 Break
11:10 Lightning talks session
12:30 Lunch & poster session
13:30 Roundtable session & poster session
14:30 Roundtable closing
14:40 Shunit Agmon, Technion: “Bridging the Gender Gap in Clinical AI: Temporal Adaptation with TeDi-BERT”
15:00 Shaked Naor Hoffmann, Apartment List: “Building Generative AI Agents for Production: Turning Ideas into Real-World Applications”
15:20 Closing remarks
15:30 The end