Almog Gueta

Google, Technion
LLMs Accelerate Annotation for Medical Information Extraction

Bio

Almog Gueta is an MSc student in Data Science and Engineering at the Technion, under the supervision of Professor Roi Reichart. Her research focuses on Computational Social Science, exploring whether LLMs can learn Macroeconomic Narratives from Social Media. She is an intern in Google Research in medical NLP where she tackled real-world healthcare challenges. In addition to her studies, Almog is a lecturer for the Data Science courses at the Technion Continuing Education Unit.

Abstract

The unstructured nature of clinical notes within electronic health records often conceals vital patient-related information, making it challenging to access or interpret. To uncover this hidden information, specialized Natural Language Processing (NLP) models are required. However, training these models necessitates large amounts of labeled data, a process that is both time-consuming and costly when relying solely on human experts for annotation. In this paper, we propose an approach that combines Large Language Models (LLMs) with human expertise to create an efficient method for generating ground truth labels for medical text annotation. By utilizing LLMs in conjunction with human annotators, we significantly reduce the human annotation burden, enabling the rapid creation of labeled datasets. We rigorously evaluate our method on a medical information extraction task, demonstrating that our approach not only substantially cuts down on human intervention but also maintains high accuracy. The results highlight the potential of using LLMs to improve the utilization of unstructured clinical data, allowing for the swift deployment of tailored NLP solutions in healthcare.

Paper: https://proceedings.mlr.press/v225/goel23a/goel23a.pdf
Published in ML4H 2023, the health symposium of NeurIPS.

Agenda

8:45 Reception
9:30 Opening remarks by WiDS TLV ambassadors Noah Eyal Altman, Or Basson, and Nitzan Gado
9:45 Dr. Aya Soffer, IBM: "Putting Generative AI to Work: What Have We Learned So Far?"
10:15 Prof. Reut Tsarfaty, Bar-llan University: "Will Hebrew Speakers Be Able to Use Generative AI in Their Native Tongue?"
10:45 Break
11:00 Lightning talks
12:20 Lunch & poster session
13:20 Roundtable session & poster session
14:05 Roundtable closing
14:20 Break
14:30 Dr. Orna Amir & Hila Kantor, Google: "A User-Centric Framework for Quantifying Notification Harm"
14:50 Naomi Ken Korem, Lightricks: "Mastering the Art of Generative Models: Training and Controlling Text-to-Video Models"
15:10 Dr. Yael Mathov, Intuit: "Surviving the AI-pocalypse: Your Guide to LLM Security"
15:30 Closing remarks
15:40 The end

WiDS TLV important update

Dear WiDS TLV attendees,

In light of recent developments, we regret to inform you that the WIDS TLV 2024 event scheduled for tomorrow has been postponed to June 3rd, 2024. We apologize for this last-minute change and look forward to seeing all of you on June 3rd.