Dr. Kinneret Misgav

ivrit.ai: Building World-Class AI for Hebrew
Kinneret Misgav

Abstract

At ivrit.ai, a non-profit initiative, we’ve successfully developed the world’s leading open-source Hebrew speech-to-text engine. We did it with no budget through volunteer-based efforts. In addition, we developed the largest openly licensed Hebrew speech corpus with over 15,000 hours of diverse content and created a free-to-use transcription service that achieves 8-29% lower error rates than existing solutions.

In this roundtable, we’ll share our journey and then open the discussion to explore:
– How data scientists and AI practitioners are currently working with Hebrew language models
– The key challenges you face when developing AI applications for Hebrew
– Which direction we should take next – from OCR and mixed-language processing to TTS and beyond

Join us to share your experiences, discuss the unique obstacles of Hebrew AI development, and help shape the roadmap for making Hebrew a first-class citizen in the AI world.

Bio

With over 10 years specializing in data research, I currently lead the data research unit at Hadassah Research Fund, where we manage significant projects and foster collaborations with researchers and industry on nuanced medical data endeavors. My research also delves into the intricate interplay between medical treatments and societal factors, including gender, ethnicity, decision-making, and medical education. Beyond this, I’m a proud co-founder and volunteer at ivrit.ai, a pioneering non-profit dedicated to elevating Hebrew in the AI realm through extensive, favorably-licensed datasets. My educational foundation is rooted in a PhD in child psychology, with a notable focus on human text analysis and NLP.

Agenda

8:45 Reception
9:30 Opening remarks by WiDS TLV ambassadors
9:45 Dr. Mor Geva , Tel Aviv University: “MRI for Large Language Models: Mechanistic Interpretability from Neurons to Attention Heads”
10:15 Panel: “Pioneering Progress: a strategic look at the GenAI revolution and the new role of data scientists“
Shani Gershtein, Melingo
Mirit Elyada Bar, Intuit
Dr. Asi Messica, Lightricks
Moderated by Nitzan Gado, Intuit
10:45 Poster pitches
10:55 Break
11:10 Lightning talks session
12:30 Lunch & poster session
13:30 Roundtable session & poster session
14:30 Roundtable closing
14:40 Shunit Agmon, Technion: “Bridging the Gender Gap in Clinical AI: Temporal Adaptation with TeDi-BERT”
15:00 Shaked Naor Hoffmann, Apartment List: “Building Generative AI Agents for Production: Turning Ideas into Real-World Applications”
15:20 Closing remarks
15:30 The end