Sheli Kohan

Alice
Vector: Adaptive Agentic Distillation for Production AI Safety
Sheli Kohan

Abstract

AI systems in production evolve continuously. New jailbreaks appear, misuse patterns shift, and guardrails drift.
Static datasets and one-time fine-tuning cannot keep pace.
In this talk, I present Vector, a production-grade agentic teacher–student distillation framework that converts real-world failures into structured learning signals for adaptive AI safety.
Vector operates as a closed feedback loop. A Teacher model generates adversarial and policy-aligned examples based on observed gaps. A deployment-optimized Student model retrains on evolving behavior distributions. An orchestration layer detects structured failure clusters, identifies weak decision boundaries, and triggers targeted data generation, rebalancing, and retraining.
Distillation becomes a control system rather than a one-time compression step.
All data generation and retraining cycles are traceable and governed. Trade-offs such as recall versus precision, coverage versus overfitting, and synthetic expansion versus drift are explicitly managed.
The key insight is that AI safety at scale is not only a modeling problem. It is a feedback systems problem.
This session provides a practical blueprint for building traceable, adaptive guardrails for production AI, with lessons applicable to personalization, domain adaptation, and continuously evolving systems.

Bio

Sheli Kohan is a Data Scientist at Alice (formerly ActiveFence), where she focuses on GenAI security, trust, and safety. Over the past two years, she has led production-grade data science initiatives protecting user-facing generative AI systems from jailbreaks, misuse patterns, and evolving policy risks. She holds an M.Sc. in Data Science and brings over seven years of experience developing and deploying advanced AI and NLP systems in enterprise environments, with expertise spanning research, experimentation, and large-scale production deployment, and a strong emphasis on robustness, scalability, and real-world impact.

Agenda

08:45

Reception & gathering

09:30

Opening remarks by WiDS TLV ambassadors

09:45

Keynote session: Prof. Michal Rosen Zvi

10:15

Keynote session: Hadas Grossmon Ella

10:45

Poster pitches

10:55

Break

11:10

Lightning talks session

12:45

Lunch & poster session

13:30

Roundtable session & poster session

14:20

Roundtable closing

14:30

Talk by Hila Paz

14:50

Talk by Dr. Moran Mizrahi

15:15

Closing remarks

15:30

End