Virtual Agentic Lab!
SCIPE Workshop on Large Language Models • Final Presentation
SCIPE Workshop on Large Language Models • Final Presentation
SCIPE Workshop on LLMs - Day 3
SCIPE Workshop on LLMs - Day 2
SCIPE Workshop on LLMs
M.Sc. Thesis in Computer Science
A principled Bayesian framework that replaces Pass@k with posterior estimates, credible intervals, and stable rankings for LLM evaluation