🎞️ SlidePsychometric Modeling of LLM Evaluation DatasetsOctober 27, 2025A principled Bayesian framework that replaces Pass@k with posterior estimates, credible intervals, and stable rankings for LLM evaluation