Test-Time Scaling Under Budget
November 21, 2025
M.Sc. Thesis in Computer Science
Tags: LLMs, Bayesian Methods, Reasoning, Compression, Quantization

Psychometric Modeling of LLM Evaluation Datasets
October 27, 2025
A principled Bayesian framework that replaces Pass@k with posterior estimates, credible intervals, and stable rankings for LLM evaluation
Tags: LLMs, Evaluation, Reasoning, Datasets