Selected recent publications
- Testoni, A., Calixto, I., Calibrated? Not for Everyone: How Sexual Orientation and Religious Markers Distort LLM Accuracy and Confidence in Medical QA. To appear in Proceedings of The 64th Annual Meeting of the Association for Computational Linguistics, ACL 2026. Paper
- Testoni, A., Calixto, I., Mind the Gap: Benchmarking LLM Uncertainty and Calibration with Specialty-Aware Clinical QA and Reasoning-Based Behavioural Features. (Outstanding Paper Award) Proceedings of the 19th Conference of The European Chapter of the Association for Computational Linguistics, EACL 2026. Paper
- Testoni, A., Plank, B., Fernández, R., RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs. Proceedings of The 2025 Conference on Empirical Methods in Natural Language Processing, EMNLP 2025. Paper