LPDS: Evaluating LLM Robustness Through Logic-Preserving Difficulty Scaling
Philipp Mondorf, Samuel J. Bell, Jesse Dodge, and Dieuwke Hupkes. 2026. LPDS: Evaluating LLM Robustness Through Logic-Preserving Difficulty Scaling. arXiv preprint arXiv:2605.15393.