Training-Free Looped Transformers
UT Austin — Lizhang Chen, Jonathan Li, Chen Liang, Ni Lao, Qiang Liu. A lightweight inference-time wrapper that loops frozen checkpoint layers. +2.64 pp MMLU-Pro on Qwen3-4B, 87% non-negative across 45 evaluation cells, ~0% overhead in bypass mode.





围绕这条内容继续补充观点或上下文。