Are ViTs secretly RNNs? #ICLR2026
Our 2-block recurrent transformer recovers 96% of DINOv2’s IN-1k accuracy & reproduces its activations 1-to-1, motivating the Block-Recurrent Hypothesis: arxiv.org/abs/2512.19941
w/ @thomas_fel_ @RichieHakim @ABrondetta Demba Ba @t_andy_keller