Mikel Bober-Irizar
Subscribe
Sign in
Home
Archive
LLMs struggle with perception, not reasoning, in ARC-AGI
What made o3 so much better than previous models on this benchmark?
Dec 24, 2024
•
Mikel Bober-Irizar
87
5
8
o3 and ARC-AGI: The unsolved tasks
The 34 puzzles that money can't buy.
Dec 21, 2024
•
Mikel Bober-Irizar
40
11
1
Mikel Bober-Irizar
Some musings on AI, gaming, security, machine reasoning and whatever else I'm working on.
Subscribe
Mikel Bober-Irizar
Subscribe
About
Archive
Sitemap
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts