Francesco Orabona (@bremen79) / X

Francesco Orabona

2,567 posts

Francesco Orabona

@bremen79

Dad and associate professor at @KAUST_News. Formerly @BU_ece, @sbucompsc, @YahooResearch, @TTIC_Connect. ML theory&practice, obsessed with history of science

Thuwal, Saudi Arabia

Joined February 2010

Pinned
Francesco Orabona
@bremen79
Apr 1
Article
"Sharp minima can generalize for deep nets" or why the community does not learn
Every few months in machine learning, we have this recurrent debate about the role of sharpness in generalization. The short story is that the solutions found by SGD seem to be in "flat" areas of the...
24K
Francesco Orabona
@bremen79
Mar 24, 2021
For the future PhD students that are deciding in these days which offer to accept, let me give you a very simple advice: Avoid choosing a**holes as supervisors. It does not matter how famous they are or how prestigious is the school, you'll regret it and they don't deserve you.
Francesco Orabona
@bremen79
Jan 7, 2022
When you debug a machine learning model
00:00
Francesco Orabona
@bremen79
Dec 6, 2020
New blog post: Neural Networks (Maybe) Evolved to Make ADAM the Best Optimizer For the first time, I wrote a blog post without math :) I discuss a *conjecture* I have regarding Adam and the way the deep learning community produces new ideas
Neural Networks (Maybe) Evolved to Make Adam The Best Optimizer
From parameterfree.com
Francesco Orabona
@bremen79
Jan 1, 2020
As promised, I compiled all my lecture notes on Online Learning in a single PDF. Feedback is welcome! "A Modern Introduction to Online Learning" arxiv.org/abs/1912.13213 PS Happy New Year!
arxiv.org
A Modern Introduction to Online Learning
In this book, I introduce the basic concepts of Online Learning through the modern view of Online Convex Optimization. Here, online learning refers to the framework of regret minimization under...
Francesco Orabona
@bremen79
May 10, 2022
You cannot take the logarithm of the Lipschitz constant of a function! A 🧵 about a super common mistake in ML papers 1/10
Francesco Orabona
@bremen79
Feb 2, 2021
It is now official: I got the @NSF CAREER award for "Parameter-free Optimization Algorithms for Machine Learning"! Super happy :) Soon, I'll tweet about an exciting new result about parameter-free algorithms for non-convex functions: I had this idea writing this proposal :)
Francesco Orabona
@bremen79
Apr 22, 2025
This is a turning point: I just proved a complex math result useful for my research using an LLM. I am not sure if I should be happy or scared...
98K
Francesco Orabona
@bremen79
May 2, 2025
New version of my book on Online Learning! arxiv.org/abs/1912.13213 This is a BIG update, with 43 pages of additional content. Let me list some of the changes. 🧵1/8
arxiv.org
A Modern Introduction to Online Learning
In this book, I introduce the basic concepts of Online Learning through the modern view of Online Convex Optimization. Here, online learning refers to the framework of regret minimization under...
39K
Francesco Orabona
@bremen79
Jul 2, 2020
Dear PhD students with a submission to @NeurIPSConf, soon ~20% of you will receive a desk reject. Here some suggestions to deal with it in a healthy way. First, this is not personal: a paper written by you was rejected, not you. Keep your self-worth unlinked from your work. 1/5
Francesco Orabona
@bremen79
Aug 30, 2023
New blog post: Yet Another ICML Award Fiasco The story of the @icmlconf 2023 Outstanding Paper Award to the D-Adaptation paper with worse results that the ones from 9 years ago Please share it to start a needed conversation on mistakenly granted awards
Yet Another ICML Award Fiasco
From parameterfree.com
186K
Francesco Orabona
@bremen79
May 28, 2025
As promised, we put on Arxiv the proof we did with Gemini. arxiv.org/pdf/2505.20219 This shows that the Polyak stepsize not only will not reach the optimum, but it can cycle, when used without the knowledge of f*. Gemini failed when prompted directly ("Find an example where the
Francesco Orabona
@bremen79
Apr 22, 2025
This is a turning point: I just proved a complex math result useful for my research using an LLM. I am not sure if I should be happy or scared...
70K
Francesco Orabona
@bremen79
Dec 2, 2020
Unpopular opinion: "Optimization for Deep Learning" should be removed as a subtopic in ML conferences. Numerical optimization is estabilished field: we don't need a worse variant of it with algorithms justified by "intuition", rarely converging, and not properly evaluated.
Francesco Orabona
@bremen79
May 15, 2021
Given that even @BU_ece changed my title on the website, I guess it is now official: I have been awarded tenure and promoted to Associate Professor This was a long and stressful journey, but I f*cking did it!!! 💪🥳🎉