user avatar
Nando de Freitas
@NandoDF
I seek to understand intelligence, agency and awareness, and build AI aligned with compassion, freedom, universal human empowerment, and progress science.
London, England
Joined April 2009
Posts
  • user avatar
    The Llama 3 paper is a must-read for anyone in AI and CS. It’s an absolutely accurate and authoritative take on what it takes to build a leading LLM, the tech behind ChatGPT, Gemini, Copilot, and others. The AI part might seem small in comparison to the gargantuan work on *data*
    Why do 16k GPU jobs fail? The Llama3 paper has many cool details -- but notably, has a huge infrastructure section that covers how we parallelize, keep things reliable, etc. We hit an overall 90% effective-training-time. ai.meta.com/research/publi…
  • user avatar
    Hmmm, from what I see my colleagues in AI at Google London work bloody long ours and are extremely committed. This guy once came to London and told us to abandon Torch and use TensorFlow. That set the field of AI back by at least 6 months.
  • user avatar
    I’ve walked through poor neighbourhoods in India, Africa and LatAm many times. Yet, I recently walked through one of the most depressing ones in terms of poverty, drug abuse, and sheer hopelessness: San Francisco. Giant tech AI companies promise to make the world a better
  • user avatar
    I believe I have written more papers than Alan Turing + John Nash! Numbers of papers alone is a wrong misleading metric. Please focus instead on writing good papers that advance the field, help the world, and that you’ll be proud of when you look back in 20 or 50 years.
    Yes, @GoogleAI (well, all of @AlphabetINC) produces a lot of awesome AI research, but @Stanford + @MIT together produce more (judging by @NeurIPSConf papers!), and @Stanford + @MIT + @UCBerkeley + @CarnegieMellon produces more than @AlphabetINC + @Microsoft + @facebook
  • user avatar
    It’s time to say thank you and goodbye to @GoogleDeepMind. I had the immense fortune of working there for 10 years. They were undoubtedly the most exciting years in the history of AI, and I feel that I grew beyond all my expectations thanks to my uniquely smart, generous and
  • user avatar
    Can AI researchers please tweet: I am against racism, sexism, bullying and cancelling, and I believe in improving diversity, equity and inclusion in our AI community. We need to hear your voices! The students and the public need to know what most believe.
  • user avatar
    RL is not all you need, nor attention nor Bayesianism nor free energy minimisation, nor an age of first person experience. Such statements are propaganda. You need thousands of people working hard on data pipelines, scaling infrastructure, HPC, apps with feedback to drive
  • user avatar
    I still remember that 2013 @NeurIPSConf party with Mark Zuckerberg. He had a bottle of water at that first Neurips corporate party. I thought it was out of character for a Neurips party - what was the matter with this kid? And why did he speak like that? We were so naive! …
  • user avatar
    View point invariance is an important inductive bias in how we perceive objects - here tested to the limit by a smart artist.
  • user avatar
    There appears to be a mismatch between publishing criteria in AI conferences and "what actually works". It is easy to publish new mathematical constructs (e.g. new models, new layers, new modules, new losses), but as Apple's MM1 paper concludes: 1. Encoder Lesson: Image
  • user avatar
    Mike Jordan defending ML engineering! Good engineering gave us the GPU convnets, the transformers, torch, numpy, etc. The popular diminishing meme “it’s-just-engineering” is silly, and holds us back. I ❤️ creative, rigorous, robust, safe engineering.
  • user avatar
    I’ve joined @Microsoft AI to advance the frontier of large scale multimodal AI research and to build products for people to achieve meaningful goals and dreams. The MAI team is small, but well resourced and ambitious. We are now looking for exceptional ICs, who like to ship. If
  • user avatar
    Game over. Scale is essential to AI.
  • user avatar
    Nvidia released the megatron language model before the pandemic. It’s amazing how influential this paper became. A must read for people wanting to learn about AI.