trieu
@thtrieu_
inventor of #alphageometry. thinking about thinking @ deepmind.
Mountain View
Joined April 2014
Posts
  • Pinned
    Mathematicians 🤝 AI researchers arxiv.org/abs/2601.22401. Our take on AI solving Erdős problems: * Many "Open" problems are actually just obscure: in many cases the AI didn't find something new, only rediscovered solutions buried in the literature. We present our systematic approach
    Replying to @lmthang
    Here's the paper link to our scaled effort for tackling Erdős problems. We started with 700 problems marked ‘Open’ in the database. Our agent #Aletheia identified potential solutions to 200 problems. Initial human grading revealed 63 correct answers, followed by deep expert
    arxiv.org
    Semi-Autonomous Mathematics Discovery with Gemini: A Case Study on...
    We present a case study in semi-autonomous mathematics discovery, using Gemini to systematically evaluate 700 conjectures labeled 'Open' in Bloom's Erdős Problems database. We employ a hybrid...
  • Proud of this work. Here's my 22-minute video explanation of the paper: youtube.com/watch?v=TuZhU1…
    Introducing AlphaGeometry: an AI system that solves Olympiad geometry problems at a level approaching a human gold-medalist. 📐 It was trained solely on synthetic data and marks a breakthrough for AI in mathematical reasoning. 🧵 dpmd.ai/alphageometry
  • We show that large language models trained on massive text corpora (LM1b, CommonCrawl, Gutenberg) can be used for commonsense reasoning and obtain SOTA on the Winograd Schema Challenge. Paper at arxiv.org/abs/1806.02847, results reproducible at git.io/vhaaf
  • As also observed by OpenAI's GPT-2, training data quality is important. We release the STORIES corpus introduced in our work arxiv.org/abs/1806.02847. The corpus, a high-quality subset of CommonCrawl with a total of ~7B words (~32GB), can be found here: console.cloud.google.com/storage/browse…
  • Wow! An old project of mine github.com/thtrieu/darkfl… is now the 7th most popular Machine Learning project across all of GitHub in 2018, alongside TensorFlow and scikit-learn? I really need to spend some time polishing it now...
    From the programming languages you used most to the most popular data science packages, we’re digging into the data on Machine Learning from 2018. Find out what we discovered github.blog/2019-01-24-the…
  • Human reasoning is not manipulating symbolic expressions
    Replying to @CIFAR_News
    What is thought?: Big questions from CIFAR Distinguished Fellow Geoffrey Hinton #dlrl2018
  • Had the chance to sit next to Daniel @xpearhead in the early days of the project and tried out the interactive Meena. It has always been *this* surprising and funny :) BIG congrats to the team on this publication. The possibilities to build on from here are endless.
    um.. google's latest chatbot is 😳 arxiv.org/abs/2001.09977
  • Our work on learning longer-term dependencies has been accepted at @icmlconf #icml2018
    Excited to share a new work by #GoogleAI resident @thtrieu_ (with @andrewdai, me, & Quoc Le) on training very long RNNs (up to 16K long). See paper for extreme cases of zero or little backprop on RNNs ;) arxiv.org/pdf/1803.00144…
  • I'll be presenting my work at #ICLR2018 on Wednesday. Come and have a chat :)
  • "We see contributions to traditional conferences and publications in journals as an important part of our work, but also support efforts that go 'beyond the research paper'."
    I'm excited to finally share what I have been working on. Today we are officially launching Cohere For AI @forai_ml a non-profit research lab that aims to reimagine how, where, and by whom research is done. cohere.for.ai
  • A commonsense reasoning task is "solved" even before its official introduction.
    It's amazing how fast #NLProc is moving these days. We have now reached super-human performance on SWAG, a commonsense task that will only be introduced at @emnlp2018 in November! We need even more challenging tasks! BERT: arxiv.org/abs/1810.04805 SWAG: arxiv.org/abs/1808.05326
  • 10M tokens with almost perfect needle-in-a-haystack results, amazing.
    Gemini 1.5 Pro - A highly capable multimodal model with a 10M token context length Today we are releasing the first demonstrations of the capabilities of the Gemini 1.5 series, with the Gemini 1.5 Pro model. One of the key differentiators of this model is its incredibly long
  • Yes, the fact that we can do this means the geometry we considered is quite narrow.
    Making a synthetic dataset of mathematical proofs is hard! It's easy to make a whole lot of "1+1+1+...=491" style theorems. I'm surprised this method of random construction and transformation finds so many classical geometric theorems. Maybe because the domain is somewhat
  • Want to see space-time contraction/dilation? This series on Special Relativity (SR) is beautiful. The author squashed space-time to 2D, explained the two postulates by geometric intuition, and ran a simulator on top of it. It is the 3Blue1Brown of SR!