Lukasz Kaiser (@lukaszkaiser) / X

Lukasz Kaiser

1,334 posts

Lukasz Kaiser

@lukaszkaiser

San Francisco

scholar.google.com/citations?user…

Joined June 2009

Lukasz Kaiser
@lukaszkaiser
Aug 3, 2022
For mathematicians studying Transformers, a very nice concise, precise and complete definition: apronus.com/math/transform…
Lukasz Kaiser
@lukaszkaiser
Sep 12, 2024
I'm so happy to see o1 launch! Leading this research with my colleagues for almost 3 years and working on related ideas even longer convinced me: it's a new paradigm. Models that train hidden CoTs are more powerful than raw Transformers, learn from less data, generalize better.
OpenAI
@OpenAI
Sep 12, 2024
We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. openai.com/index/introduc…
52K
Lukasz Kaiser
@lukaszkaiser
Nov 20, 2023
OpenAI is nothing without its people
135K
Lukasz Kaiser
@lukaszkaiser
Nov 22, 2023
Believe in MIRAcles
OpenAI
@OpenAI
Nov 22, 2023
We have reached an agreement in principle for Sam Altman to return to OpenAI as CEO with a new initial board of Bret Taylor (Chair), Larry Summers, and Adam D'Angelo. We are collaborating to figure out the details. Thank you so much for your patience through this.
153K
Lukasz Kaiser
@lukaszkaiser
Dec 20, 2024
Replying to @apples_jimmy
On the 12th day of shipmas Santa brought to me... AGI in a model called ooo3
14K
Lukasz Kaiser
@lukaszkaiser
Mar 26, 2023
Code Interpreter with GPT4 is magic indeed
ChatGPT + Code Interpreter = Magic | Andrew Mayne
From andrewmayne.com
50K
Lukasz Kaiser
@lukaszkaiser
Sep 23, 2024
o1 is the start of a new paradigm; a lot of work remains which makes it so exciting to do research in this domain
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)
@rao2z
Sep 23, 2024
Replying to @rao2z
tldr; LRM o1's AlphaGo-like RL-training & inference, if only on pseudo CoT moves rather than task specific ones, does seem to lift it beyond the approximate retrieval nature of LLMs to a sort of approximate reasoning. This is however sans guarantees, and orders higher cost. 9/
26K
Lukasz Kaiser
@lukaszkaiser
Nov 19, 2023
♥️
Sam Altman
@sama
Nov 19, 2023
i love the openai team so much
50K
Lukasz Kaiser
@lukaszkaiser
Dec 20, 2024
So long AIME, we hardly used you as a benchmark and now you're gone... At least in a good company with ARC-AGI.
OpenAI
@OpenAI
Dec 20, 2024
Day 12: Early evals for OpenAI o3 (yes, we skipped a number) openai.com/12-days/?day=12
14K
Lukasz Kaiser
@lukaszkaiser
Sep 16, 2024
When you know the right CoT you can compute anything. As for learning good CoTs, o1 is a start :)
Denny Zhou
@denny_zhou
Sep 16, 2024
What is the performance limit when scaling LLM inference? Sky's the limit. We have mathematically proven that transformers can solve any problem, provided they are allowed to generate as many intermediate reasoning tokens as needed. Remarkably, constant depth is sufficient.
15K
Lukasz Kaiser
@lukaszkaiser
Oct 17, 2024
Honored to have been awarded the 2024 @NEC C&C Prize together with my wonderful friends and Transformer coauthors @ashVaswani @NoamShazeer @nikiparmar09 @ilblackdragon @aidangomez @kyosu @YesThisIsLion Learn more:
nec.com
NEC C&C Foundation Awards 2024 C&C Prize
The NEC C&C Foundation today announced that the 2024 C&C Prize will be awarded to two groups for their contributions to the development and implementation of large-capacity, wavelength division...
23K
Lukasz Kaiser
@lukaszkaiser
Nov 20, 2023
Would 500/700 employees of your company sign to quit to fight for it? Between 1am and 5am on a Sunday night before a company wide holiday on Thanksgiving week?
39K
Lukasz Kaiser
@lukaszkaiser
Sep 12, 2024
We're only beginning to understand this new paradigm of CoT-LLMs. There're so many new phenomena to study, research on it will be very exciting. You know it's a start of something good when your first model (with extra tuning) gets 93% on AIME’24 and does IOI-level coding :).
Steven Heidel
@stevenheidel
Sep 12, 2024
introducing our new reasoning models: o1-preview and o1-mini, available in the API today (no waitlist) for tier 5 users: openai.com/index/learning…
9.9K
Lukasz Kaiser
@lukaszkaiser
Oct 10, 2024
This is one of my favorite ways to evaluate models and a capability that I'm very much looking forward to have in our models.
OpenAI
@OpenAI
Oct 10, 2024
We’re releasing a new benchmark, MLE-bench, to measure how well AI agents perform at machine learning engineering. The benchmark consists of 75 machine learning engineering-related competitions sourced from Kaggle. openai.com/index/mle-benc…
14K