Agents need servers not computers: The emergence of hyper-personalized software. Why localhost is dead and why it's all exciting.
spolu.sh/notes/20260522…
ChatGPT is not (only) about the model, it’s (mainly) about the UI/X. Many shared their amazement at capabilities that have been here for year(s); because ChatGPT’s UI made them accessible… Reinforcing the idea that AI deployment is more about product than research at this point.
Posted my first paper on arXiv💥🙌
GPT-f is a Transformer-based automated theorem prover. We show that Transformer + Search is suitable to formal reasoning and continuous self-improvement 🦾
arxiv.org/abs/2009.03393
When I started this project 2 years ago I couldn't have dreamt of us getting that far. But this is also only the beginning💥
Some thoughts on what we achieved so far 🧵
For the past couple weeks I've been experimenting with a new way to interact with LLMs: a GPT-based assistant that has access to my browser tabs content.
It's called XP1 🧵
... and It's now available here:
Every now and then I need to remind myself of the details of the self-attention mechanism.
This picture (the dimensions in particular) is generally ~all I need to recover everything.
Also this primer is fantastic, among the best I’ve seen: aman.ai/primers/ai/tra…
I’ll make a bold statement: creating a delightful AI assistant is not anymore a problem of getting smarter models. It is a now product problem.
Better models will help but the main blocker is 100% a product problem at this point.
Yesterday was my last day at OpenAI. I had the most fantastic experience there over the past 3 years. I deeply believe everything I say in my farewell email. @OpenAI truly is an exceptional place to work on AI.
This is Day 0 of a new project for me, code-named "Dust" 🚀
This paper is quite under-rated IMHO: arxiv.org/pdf/2103.05247…
It shows that fine-tuning only the input/output layers of a large pre-trained LM leads to comparable performance to fine-tuning.
Related is Learned soft-prompts: ai.googleblog.com/2022/02/guidin…