Kangwook Lee

@Kangwook_Lee

CAIO @KRAFTON_AI / CTO @LudoRobotics (Prev) Associate Professor @UWMadisonECE, PhD @Berkeley_EECS

Joined July 2009

Pinned
Kangwook Lee
@Kangwook_Lee
May 9
A horse could not build its own harness. AI can. Let the smart horses build their own harness.
Kangwook Lee
@Kangwook_Lee
May 9
Article
Why We Should Stop Designing Harnesses for AI Agents
(For those who aren't familiar with "the horse-carriage analogy", please read my recent article first) In this article, I want to explain why we all should stop hand-designing harnesses for AI agents....
11K
Kangwook Lee
@Kangwook_Lee
Oct 21, 2025
A more serious thread on the DeepSeek-OCR hype / serious misinterpretation going on. 1. On token reduction via representing text in images, researchers from Cambridge have previously shown that 500x prompt token compression is possible (ACL'25, Li, Su, and Collier). Without
90K
Kangwook Lee
@Kangwook_Lee
Sep 26, 2024
🚀 Excited to share our latest research on Looped Transformers for Length Generalization! TL;DR: We trained a Looped Transformer that dynamically adjusts the number of iterations based on input difficulty—and it achieves near-perfect length generalization on various tasks! 🧵👇
101K
Kangwook Lee
@Kangwook_Lee
Oct 4, 2024
🚀 Excited to share our work on Encoder-only Next Token Prediction (ENTP)! While most successful LLMs are decoder-based, we asked: Can encoder-only TFs be used for next-token prediction? Yes! Moreover, ENTP might be better than decoder-only models!!! 😎
62K
Kangwook Lee
@Kangwook_Lee
Aug 7, 2025
Q. Prove using an LLM-as-a-judge still doesn't work A.
61K
Kangwook Lee
@Kangwook_Lee
Jun 14, 2022
😎! Finetuning a pretrained lang model (e.g., GPT3) has become a popular approach to solve many text-based tasks. This paradigm is making ML very accessible as all you need to prepare is text data for finetuning. Does it also work for non-text tasks? Surprisingly, yes!!! (1/8)
Kangwook Lee
@Kangwook_Lee
Feb 28, 2025
1/ Super excited to share our new work “LLM-Lasso,” led by my collaborators from Stanford! tldr; We've reimagined the classic Lasso algorithm (by @robtibshirani), which uses ℓ1 regularization to select a sparse subset of features!
33K
Kangwook Lee
@Kangwook_Lee
Oct 16, 2025
DLLMs seem promising... but parallel generation is not always possible. Diffusion-based LLMs can generate many tokens at different positions at once, while most autoregressive LLMs generate tokens one by one. This makes diffusion-based LLMs highly attractive when we need fast
67K
Kangwook Lee
@Kangwook_Lee
Mar 26, 2025
Is 4o Image Generation really good at native in-context learning (as written on the whiteboard 😀)? About a year ago, @yzeng58 et al. proposed a very challenging Text-to-Image in-context learning benchmark called CoBSAT... All models completely failed it. 4o crushed it. 🧵
50K
Kangwook Lee
@Kangwook_Lee
Oct 15, 2024
🚀 Excited to share our latest research: "Parameter-Efficient Fine-Tuning of SSMs" Summary: 🧵
55K
Kangwook Lee
@Kangwook_Lee
Mar 12, 2024
🧵Let me explain why the early ascent phenomenon occurs🔥 We must first understand that in-context learning exhibits two distinct modes. When given samples from a novel task, the model actually learns the pattern from the examples. We call this mode the "task learning" mode.
72K
Kangwook Lee
@Kangwook_Lee
Sep 2, 2025
Happy to share that I got tenured last month! While every phase in life is special, this one feels a bit more meaningful, and it made me reflect on the past 15+ years in academia. I'd like to thank @UWMadison and @UWMadisonECE for tremendous support throughout the past six
24K
Kangwook Lee
@Kangwook_Lee
May 22, 2023
1/10: The summer break is the perfect time to share recent research from my lab. Our first story revolves around a fresh interpretation of diffusion-based generative modeling by my brilliant student @yingfan_bot. She proposed "diffusion models are solving a control problem".
43K
Kangwook Lee
@Kangwook_Lee
Mar 19, 2024
I'm honored to receive the NSF CAREER Award! Our group will develop a unified theory and new algorithms with provable guarantees for learning with frozen pretrained models, also known as foundation models. Huge thanks to NSF and my amazing collaborators and students! 🥳
22K