Tom McCoy (@RTomMcCoy) / X

Tom McCoy

@RTomMcCoy

Assistant professor @YaleLinguistics. Studying computational linguistics, cognitive science, and AI. He/him.

New Haven, CT

Joined December 2018

Pinned
Tom McCoy
@RTomMcCoy
Oct 10, 2024
🤖🧠NOW OUT IN PNAS🧠🤖 Language models show many surprising behaviors. E.g., they can count 30 items more easily than 29. In Embers of Autoregression, we explain such effects by analyzing what LMs are trained to do. pnas.org/doi/10.1073/pn… Major updates since the preprint! 1/n
Tom McCoy
@RTomMcCoy
Sep 26, 2023
🤖🧠NEW PAPER🧠🤖 Language models are so broadly useful that it's easy to forget what they are: next-word prediction systems. Remembering this fact reveals surprising behavioral patterns: 🔥Embers of Autoregression🔥 (counterpart to "Sparks of AGI") arxiv.org/abs/2309.13638 1/8
Tom McCoy
@RTomMcCoy
Jul 18, 2022
It has become acceptable for acronyms to use any letters within a word, not just the first letter. E.g., ORNATE = acrOnyms fRom noN-initial chAracTErs. But why stick with whole letters? In my new paradigm CLIP, an acronym can use any curves or line segments from the base phrase!
Tom McCoy
@RTomMcCoy
Jan 12, 2020
How am I only learning now that Latvia's prime minister has a PhD in linguistics from Penn?? I've seen many lists of "jobs for linguists outside academia" but they never include Prime Minister of Latvia.
Tom McCoy
@RTomMcCoy
Apr 10, 2020
Linguists: In case you could use a diversion, I've made a phonetic crossword - all the answers must be written in the IPA, one phoneme per square. (Non-linguists: Here's a chance to learn some phonetics!) Puzzle: rtmccoy.com/crosswords/cha… Answers: rtmccoy.com/crosswords/cha…
Tom McCoy
@RTomMcCoy
May 4, 2022
🤖🧠NEW PAPER🧠🤖 What explains the dramatic recent progress in AI? The standard answer is scale (more data & compute). But this misses a crucial factor: a new type of computation. Shorter opinion piece: arxiv.org/abs/2205.01128 Longer tutorial: microsoft.com/en-us/research… 1/5
Tom McCoy
@RTomMcCoy
May 30, 2023
🤖🧠NEW PAPER🧠🤖 Bayesian models can learn rapidly. Neural networks can handle messy, naturalistic data. How can we combine these strengths? Our answer: Use meta-learning to distill Bayesian priors into a neural network! Paper: arxiv.org/abs/2305.14701 1/n
Tom McCoy
@RTomMcCoy
Nov 19, 2021
*NEW PREPRINT* Neural-network language models (e.g., GPT-2) can generate high-quality text. Are they simply copying text they have seen before, or do they have generalizable linguistic abilities? Answer: Some of both! Paper: arxiv.org/abs/2111.09509 1/n
Tom McCoy
@RTomMcCoy
May 20, 2025
🤖🧠Paper out in Nature Communications! 🧠🤖 Bayesian models can learn rapidly. Neural networks can handle messy, naturalistic data. How can we combine these strengths? Our answer: Use meta-learning to distill Bayesian priors into a neural network! nature.com/articles/s4146… 1/n
Tom McCoy
@RTomMcCoy
Oct 17, 2020
Transformers are the current state of the art, but one day LSTMs may overtake them. That would make LSTMs current again. You could even say…re-current.
Tom McCoy
@RTomMcCoy
Jan 7, 2020
Flying home from #LSA2020? Remember to put your liquids in a separate bag!
Tom McCoy
@RTomMcCoy
Nov 14, 2024
🤖🧠 I'll be considering applications for postdocs & PhD students to start at Yale in Fall 2025! If you are interested in the intersection of linguistics, cognitive science, & AI, I encourage you to apply! Postdoc link: rtmccoy.com/prospective_po… PhD link: rtmccoy.com/prospective_st…
Tom McCoy
@RTomMcCoy
Jun 9, 2022
Excited to share some updates, which all still feel surreal:
- Just defended my dissertation advised by @TalLinzen & @Paul_Smolensky!
- Next up: Postdoc w/ Tom Griffiths @cocosci_lab!
- Then joining @YaleLinguistics as an asst prof w 2ndary appt @YaleCompsci!
A thank-you thread:
Tom McCoy
@RTomMcCoy
Dec 19, 2019
Takeaways from #NeurIPS:
1) In-distribution generalization is out
2) Out-of-distribution generalization is in
3) We want compositionality (whatever it is)
4) "GPT-2" is very hard to say