☞ New preprint with @gregd_nlp: arxiv.org/abs/2112.07660
We propose a new search algorithm for neural text generation models to efficiently produce thousands of outputs encoded as a lattice🕸️.
Two key ideas: (1) best first search, (2) hypothesis recombination. Thread 🧵
Jiacheng Xu
118 posts
Researcher at Nvidia. Prev PhD @UTAustin, advised by @gregd_nlp; ex Google and Microsoft intern. Account was recently hacked (June 2025).
- Thrilled to share that three of my papers on text generation and summarization have been accepted at #ACL2023NLP! 🎉 Unable to attend conference in person due to visa issues, but I'll be available on Twitter and Gathertown for discussions.
- Excited to share my ACL21 paper with @gregd_nlp "Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution": arxiv.org/pdf/2106.01518… We propose a two-stage framework to interpret abstractive summarization models’ decisions at each time step.
- In our #EMNLP2020 paper “Understanding Neural Abstractive Summarization Models via Uncertainty“ (w/ @shreydesai @gregd_nlp), we investigate the behavior of abstractive summarization models by measuring the *uncertainty* of the model prediction. arXiv:
- Do you want a text decoding algorithm to discover diverse options and can be customized as you wish? Excited to share our recent ACL paper on text generation, and hope to see you at the Virtual Poster Session 3, tomorrow morning.
- Replying to @qi2peng2 and @xwang_lkACL reviewers should be able to see the NAACL review if the paper was withdrawn from NAACL. I spent a good amount of time reviewing a paper and writing a constructive long review but it was withdrawn without any author's feedback. I feel my work is not appreciated.
- Drago was a great mentor, friend and thought leader in the community. I still remember the time I visited Yale, we met during conferences, and talked about research in summarization. A very kind and friendly person I'll always miss. R.I.P.The #AI community, the #computerscience community, the @YaleSEAS community, and humanity have suddenly lost a remarkable person, @dragomir_radev - kind and brilliant, devoted to his family and friends... gone too soon. A sad day @Yale @YINSedge @YaleCompsci #NLP2023
- Replying to @JiachengNLPCode: github.com/jiacheng-xu/la… Examples: cs.utexas.edu/~jcxu/data/sum… If you’re interested in customizing or reranking generation outputs, we encourage you to give this a try! We’re very excited about the potential of this technique.
- Thanks for your advising and generous support! I am also grateful to be part of the growth of the larger UT NLP group.Tenure! I cannot express how grateful I am for the support I’ve got from family, colleagues, students, and friends & collaborators 🥰 I’m incredibly lucky and honored to belong in @UT_Linguistics, and the larger UT NLP group! liberalarts.utexas.edu/linguistics/ne…
- Looking forward to hosting the Generation BoF session!Replying to @naaclmeetingEthics by @kaiwei_chang : 11-noon in 502 Cowlitz Generation by @JiachengNLP : 11-noon in 506 Samish Semantics/NLU by @ssshanest : 2-3 pm in 501 Chiwawa Multimodal by @LHung1610 : 2-3 pm in 506 Samish Interpretability by @sarahwiegreffe + @yanaiela: 3-4 pm in 502 Cowlitz (2/2)
- Time to build more length-calibrated/conscious models, datasets, and eval metrics? Any other factors playing a role like length here? A long way to go indeed; great work by @prasann_singhal!Why does RLHF make outputs longer? arxiv.org/pdf/2310.03716… w/ @tanyaagoyal @JiachengNLP @gregd_nlp On 3 “helpfulness” settings - Reward models correlate strongly with length - RLHF makes outputs longer - *only* optimizing for length reproduces most RLHF gains 🧵 below:
- Replying to @JiachengNLPOur idea: construct a lattice for text generation which could hold a massive number of generation options in a compact space. Two ideas make this possible.
- Amazing work! Love the emphasis on interpretable, scoped AI for science. VLM-as-a-judge feels like a promising direction — curious how it compares to human eval in these tasks.Great to work on this benchmark with astronomers in our NSF-Simons CosmicAI institute! What I like about it: (1) focus on data processing & visualization, a "bite-sized" AI4Sci task (not automating all of research) (2) eval with VLM-as-a-judge (possible with strong, modern VLMs)
- Replying to @JiachengNLPMotivation: we started thinking about how to get more factual outputs out of summarization models. Techniques like beam search just don’t give enough diverse options to rerank over – but what if we could generate a massive number of high-quality generation outputs?














