Carles Domingo-Enrich (@cdomingoenrich) / X

Carles Domingo-Enrich

66 posts

Carles Domingo-Enrich

@cdomingoenrich

Senior Researcher @ Microsoft Research New England. Formerly: Visiting Researcher @ Meta FAIR and CS PhD @ NYU.

Cambridge, MA

cdenrich.github.io

Joined September 2024

Pinned
Carles Domingo-Enrich
@cdomingoenrich
Mar 16
(1/9) Most LM fine-tuning optimizes next-token loss or scalar rewards. What if we fine-tune language models so that feature statistics of partial rollouts match those of ground-truth completions? That leads to Energy-Based Fine-Tuning (EBFT). arXiv: arxiv.org/abs/2603.12248
00:00
44K
Carles Domingo-Enrich
@cdomingoenrich
Nov 4, 2024
If you are a PhD student and want to intern with me or my colleagues at @MSRNE @MSFTResearch, please apply at jobs.careers.microsoft.com/global/en/job/…
49K
Carles Domingo-Enrich
@cdomingoenrich
Oct 2, 2024
New paper! A taxonomy of loss functions for stochastic optimal control: arxiv.org/pdf/2410.00345 If our recent work on Adjoint Matching (arxiv.org/abs/2409.08861) made you want to learn about deep-learning SOC techniques, check my systematic study on all available losses!
7.3K
Carles Domingo-Enrich
@cdomingoenrich
Oct 10, 2024
Undergrad internship opportunities at MSR! If you are a rising junior or senior undergraduate student interested in working with me, apply here and mention my name: aka.ms/msr-ugrad. Topics: fine-tuning and inference of generative models for continuous and discrete data.
microsoft.com
Undergraduate Research Internship – Computing - Microsoft Research
Accepting applications for 12-week summer research internships for juniors & senior undergrads w/ demonstrated leadership in diversity.
5.7K
Carles Domingo-Enrich
@cdomingoenrich
Sep 18, 2024
Replying to @guanhorng_liu and @RickyTQChen
Thank you!!
62
Carles Domingo-Enrich
@cdomingoenrich
Sep 17, 2024
Replying to @PatrickKidger and @RickyTQChen
We did not think of a discretize-then-optimize version of this, and probably there isn't any. Because we removed some terms of the gradient, the resulting algorithm cannot be directly regarded as an optimization algorithm on any particular objective.
61