user avatar
Patrick Esser
@pess_r
Walking on the generative side of computer vision @bfl_ml . he/him
Heidelberg
Joined September 2016
Posts
  • Pinned
    user avatar
    #stablediffusion text-to-image checkpoints are now available for research purposes upon request at github.com/CompVis/stable… Working on a more permissive release & inpainting checkpoints. Soon™ coming to @runwayml for text-to-video-editing
    00:00
  • user avatar
    The model behind Erase & Replace is now available at github.com/runwayml/stabl… @runwayml jointly with @robrombach
    00:00
  • user avatar
    Watching transformers do their thing in 3D github.com/CompVis/taming…
    00:00
  • user avatar
    Pre-tamed transformers now at github.com/CompVis/taming… Just added a Colab demo to start sampling right away: colab.research.google.com/github/CompVis…
    00:00
  • user avatar
    We present Latent Diffusion Models in tomorrow's Image & Video Synthesis and Generation session @CVPR in Hall B1 (Thur. 08:30-10:18). Join us for a chat at Poster #7 between 10:00-12:30 in Hall B2-C and sample some jazz! With @robrombach @andi_blatt D. Lorenz @runwayml B. Ommer
    00:00
  • user avatar
    braindance.py demo to run our GeoGPT models on images in the wild. Code at github.com/CompVis/geomet… Detes at arxiv.org/abs/2104.07652
    00:00
  • user avatar
    wouldn't be where we are are without @EMostaque ❤️
  • user avatar
    Jumping into a thread about video inpainting @runwayml 🦘👇 1/7
    00:00
  • user avatar
    Geometry-Free View Synthesis: We don't need no 3D priors. Leave them transformers unbiased! Without coding 3D transformations into the model, they learn to synthesize novel views from a single input image. arxiv.org/abs/2104.07652
    00:00
  • user avatar
    Excited about ML for creative video editing? Come work with an amazing team @runwayml ! We have open positions for Research Scientists and other roles. The team is diverse and distributed all over the world with a great remote work culture. runwayml.com/about/#open-po…
    00:00
    Thrilled to announce Runway’s $35M Series B led by Coatue. Content creation and video editing are being massively transformed by machine learning and the web. Excited to double down on our mission to keep reimagining how we tell stories. More here: runwayml.com/blog/runway-ra…
  • user avatar
    Denoising Diffusion Probabilistic Models converted to PyTorch with Streamlit Demo github.com/pesser/pytorch… Run demo: pip install -e git+github.com/pesser/pytorch… pytorch_diffusion_demo
    00:00
  • user avatar
    So many good news! 🥳 I joined @runwayml as a Research Scientist! I'll present Geometry-Free View Synthesis (arxiv.org/abs/2104.07652) at #ICCV2021 today 6 pm and Friday 11 am EDT 👋 ImageBART (arxiv.org/abs/2108.08827) was accepted at #NeurIPS2021 w/ @robrombach @andi_blatt & BO
  • user avatar
    ❤️ let's gooo 🚀
    Party time! The SD3 paper made it to arxiv: arxiv.org/abs/2403.03206 Key takeaways: - flow matching is very nice. - back to work with @pess_r and a fantastic team ♥️ The paper is full of details on improved flow matching, scaling and engineering. Enjoy!
  • user avatar
    sooooo good! 🤩 amazing work, congrats to the whole @runwayml team!!
    Introducing Gen-3 Alpha: Runway’s new base model for video generation. Gen-3 Alpha can create highly detailed videos with complex scene changes, a wide range of cinematic choices, and detailed art directions. runwayml.com/gen-3-alpha (1/10)
    00:00