user avatar
Russ Salakhutdinov
@rsalakhu
CSO @ Sooth Labs, Professor @ CMU, President Elect ICML Board, Ex-VP of Research @ Meta (Multimodal LLMs, AI Agents), ex-Director of AI at @Apple.
Pittsburgh, PA
Joined January 2015
Posts
  • user avatar
    Got promoted to a full professor at CMU:) Thanks to my amazing students, collaborators, and my mentors. CMU is a remarkable place, but what makes it truly unique is its brilliant students! Hard to find a better place to do ML and AI.
  • user avatar
    I get asked a lot: why stay in academia, all the excitement in AI is happening in industry with massive compute. And I am seeing some profs leaving academia, but also seeing lots of researchers in industry looking to go back to academia, especially those who don’t work on LLMs.
  • user avatar
    1) My observation when working at Apple and many other industrial partners is that deep learning researchers who are well versed in graphical models, variational inference, CRFs, MCMC, uncertainty quantification are on average much more creative when solving real-world problems.
    Researchers in speech recognition, computer vision, and natural language processing in the 2000s were obsessed with accurate representations of uncertainty. 1/N
  • user avatar
    This semester I am teaching a graduate course in Deep RL with over 500 graduate students enrolled so far, the biggest class I ever taught! One of my colleagues was joking that in the near future, we might be teaching ML and deep learning classes at the CMU football stadium.😀
  • user avatar
    Being in academia is amazing. Being in industry is exciting. Being in both is electrifying.
  • user avatar
    I hear a lot of folks in our AI community complain about openAI -- they don't publish, don't release models, maximum-for-profit, etc., so they are more like closedAI rather than openAI. This is true, but you have to give it to those guys -- they showed the true potential of LLMs
  • user avatar
    Someone told me once: You can do AI or you can just talk about it. #AIDebate
  • user avatar
    Excited about joining Apple as a director of AI research in addition to my work at CMU. Apply to work with my team
  • user avatar
    Slides from my talk on Integrating Domain Knowledge into Deep Learning at the New York Academy of Sciences @NYASciences. Special shoutout to @ZhitingHu and Bhuwan Dhingra for leading this amazing work and helping me with the slides: cs.cmu.edu/~rsalakhu/NY_2…
  • user avatar
    Very excited for the launch of Apple’s Machine Learning blog: machinelearning.apple.com. Check it out!
  • user avatar
    To be fair, @OpenAI had a big impact on AI/ML. Their OpenAI Gym env for RL, their CLIP - a de facto model for any image-text research, their Whisper speech recognition is amazing - all open sourced. In my view, they've had a bigger impact than many labs publishing 100s papers 🧵
    Data on the intellectual contribution to AI from various research organizations. Some of organizations publish knowledge and open-source code for the entire world to use. Others just consume it.
  • user avatar
    Congratulations to Zhilin Yang for successfully defending his PhD thesis at CMU in just 4 years! Zhilin introduced XL-Net, Transformer-XL, Mixture of Softmaxes High-Rank LM, HotpotQA, GLoMo Unsupervised Learning of Relational Graphs, just to name a few: kimiyoung.github.io
  • user avatar
    New #ICLR2021 paper on Self-supervised Learning with Relative Predictive Coding: A new contrastive learning objective that maintains a good balance between training stability, minibatch size sensitivity, & downstream task performance. arxiv.org/abs/2103.11275 w/ H. Tsai et al.
  • user avatar
    Replying to @rsalakhu
    3) My advise to deep learning PhD students is to also learn variational inference, belief propagation, MCMC, HMMs, graphical models, generative models, etc. -- no matter how irrelevant you think this is for the current speech, vision, and NLP systems.