user avatar
Ranjay Krishna
@RanjayKrishna
Assistant Professor @ University of Washington, Co-Director of RAIVN lab (raivn.cs.washington.edu), Director of PRIOR team (prior.allenai.org)
California, USA
Joined August 2011
Posts
  • user avatar
    Replying to @sitzikbs and @CVPR
    I am so confused by this thread. Science is absolutely political. What you work on, what funds you - it is deeply political.
  • user avatar
    I successfully defended my PhD a few days ago. Huge thanks to my amazing advisors @drfeifei and @msbernst for supporting me throughout my journey.
    A hearty congratulations to my student @RanjayKrishna (co-advised by @msbernst ) for a successful PhD thesis defense! Great pioneering work in combining human cognition, human-computer interaction and #AI! Thank you PhD committee members @chrmanning @syeung10 @magrawala 🌹
  • user avatar
    🎓 I'm on the faculty job market this year! Please send me a message if your department (or one you know) is interested in a Computer Vision / HCI researcher who designs models inspired by human perception and social interaction! My application materials: ranjaykrishna.com
  • user avatar
    Our submission received my first ever 10/10 review from NeurIPS. Check out our #NeurIPS2023 Oral. We release the largest vision-language dataset for histopathology and train a SOTA model for classifying histopathology images across 13 benchmarks across 8 sub-pathologies.
    Quilt-1M has been accepted for an oral presentation at @NeurIPSConf. As promised, we have also released our data and our model: quilt1m.github.io See you all in New Orleans!
  • user avatar
    We are happy to introduce Action Genome: a new representation, new dataset, and new model for decomposing actions into spatio-temporal scene graphs. Action Genome has 1.7M relationships between 0.4M object instances and enables few-shot action prediction. arxiv.org/pdf/1912.06992…
    GIF
  • user avatar
    My latest #CVPR2018 paper with Ines, @msbernst and @drfeifei is now live with fully documented open source training/testing code. We treat visual relationships as shifts in attention and perform attention saccades around scene graphs. goo.gl/mqSTrA
  • user avatar
    I expect a future where ML agents will dynamically learn through real-world interactions with people. My #CVPR2019 paper with @msbernst and @drfeifei pushes us towards that goal by learning to pose directed questions to learn about the visual world. bit.ly/2CKLdDu
  • user avatar
    Our new paper finds something quite neat: We easily scale up how many tools LLMs can use to over 200 tools (APIs, models, python functions, etc.) ...without any training, without a single tool-use demonstration!!
    Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models paper page: huggingface.co/papers/2308.00… Today, large language models (LLMs) are taught to use new tools by providing a few demonstrations of the tool's usage. Unfortunately, demonstrations are hard to
  • user avatar
    Announcing the first 𝗜𝗖𝗖𝗩 𝘄𝗼𝗿𝗸𝘀𝗵𝗼𝗽 𝗼𝗻 𝗦𝗰𝗲𝗻𝗲 𝗚𝗿𝗮𝗽𝗵 𝗥𝗲𝗽𝗿𝗲𝘀𝗲𝗻𝘁𝗮𝘁𝗶𝗼𝗻 𝗮𝗻𝗱 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴. If your research involves structured data or graph-based learning, consider submitting to us by August 15, 2019: sgrl.stanford.edu
  • user avatar
    On my way to Seoul for #ICCV2019. If you’re at the conference on October 28th, come check out a full day workshop I am organizing on Scene Graph Representation and Learning (sgrl.stanford.edu). We have a great lineup of speakers and posters.
  • user avatar
    Academic quarter recap: here's a staff photo after the last lecture of @cs231n. It's crazy that we were the largest course at Stanford this quarter. This year, we added new lectures and assignments (open sourced) on attention, transformers, and self-supervised learning.
  • user avatar
    Thank you all for coming to my talk yesterday at #ECCV2024 where we discussed the lessons learned from evaluating VLMs and designing Molmo. In today's talk, I will discuss limitations still left with VLMs and what we can do about it: **Suite 9 at 2pm** green-fomo.github.io/ECCV2024/
    🤯 What?? #Molmo is a COMPLETELY open #SOTA VLM. Exciting talk by Ranjay Krishna at #ECCV2024.
  • user avatar
    Someone made an in-depth video of our recent work at #CVPR2018 on Referring Relationships. If you are interested in how we train models to disambiguate between different people or objects in images, go check it out. #ComputerVision #MachineLearning youtube.com/watch?v=G7Ti_S…
  • user avatar
    Deploying LLMs continues to be a challenge as they grow in model size and consume more data. We introduce a simple distillation mechanism to make even 770M T5 models outperform 540B PaLM. Led by my PhD student @cydhsieh and with collaborators @chunliang_tw @ajratner
    Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes reduce both the model size and the amount of data required to outperform LLMs; our 770M T5 model outperforms the 540B PaLM model using only 80% of available data on a