Videos for my Fall 2019 course "Deep Learning for Computer Vision" are now on YouTube!
This is an evolution of @cs231n that I used to teach at Stanford:
- All content refreshed
- New topics: Transformers, Video, 3D, etc
- HW in @PyTorch + @GoogleColab
I'm excited about Segment Anything released from FAIR today. It tackles an old problem (find objects in images) at large scale: trained on 11M images and 1B objects.
This is a new Foundation Model for Computer Vision - it recognizes any object in any context.
10 years ago, deep learning was in its infancy. PyTorch didn't exist. Language models were recurrent, and not large. But it felt important: a new technology that would change everything.
That's why @drfeifei , @karpathy, and I started @cs231n back in 2015 - to teach the world's
Our new paper (w/@kdexd) argues that "language is all you need" for good visual features: we train CNN+Transformer *from scratch* on ~100k images+captions from COCO, transfer the CNN to 6 downstream vision tasks, and match/exceed ImageNet features despite using 10x fewer images!
Introducing "VirTex": a pretraining approach to learn visual features via language using fewer images.
Pretrain: CNN+Transformer from scratch on COCO Captions.
Transfer CNN: Results on 6 vision tasks match/exceed ImageNet pretraining (10x size wrt COCO)!
arxiv.org/abs/2006.06666
Today we released PyTorch3D v0.2, adding new features around point clouds:
- Point cloud renderer
- Point-to-mesh distances
- Normal estimation
- Umeyama, ICP, PnP, and KNN
All batched and differentiable, ready to drop into your deep learning models!
Today we released code for SynSin, our CVPR'20 oral
that generates novel views from a single image:
github.com/facebookresear…
We have:
- Pretrained models
- Jupyter notebook demos
- Training and evaluation
- #pytorch3d integration
Congrats to @OliviaWiles1 on the release!
Super proud of my PhD student Justin @jcjohnss for successfully defending his PhD dissertation today. Justin’s thesis symbolizes a new era of computer vision and #AI research moving towards deeper visual reasoning and intelligence. Congrats Justin!!
This week we open-sourced pycls, a flexible research framework for image classification with @PyTorch encapsulating current best practices. Used internally for research @FacebookAI -- excited to share with the community! Led by Ilija Radosavovic @ir413
PyTorch3D is our new library for accelerating 3D deep learning research, and provides:
- Easy batching of heterogeneous triangle meshes
- Optimized implementations of common mesh ops
- Modular, efficient, differentiable mesh renderer
- More to come!
We just released PyTorch3D, a new toolkit for researchers and engineers that’s fast and modular for 3D deep learning research: ai.facebook.com/blog/-introduc…
I'm excited to share our new paper that jointly detects objects and predicts 3D triangle meshes in real-world images, called Mesh R-CNN.
With Georgia Gkioxari and Jitendra Malik
arxiv.org/abs/1906.02739