AI Breakfast
3,081 posts
The latest rumors and developments in the world of artificial intelligence. DM to include your AI project in the email newsletter with 100k subscribers!
- Meta jumps years ahead with lifelike Avatars. LLMs can be trained to speak like you. Text-to-voice can sound like you. The Simulation is coming.
- AI image recognition models are powering the world’s next agricultural workforce: Watch as these drones use multispectral color grading to determine the ripeness + sugar content of apples, then gently pick them:
- Microsoft CEO Satya Nadella to board members: "If OpenAI disappeared tomorrow, we have all the IP rights and all the capability. We have the people, we have the compute, we have the data, we have everything. We are below them, above them, around them." Wow.
- 🤯 Full body tracking now possible using only WiFi signals. A deep neural network maps the phase and amplitude of WiFi signals to UV coordinates within 24 human regions. The model can estimate the dense pose of multiple subjects using WiFi signals as the only input. 🧵
- This is wild: DragGAN: Interactive point-based manipulation of images using AI. This gives you control over the pose, shape, expression, and layout of the objects in your images.
- Multilingual voice cloning from ElevenLabs is insane: Joe Rogan reads Athletic Greens ad in flawless Spanish (100% AI generated)
- Added compounds may have just produced a fully levitated LK-99 sample:
- From Beijing World Robotics Conference: x.com/froggyups/stat…
- AI + AR and the future of cooking. Now apply this to every specialty profession.
- 🚨 Breaking news from the land of paywalls: Microsoft, OpenAI’s biggest investor, plans to integrate ChatGPT into Bing search by March 2023. Microsoft already has plans in place for a DALL-E 2 integration through “Bing Image Creator” as seen here: bing.com/create
- ElevenLabs just lost the crown - to open source. Chatterbox by @resembleai (I remember writing about this team back in 2022 as a pioneer of AI audio) just released as an open-source alternative for audio generation and voice cloning. - Zero-shot voice cloning from just 5
- Use ChatGPT on your own files. This is going to be big: humata.ai lets you upload a .pdf up to 60 pages long and allows you to ask questions about it in plain English ↓
- These AI images are almost impossible to comprehend as fake (until you look closely). Prompting Midjourney with "phone photo" adds an eerie sense of photorealism. From Reddit user u/KudzuEye