When I was 12 and my dad sat me down in front of an Apple II computer and said, "We're going to learn to program. First thing to know, you can't just say, 'Make me a race car game.'" Then he made me type that in just to see it not work.
Wow! Ultravox is an *open source* speech to speech model โ understands non-textual speech elements โ paralinguistic information. @juberti just showed how it can pick up on tone, pauses, and more! @AITinkerers Seattle @FixieAI
I created an AI-generated podcast to help me learn and keep up with topics I care about, using AI to find, read and summarize articles it thinks I would I care about, and to generate a podcast in my own voice using voice cloning from @elevenlabs ๐
I can't believe this is real: Transcribing a 60 second mp3 with today's top consumer LLM UIs...
โ Gemini
โ Claude
โ ChatGPT
โ Grok
โ @perplexity_ai
AI browser control in Ruby! Inspired by @sharifshameem a while back, this uses GPT-3 prompt chains to control a browser. @natfriedman said I should release the code, so here it is: github.com/jheitzeb/ruby-โฆ - hope to allow Ruby devs to explore something that feels magical :)
Well on my way to delegating lots of reading and understanding to my AI agent. Next up: ingest a ton of papers, blog posts, news articles, prompt with the things I care about, have it summarize, synthesis and bring the most important things to a regular "1:1" with my AI.
"The simplest thing I can build that can build itself"
โ thatโs the guiding principle of Yohei Nakajima @yoheinakajima, the mind behind BabyAGI, as he challenges the boundaries of what AI can achieve independently.
In my latest episode of @AITinkerers "One-Shot", we dive into