Inspiration
The idea for this extension came to me while reflecting on how often I listen to music without fully understanding the lyrics, especially when the song is in a foreign language or features complex vocal delivery. I realized that many listeners face the same challenge: they want to enjoy music deeply but struggle to grasp the meaning behind the words. With over 80% of the world's population not speaking English, the language barrier alone affects a huge audience.
I wanted to create something that not only solves this problem but also adds a layer of fun and emotional engagement. That’s how the concept of humorous summaries and live character chats was born - blending utility with entertainment.
What it does
This extension automatically generates a summary of any song’s lyrics, regardless of language or genre, without requiring users to search for or read the full text.
Humorous summaries are generated through the lens of fictional professionals: lawyers, programmers, and more.
While a song is playing, randomized characters from the persona pool engage in a simulated chat, offering humorous, emotional, or thematic reactions to the music.
The extension operates directly within YouTube and YouTube Music, ensuring users never need to leave the platform to understand a song’s meaning.
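The random selection of chat characters described above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the pool contents beyond "lawyer" and "programmer", the function names, and the prompt wording are all assumptions.

```python
import random

# Hypothetical persona pool. Only "lawyer" and "programmer" are named
# in the project description; the rest are placeholders.
PERSONA_POOL = ["lawyer", "programmer", "chef", "detective", "accountant"]


def pick_chat_personas(pool, count, rng=None):
    """Return up to `count` distinct personas chosen at random from `pool`."""
    if rng is None:
        rng = random.Random()
    # Never ask for more personas than the pool contains.
    count = min(count, len(pool))
    return rng.sample(pool, count)


def build_chat_prompt(persona, song_title):
    """Build an illustrative prompt asking the model to react in character."""
    return (
        f"You are a {persona} reacting live to the song '{song_title}'. "
        "Reply with one short, humorous chat message in character."
    )
```

Passing a seeded `random.Random` makes the selection reproducible during testing, while the default gives a fresh cast for every song.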
How we built it
JavaScript - UI, caption fetching, AI integration
Flask - API, external sources integration, caching layer, logging, AI integration
AI - characters, chat generation, text summarization, advertisement detection
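To illustrate what a caching layer in front of the AI calls might look like, here is a small time-based cache sketch. The class name, the cache key (video ID, persona, language), and the `generate` callback are assumptions made for this example; the real backend may cache differently.

```python
import time


class TTLCache:
    """Tiny in-memory cache whose entries expire after `ttl_seconds`."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be tested
        self._entries = {}   # key -> (expires_at, value)

    def get(self, key):
        entry = self._entries.get(key)
        if entry is None:
            return None
        expires_at, value = entry
        if self._clock() >= expires_at:
            # Entry is stale: drop it and report a miss.
            del self._entries[key]
            return None
        return value

    def set(self, key, value):
        self._entries[key] = (self._clock() + self._ttl, value)


def get_summary(cache, video_id, persona, language, generate):
    """Return a cached summary, calling `generate` only on a cache miss."""
    key = (video_id, persona, language)
    cached = cache.get(key)
    if cached is not None:
        return cached
    summary = generate(video_id, persona, language)
    cache.set(key, summary)
    return summary
```

Keying on persona and language as well as the video means a lawyer's summary in Spanish is never served to someone expecting a programmer's summary in Korean.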
Challenges we ran into
Lyric availability and accuracy - not all songs have reliable sources, so fallback logic was needed.
CORS and cross-origin restrictions - required careful handling of API calls and content scripts.
Persona consistency - designing character logic that feels coherent and entertaining.
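The fallback logic for lyric sources can be pictured as an ordered chain of providers, where the first one to return usable text wins. The sketch below assumes each provider is a plain callable taking a title and artist; the provider names in the usage note are placeholders and no real LRCLIB or Genius API calls are shown.

```python
def fetch_lyrics_with_fallback(title, artist, providers):
    """Try each (name, fetch) pair in order.

    Returns (name, lyrics) for the first provider that yields non-empty
    text, or (None, None) if every provider fails or returns nothing.
    """
    for name, fetch in providers:
        try:
            lyrics = fetch(title, artist)
        except Exception:
            # A failing source should not stop the chain; try the next one.
            continue
        if lyrics and lyrics.strip():
            return name, lyrics.strip()
    return None, None
```

A caller might pass something like `[("lrclib", fetch_lrclib), ("genius", fetch_genius), ("captions", fetch_captions)]`, where the three fetch functions are hypothetical wrappers around each source.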
Accomplishments that we're proud of
Multilingual interface - supports 7 languages (Russian, English, Spanish, French, Italian, Japanese, Korean), automatically adapting to the user's YouTube or YouTube Music language settings.
AI-powered lyric analysis - combines natural language understanding with creative reinterpretation, offering both entertainment and insight.
Ready for global scale - built with reproducible logic and extensibility in mind, including future support for additional languages and personas.
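Adapting to the user's language setting comes down to mapping a locale tag onto one of the seven supported interface languages. The sketch below is written in Python for consistency with the backend, although the extension itself may well do this in JavaScript; the function name and the choice of English as the default are assumptions.

```python
# The seven interface languages listed above, as ISO 639-1 codes.
SUPPORTED_LANGUAGES = ("ru", "en", "es", "fr", "it", "ja", "ko")
DEFAULT_LANGUAGE = "en"  # assumption: fall back to English


def resolve_ui_language(locale_tag):
    """Map a locale tag such as 'es-419' or 'ja_JP' to a supported language."""
    if not locale_tag:
        return DEFAULT_LANGUAGE
    # Keep only the primary language subtag, e.g. 'pt-BR' -> 'pt'.
    primary = locale_tag.replace("_", "-").split("-")[0].lower()
    if primary in SUPPORTED_LANGUAGES:
        return primary
    return DEFAULT_LANGUAGE
```

Adding a new interface language then only requires extending `SUPPORTED_LANGUAGES` and supplying the translated strings.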
What we learned
Hands-on experience with AI agents - designing agentic logic and persona-driven interactions for real-time lyric interpretation.
Deeper understanding of CORS - managing cross-origin requests securely and effectively in a browser extension environment.
Sharper prompt engineering - refining prompts for consistent, context-aware outputs across multiple languages and professional personas.
Deployment to the web - building and packaging a Chrome extension with multilingual support and dynamic AI integration.
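As a rough illustration of the CORS handling mentioned above, the backend can echo the request's origin only when it appears on an allowlist. The sketch below is framework-independent: the allowlist contents, the permitted methods, and the function name are assumptions, and in a Flask app the returned headers could be attached to each response in an `after_request` hook.

```python
# Assumed allowlist: requests made from content scripts on these pages.
ALLOWED_ORIGINS = frozenset({
    "https://www.youtube.com",
    "https://music.youtube.com",
})


def cors_headers(request_origin, allowed_origins=ALLOWED_ORIGINS):
    """Return CORS response headers for an allowed origin, else an empty dict."""
    if request_origin not in allowed_origins:
        # Unknown origin: send no CORS headers, so the browser blocks the read.
        return {}
    return {
        "Access-Control-Allow-Origin": request_origin,
        "Access-Control-Allow-Methods": "GET, POST, OPTIONS",
        "Access-Control-Allow-Headers": "Content-Type",
        # Tell caches that the response varies with the Origin header.
        "Vary": "Origin",
    }
```

Echoing a specific origin rather than using a wildcard keeps the API from being readable by arbitrary websites.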
What's next for Music Summary
Publish to the Chrome Web Store - make the extension publicly available for global users.
Expand language support - add more interface languages to reach broader audiences, especially in Latin America and Asia.
Introduce new personas - develop additional AI characters with diverse professional backgrounds for fresh interpretations.
Add new features - such as fun facts about singers, celebratory reactions for users, or artist-themed interactions.
Explore cross-platform support - consider adapting the extension for other browsers and mobile environments.
Built With
- claude
- gemini
- genius
- javascript
- lrclib
- python
- render
- youtube