Inspiration

Music has always been a universal language for expressing emotions, but creating it has been limited to those with musical training or expensive equipment. We were inspired by the explosion of AI tools in creative spaces and saw an opportunity to democratize music creation. We wanted to build something that lets anyone—regardless of musical ability—turn their feelings and ideas into real, shareable songs. The rise of platforms like TikTok and Reels showed us that short-form, mood-driven content is what people crave, and we wanted to put that power in everyone's hands.

What it does

VibeSmith transforms your mood and creativity into fully original songs in seconds. Users can:

  • Describe their vibe using text or voice (e.g., "dark phonk with chill jazz energy")
  • Generate original lyrics written by AI based on themes, emotions, or stories
  • Create custom melodies and beats across any genre or hybrid style
  • Add unique AI vocals using ElevenLabs voices—no celebrity clones, completely original
  • Mix genres creatively (phonk + jazz, lo-fi + rock, etc.)
  • Create AI duets between different voice characters
  • Generate optional video clips optimized for TikTok or Instagram Reels

Everything—music, lyrics, and vocals—is 100% AI-generated and copyright-safe, making it perfect for content creators, social media enthusiasts, and anyone who wants to express themselves through music.

How we built it

We leveraged cutting-edge AI and cloud infrastructure to make VibeSmith fast, scalable, and intelligent:

  • Raindrop Smart Components: Used SmartMemory for user preference tracking, SmartInference for intelligent music generation decisions, and SmartSQL for efficient data management
  • Vultr Cloud: Hosted our AI workloads with high-performance compute instances to handle real-time music generation
  • ElevenLabs API: Integrated voice synthesis for original, expressive vocals across multiple voice profiles
  • Raindrop MCP Server: Deployed our backend with secure API endpoints and workflow automation
  • Custom AI Pipeline: Built a multi-stage generation system that processes mood inputs → lyric generation → melody composition → vocal synthesis → final mix
  • Optional Stripe Integration: Set up payment infrastructure for premium features and subscriptions
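The multi-stage pipeline above can be sketched as a chain of stage functions. This is an illustrative skeleton, not the actual VibeSmith implementation: the stage names, the `Song` container, and the placeholder bodies are assumptions standing in for the real model calls (GPT-4 for lyrics, a music model for melody, ElevenLabs for vocals):

```python
from dataclasses import dataclass

@dataclass
class Song:
    mood: str
    lyrics: str
    melody: str
    vocals: str
    mix: str

# Placeholder stages; in production each would call the relevant model/API.
def generate_lyrics(mood: str) -> str:
    return f"lyrics themed on '{mood}'"

def compose_melody(mood: str, lyrics: str) -> str:
    return f"melody matching '{mood}'"

def synthesize_vocals(lyrics: str, melody: str) -> str:
    return "vocal track aligned to melody"

def mix_tracks(melody: str, vocals: str) -> str:
    return "final mastered mix"

def run_pipeline(mood: str) -> Song:
    """Mood input -> lyric generation -> melody composition -> vocal synthesis -> final mix."""
    lyrics = generate_lyrics(mood)
    melody = compose_melody(mood, lyrics)
    vocals = synthesize_vocals(lyrics, melody)
    mix = mix_tracks(melody, vocals)
    return Song(mood, lyrics, melody, vocals, mix)

song = run_pipeline("dark phonk with chill jazz energy")
```

Each stage only consumes the outputs of earlier stages, which keeps the pipeline easy to test and lets individual stages be swapped or cached independently.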

Challenges we ran into

  • Latency optimization: AI music generation can be slow. We had to architect smart caching and parallel processing to get generation times under 30 seconds
  • Genre blending: Teaching the AI to meaningfully combine disparate genres (like phonk and jazz) required extensive prompt engineering and model fine-tuning
  • Voice consistency: Ensuring AI vocals sounded natural and emotionally appropriate for different music styles took significant iteration with ElevenLabs parameters
  • Copyright safety: Implementing safeguards to ensure all generated content is truly original and legally defensible
  • User experience: Balancing creative control with simplicity—giving users enough options without overwhelming non-musicians

Accomplishments that we're proud of

  • Successfully generated our first fully AI-composed song that genuinely sounded professional
  • Achieved sub-30-second generation times for complete songs
  • Created a genre-blending algorithm that produces surprisingly cohesive hybrid styles
  • Built an intuitive interface that even non-technical users can navigate effortlessly
  • Implemented a robust voice selection system with 10+ distinct AI character voices
  • Deployed a scalable architecture that can handle concurrent users without performance degradation

What we learned

  • AI orchestration is complex: Coordinating multiple AI models (language, music, voice) requires careful pipeline design
  • User input matters: The quality of mood descriptions dramatically impacts output—we learned to guide users with examples and suggestions
  • Performance trade-offs: Real-time generation requires compromises between quality and speed; we found the sweet spot
  • Creative AI is unpredictable: Sometimes the AI produces surprising genre combinations that are actually brilliant
  • Cloud infrastructure choices matter: Vultr's GPU instances gave us the performance we needed at a sustainable cost

What's next for VibeSmith

  • Collaborative creation: Allow multiple users to co-create songs in real-time
  • Extended song lengths: Move beyond 30-60 second clips to full 2-3 minute tracks
  • Live performance mode: Generate background music that adapts to live streaming or gaming
  • Voice cloning (opt-in): Let users create songs in their own voice (with explicit consent)
  • Music library: Build a community where users can share, remix, and discover others' creations
  • Advanced editing: Give users granular control over individual instruments, tempo, and arrangement
  • Mobile app: Native iOS/Android apps for on-the-go music creation
  • API access: Let developers integrate VibeSmith into their own applications

Built With

  • Frontend: React, Tailwind CSS
  • Backend: Python, Node.js
  • AI/ML: OpenAI API (GPT-4 for lyrics), ElevenLabs API (voice synthesis), Stable Audio/MusicGen (music generation)
  • Raindrop: MCP Server, Smart Components (SmartMemory, SmartInference, SmartSQL)
  • Audio processing: FFmpeg, pydub
  • Infrastructure: Vultr Cloud (GPU compute)
  • Payments: Stripe