๐Ÿš€ Trainify: Convert 2D image into 3D environments to train RL agents in the cloud

๐ŸŒŸ Project Overview

Trainify simplifies reinforcement learning by enabling users to create and train AI agents in 3D environments directly on the cloud. Leveraging cutting-edge 2D/3D computer vision and NLP, our platform bridges the gap between idea and execution, allowing you to generate complex environments with simple natural language prompts and image references.

๐Ÿ’ก Inspiration

We were inspired by the increasing need for realistic 3D environments in various fields such as robotics training, game development, and simulation. Current solutions often require advanced technical skills and specialized knowledge. We wanted to democratize this process by leveraging the power of generative AI to make 3D environment creation accessible to everyone, regardless of their technical background.

๐Ÿ› ๏ธ What It Does

Trainify offers a seamless workflow:

  • ๐Ÿ“ Creation Interface: Users can describe their desired environment using natural language prompts with their uploaded reference images for simplistic workflow
  • ๐ŸŒ 3D Visualization: Segment objects of interest using the text prompt and turn images into 3D object representation where you can explore and modify in real time
  • โœ๏ธ Editing Mode: Intuitive tools allow users to adjust object positions, properties, and relationships within the 3D space with our cloud-based collaborative editor
  • ๐Ÿค– Training Mode: Built-in reinforcement learning capabilities enable users to train AI agents to perform tasks within their custom environments.
  • ๐Ÿ’ฌ AI Assistant: An integrated Gemini-powered chatbot provides context-aware assistance during every step of the process.

โš™๏ธ How We Built It

Our platform uses a modern tech stack:

  • ๐ŸŽจ Navigable 3D Frontend: React with styled-components for a responsive and intuitive UI and Three.js for real-time 3D visualization in the browser
  • ๐Ÿ”ง Cloud Native Deployment: Highly scalable cloud microservices deployed using GCP Cloud Run and efficient and customized compute units on the cloud for real time training using Docker and Kubernetes (GKE)
  • ๐Ÿง  Generative AI: Google's Gemini API for natural language understanding and generation, CLIPSeg for contrastive learning based image segmentation, Trellis for 3D gaussian splatting
    • Custom reinforcement learning framework built with PyBullet
  • ๐Ÿ—ƒ๏ธ Database & Storage: Firebase for user authentication and project management and Cloud Storage for highly accessible, available storage for large model files

๐Ÿงฉ Challenges We Ran Into

  • ๐Ÿšง Performance Optimization: Rendering complex 3D environments smoothly in the browser required significant optimization.
  • โ˜๏ธ Navigating GCP: Google Cloud Platform has a lot of complex features that take time to learn and setup
  • ๐Ÿค Real-time Collaboration: Implementing collaborative features presented synchronization challenges.

๐ŸŽ‰ Accomplishments That We're Proud Of

  • Created an intuitive interface accessible to non-technical users.
  • Integrating latest GenAI research from multiple fields like computer vision, agentic LLMs, and reinforcement learning.
  • Developed a novel approach for training reinforcement learning agents in user-generated environments directly from daily life scenarios
  • Built a scalable, cloud native architecture handling complex environments and simulations using Google Cloud Platform

๐Ÿ“š What We Learned

  • Importance of balancing automation and user control in AI-assisted creative tools.
  • Techniques for optimizing 3D rendering performance in browsers.
  • Effective communication strategies between frontend and backend components.
  • Methods to translate natural language descriptions into structured 3D scene representations.

๐Ÿ”ฎ What's Next for Trainify

  • ๐Ÿ“ฆ Expanded Asset Library: Developing a more comprehensive library of pre-built objects and environments.
  • ๐Ÿ‘ฅ Enhanced Collaboration: Adding real-time collaboration features for teams like Google Docs!
  • ๐Ÿงช Integration with Industry Standard Toolkits: By extending native support to OpenUSD, we allow easy export and import between industry standard physical AI and robotics framework such as Nvidia Omniverse and Issac Sim, taking a step closer to training physical robots!

๐ŸŽˆ Try It Out

Experience the future of 3D environment creation with Trainifyโ€”where your imagination becomes reality with the power of AI!

Built With

Share this project:

Updates