Accelerated intelligent onboarding for company success
Getting started in a new internship or job is never easy. Navigating a new physical environment, codebase, and tech stack often means spending the first few weeks learning how to download dependencies and navigate documentation instead of contributing meaningfully. Many onboarding processes rely on outdated, hard-to-find documentation, and even mentors struggle to help due to the passage of time since their own onboarding.
We saw an opportunity to address these inefficiencies using AI-driven technologies inspired by tools like GitHub CoPilot. This led to NavigAIt: an AI assistant designed to streamline onboarding, providing live, personalized support for new employees while enabling a seamless start to their journey.
NavigAIt provides real-time, hands-on assistance to users during the onboarding process by analyzing their screens and responding to their questions. Key features include:
- Live Feedback: Continuous audio and display input are analyzed to guide users through tasks.
- Error Detection: Identifies mistakes (e.g., downloading the wrong version of a tool) and provides corrections.
- Multimodal Assistance: Combines Text-to-Speech (TTS), Speech-to-Text (STT), and display analysis for a highly interactive experience.
- 24/7 Support: Employees can get instant help without waiting for mentor availability.
- Consistent Training: Ensures that all employees receive uniform onboarding, improving team cohesion.
-
Frontend:
- Built using Next.js and React for a seamless user interface.
- Live user input and screen capture modules for real-time interaction.
-
Backend:
- Node.js and Express.js for the RESTful server.
- Integration with Google Cloud Platform (GCP) for:
- Vertex AI/Gemini for multimodal AI support.
- Text-to-Speech (TTS) and Speech-to-Text (STT) for voice interactions.
-
LLM System:
- Developed a Retrieval-Augmented Generation (RAG) system using Gemini.
- Optimized the LLM with context-specific prompts and conversation memory using a Queue data structure.
- Data-Driven Feedback: Enable continuous improvement of onboarding modules based on user interaction data.
- Memory Summarization: Introduce summarization modules to improve LLM memory management.
- Direct Audio Input: Extend Gemini's capabilities to accept live audio as input.
- Additional Use Cases: Expand to general coding assistance, IT support, and beyond.
- Node.js (v18 or later).
- Google Cloud SDK with Vertex AI and TTS APIs enabled.
- Sox and FFmpeg for audio playback.
- Clone the repository:
git clone https://github.com/fuselierr/navigAIt.git cd navigait - Install dependencies:
npm install
- Configure environment variables in a .env file:
GCLOUD_PROJECT= GCLOUD_CREDENTIALS= GOOGLE_APPLICATION_CREDENTIALS= BUCKET_NAME= KEY_PATH=
- Start the frontend server:
npm run dev
- Start the backend server:
cd src/backend node server.js - Access the frontend, open http://localhost:3000 in your browser.
