stripathy1999/Cortex-AI

Cortex-AI

Control your Mac or PC with AI — from your phone.

iMessage · Discord · Slack · WhatsApp · FaceTime voice


✨ What it does

Send a message from your phone. Your computer does the rest.

| You send | Your Mac/PC does |
| --- | --- |
| "FaceTime me" | Starts FaceTime, shares screen; then use voice to control the computer |
| "Open Chrome and search X on Google" | Opens Chrome, runs the search, opens the first result |
| "Open Notepad" | Opens Notepad and types a test message |
| "Open Calculator" | Opens the Calculator app |
| "Go to YouTube Music and play some music" | Opens Chrome, navigates to YouTube Music |

Only your messages are processed; everyone else is ignored.
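
The README does not say how senders are filtered, so the following is only a minimal sketch of one way an allowlist check could work. The names `ALLOWED_SENDERS`, `IncomingMessage`, `normalize_sender`, and `should_process` are hypothetical and are not taken from the project's code; the handles are placeholders.

```python
from dataclasses import dataclass

# Hypothetical allowlist of phone numbers / handles (placeholder values).
# The real project may store allowed senders elsewhere, e.g. in .env.
ALLOWED_SENDERS = {"+15555550100", "you@example.com"}


@dataclass
class IncomingMessage:
    sender: str  # phone number, email handle, or user ID of the sender
    text: str    # raw message body


def normalize_sender(sender: str) -> str:
    """Strip whitespace and lowercase so handles compare consistently."""
    return sender.strip().lower()


def should_process(message: IncomingMessage, allowed=ALLOWED_SENDERS) -> bool:
    """Return True only when the sender is on the allowlist."""
    allowed_normalized = {normalize_sender(s) for s in allowed}
    return normalize_sender(message.sender) in allowed_normalized
```

With this sketch, a message from `you@example.com` would be handed to the backend, while a message from any handle outside the set would be dropped before any task runs.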


🖼️ How it looks

Cortex-AI in action — message your PC from phone, agent status and Bright Data scraping

Message from your phone → task runs on your computer. Architecture image generated by AI and made with love by Aditya Punjani and Sakshi Tripathy.


🎬 Watch the demo

▶ Watch on YouTube


🏗️ How it works

flowchart LR
    subgraph Phone
        A[iMessage / Discord / Slack / WhatsApp]
    end
    subgraph Your Computer
        B[Message bridge]
        C[Backend]
        D[FaceTime voice]
        E[Stagehand · Browser]
        F[Agent-S · Desktop]
    end
    A --> B --> C
    C --> D
    C --> E
    C --> F
    D --> G[Your Mac/PC]
    E --> G
    F --> G
| Layer | Role |
| --- | --- |
| Message bridge | Receives your messages (iMessage, Discord, Slack, WhatsApp). |
| Backend | Routes each message to the right service. |
| FaceTime | Text "FaceTime me" → call starts, screen shares; then use voice to control the Mac. |
| Stagehand | Browser tasks (Chrome, Google, YouTube, Gmail, etc.). |
| Agent-S | Desktop tasks (Notepad, Calculator, files, apps). |
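
To make the routing step concrete, here is a rough sketch of how a backend could pick a service for each message. It assumes simple keyword matching, which the README does not confirm; the actual backend may decide differently (for example, with an LLM). `Service`, `BROWSER_HINTS`, and `route_message` are hypothetical names used only for illustration.

```python
from enum import Enum


class Service(Enum):
    FACETIME = "facetime"    # call + screen share + voice control
    STAGEHAND = "stagehand"  # browser tasks
    AGENT_S = "agent-s"      # desktop tasks


# Hypothetical keyword hints for browser tasks (assumption, not from the repo).
BROWSER_HINTS = ("chrome", "google", "youtube", "gmail")


def route_message(text: str) -> Service:
    """Pick a service for a message using case-insensitive keyword matching."""
    lowered = text.lower()
    if "facetime me" in lowered:
        return Service.FACETIME
    if any(hint in lowered for hint in BROWSER_HINTS):
        return Service.STAGEHAND
    # Anything else is treated as a desktop task in this sketch.
    return Service.AGENT_S
```

Under these assumptions, "Open Chrome and search X on Google" would go to Stagehand and "Open Notepad" would fall through to Agent-S.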

🚀 Quick start

Mac (iMessage + FaceTime)

git clone https://github.com/AdityaYC/Cortex-Treehacks.git
cd Cortex-Treehacks
cp .env.example .env   # Add API keys, FACETIME_APPLE_ID, etc.
./run_order.sh

Then text your Mac from iMessage. Say "FaceTime me" for a call + screen share and voice control, or send any other task (browser or desktop).

Windows

git clone https://github.com/AdityaYC/Cortex-Treehacks.git
cd Cortex-Treehacks
copy .env.example .env
.\run_windows.ps1

This starts the message bridge, backend, Agent-S, Stagehand, and frontend.


📱 Message your Mac/PC

| Platform | ngrok? | Setup |
| --- | --- | --- |
| iMessage (Mac) | No | Run with run_order.sh; text from your iPhone to trigger tasks and "FaceTime me" for voice |
| Discord | No | Discord & Slack setup → |
| Slack | No | Discord & Slack setup → |
| WhatsApp | Yes | WhatsApp setup → |

🔑 Required API keys

Add to .env:

# Agent-S (required)
ANTHROPIC_API_KEY=sk-ant-your-key

# Stagehand (browser tasks)
OPENAI_API_KEY=sk-your-openai-key

# Discord / Slack / WhatsApp — see setup docs
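
Before starting the services, it can help to confirm the keys are actually set. The sketch below is an illustration only: it assumes the variables from `.env` have already been exported into the process environment (the README does not say how the run scripts load them), and it treats `ANTHROPIC_API_KEY` as mandatory and `OPENAI_API_KEY` as needed only for browser tasks, following the comments above. `check_env` and `missing_keys` are hypothetical helpers, not part of the project.

```python
import os

# Assumption based on the comments in the .env example above.
REQUIRED_KEYS = ("ANTHROPIC_API_KEY",)  # Agent-S (required)
OPTIONAL_KEYS = ("OPENAI_API_KEY",)     # Stagehand (browser tasks)


def missing_keys(env, keys):
    """Return the keys that are absent or blank in the given mapping."""
    return [key for key in keys if not env.get(key, "").strip()]


def check_env(env=None):
    """Raise if a required key is missing; return the missing optional keys."""
    env = os.environ if env is None else env
    missing = missing_keys(env, REQUIRED_KEYS)
    if missing:
        raise RuntimeError("Missing required keys: " + ", ".join(missing))
    return missing_keys(env, OPTIONAL_KEYS)
```

Calling `check_env()` with no arguments inspects the current process environment; passing a plain dict makes the check easy to try without touching real keys.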

📁 Project structure

| Folder | Purpose |
| --- | --- |
| Agent-S | Desktop automation (vision + pyautogui) |
| backend | Flask server, message bridges, Stagehand service |
| frontend | Electron UI |
| docs | Setup guides (Discord, Slack, WhatsApp, testing) |
| scripts | Utilities and run scripts |

📚 Documentation

| Doc | Description |
| --- | --- |
| Video calls & FaceTime | "FaceTime me", Zoom, Meet, WhatsApp |
| Windows setup | Full Windows-only setup |
| Discord & Slack | Bot tokens and allowed user IDs |
| WhatsApp (Twilio) | Twilio and ngrok |
| Testing | How to test messages and flows |

🙏 Credits

  • Cortex-AI — Logo and architecture imagery generated by AI and made with love by Aditya Punjani and Sakshi Tripathy.
  • Agent-S — computer-use agent (simular-ai)
  • Stagehand — browser automation (browserbase)

Last updated: February 2026

About

Treehacks Project
