Vision beneath the surface - AI-driven image enhancement to improve visibility and detect underwater threats
- Overview
- Features
- Architecture
- Technology Stack
- Installation
- Usage
- API Documentation
- Project Structure
- Contributing
- License
DeepVision is a comprehensive AI-powered system designed for underwater image enhancement and threat detection. The platform combines advanced computer vision techniques with modern web technologies to provide real-time image processing, enhancement, and automated threat identification for underwater environments.
- 🌊 Underwater Image Enhancement: AI-powered dehazing and visibility improvement using a UNet architecture
- 🎯 Object Detection: YOLO-based detection of underwater threats and objects
- 📧 Real-time Alerts: Automated email notifications for detected threats
- ☁️ Cloud Storage: Integrated Cloudinary storage for processed images
- 📊 Analytics Dashboard: Historical data visualization and system monitoring
- 👤 Operator Profiles: Configurable alert settings and contact management
- Underwater Image Dehazing: Advanced UNet-based model for improving underwater image clarity
- Object Detection: YOLO v8/v11 models trained for underwater object detection
- Real-time Processing: Fast inference with GPU acceleration support
- Batch Processing: Support for video and image batch processing
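The dehazing model described above follows the UNet encoder-decoder pattern. As a rough illustration of that pattern only (a minimal PyTorch sketch with made-up layer sizes, not the repo's actual `model.py`):

```python
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    # two 3x3 convs with ReLU; padding=1 keeps spatial size unchanged
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
    )

class TinyUNet(nn.Module):
    """Toy UNet: 2 encoder stages, bottleneck, 2 decoder stages."""
    def __init__(self):
        super().__init__()
        self.enc1 = conv_block(3, 8)
        self.enc2 = conv_block(8, 16)
        self.pool = nn.MaxPool2d(2)
        self.bottleneck = conv_block(16, 32)
        self.up2 = nn.ConvTranspose2d(32, 16, 2, stride=2)
        self.dec2 = conv_block(32, 16)   # 32 = upsampled 16 + skip 16
        self.up1 = nn.ConvTranspose2d(16, 8, 2, stride=2)
        self.dec1 = conv_block(16, 8)    # 16 = upsampled 8 + skip 8
        self.out = nn.Conv2d(8, 3, 1)

    def forward(self, x):
        e1 = self.enc1(x)
        e2 = self.enc2(self.pool(e1))
        b = self.bottleneck(self.pool(e2))
        d2 = self.dec2(torch.cat([self.up2(b), e2], dim=1))
        d1 = self.dec1(torch.cat([self.up1(d2), e1], dim=1))
        return torch.sigmoid(self.out(d1))  # dehazed RGB in [0, 1]
```

The skip connections (`torch.cat` of encoder features into the decoder) are what let a UNet restore fine detail lost during downsampling, which is why the architecture suits dehazing.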
- Responsive Design: Mobile-first design with Tailwind CSS
- Real-time Updates: Live processing status and results
- Interactive Dashboard: Comprehensive analytics and historical data
- File Upload: Drag-and-drop image and video upload support
- Automated Notifications: Email alerts for detected threats
- Configurable Settings: Operator profile management
- Threat Classification: Detailed threat detection reporting
- System Logs: Comprehensive logging and monitoring
- Microservices: Modular design with separate services
- Docker Support: Containerized deployment options
- Cloud Integration: Cloudinary for image storage
- Database Integration: MongoDB for data persistence
```mermaid
graph TB
    subgraph "Frontend"
        A[React Client]
        B[Tailwind CSS]
        C[Vite Build System]
    end
    subgraph "Backend Services"
        D[Node.js Express Server]
        E[FastAPI Object Detection]
        F[Flask Image Enhancement]
    end
    subgraph "AI Models"
        G[YOLO Object Detection]
        H[UNet Image Dehazing]
    end
    subgraph "External Services"
        I[Cloudinary Storage]
        J[MongoDB Database]
        K[Email Service]
    end
    A --> D
    D --> E
    D --> F
    E --> G
    F --> H
    D --> I
    D --> J
    D --> K
```
- React 19.1+ - Modern UI library
- Vite - Fast build tool and dev server
- Tailwind CSS - Utility-first CSS framework
- React Router - Client-side routing
- Axios - HTTP client for API calls
- Node.js - JavaScript runtime
- Express.js - Web framework
- FastAPI - Python web framework for object detection
- Flask - Python web framework for image enhancement
- MongoDB - NoSQL database
- Mongoose - MongoDB object modeling
- PyTorch 2.0+ - Deep learning framework
- Ultralytics YOLO - Object detection models
- OpenCV - Computer vision library
- NumPy - Numerical computing
- PIL/Pillow - Image processing
- Docker - Containerization
- Cloudinary - Cloud image/video management
- Nodemailer - Email service
- Multer - File upload handling
- Node.js 18.0 or higher
- Python 3.10 or higher
- MongoDB (local or cloud instance)
- Git
```bash
git clone https://github.com/AyushRaj-10/DeepVision.git
cd DeepVision
```

Create environment files for each service:

```bash
# Server/.env
MONGODB_URI=mongodb://localhost:27017/deepvision
CLOUDINARY_CLOUD_NAME=your_cloud_name
CLOUDINARY_API_KEY=your_api_key
CLOUDINARY_API_SECRET=your_api_secret
EMAIL_HOST=smtp.gmail.com
EMAIL_PORT=587
EMAIL_USER=your_email@gmail.com
EMAIL_PASS=your_app_password

# object_detection/.env
HF_TOKEN=your_huggingface_token
HF_USERNAME=your_hf_username

# under-water_imaging-main/.env
FLASK_ENV=production
```

Start each service in its own terminal:

```bash
cd Server
npm install
npm start
```

```bash
cd object_detection
pip install -r requirements.txt
uvicorn app:app --host 0.0.0.0 --port 7860
```

```bash
cd under-water_imaging-main
pip install -r requirements.txt
python app.py
```

```bash
cd Client
npm install
npm run dev
```

```bash
# Build and run object detection service
cd object_detection
docker build -t deepvision-detection .
docker run -p 7860:7860 deepvision-detection

# Build and run image enhancement service
cd under-water_imaging-main
docker build -t deepvision-enhancement .
docker run -p 5000:5000 deepvision-enhancement
```

- Start the application by running all services
- Navigate to `http://localhost:5173` (React dev server)
- Upload an image using the upload button
- Click "Enhance Image" to process
- View results, including the enhanced image and threat detections
- Configure alerts in the Profile section
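Behind the "Enhance Image" step, enhancement means color and contrast restoration. The repo does this with a learned UNet, but the classical gray-world white balance below (a toy NumPy sketch, not the project's actual pipeline) shows the kind of color-cast correction involved, since underwater scenes are typically shifted toward blue-green:

```python
import numpy as np

def gray_world_balance(img):
    """Scale each RGB channel so its mean matches the global mean
    (the gray-world assumption: the average scene color is gray)."""
    img = img.astype(np.float64)
    means = img.reshape(-1, 3).mean(axis=0)        # per-channel means
    scale = means.mean() / np.maximum(means, 1e-6)  # avoid divide-by-zero
    return np.clip(img * scale, 0, 255).astype(np.uint8)
```

A channel that dominates (e.g. blue underwater) gets scaled down and the attenuated channels get boosted, neutralizing the cast.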
```bash
curl -X POST http://localhost:3001/api/upload \
  -F "file=@your_image.jpg"
```

```bash
curl -X POST http://localhost:7860/detect \
  -F "file=@your_image.jpg"
```

```bash
curl -X POST http://localhost:5000/enhance \
  -F "file=@your_image.jpg"
```

```bash
cd object_detection
python train.py --data data.yaml --epochs 100 --batch-size 16
```

```bash
cd under-water_imaging-main
python train.py --epochs 20 --batch-size 16 --lr 0.0001
```

Base URL: `http://localhost:7860`
| Endpoint | Method | Description |
|---|---|---|
| `/detect` | POST | Detect objects in uploaded image |
| `/detect-video` | POST | Process video for object detection |
| `/status/{job_id}` | GET | Check processing status |
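These endpoints accept plain multipart uploads, so any HTTP client works, not just curl. As a standard-library-only sketch of how such a request body is assembled on the wire (the helper name and fake payload are hypothetical, for illustration):

```python
import io
import uuid

def build_multipart(field, filename, payload, content_type="image/jpeg"):
    """Assemble a multipart/form-data body, like curl's -F flag does."""
    boundary = uuid.uuid4().hex
    body = io.BytesIO()
    body.write(f"--{boundary}\r\n".encode())
    body.write(
        f'Content-Disposition: form-data; name="{field}"; '
        f'filename="{filename}"\r\n'.encode()
    )
    body.write(f"Content-Type: {content_type}\r\n\r\n".encode())
    body.write(payload)
    body.write(f"\r\n--{boundary}--\r\n".encode())
    return f"multipart/form-data; boundary={boundary}", body.getvalue()

# e.g. POST this body to http://localhost:7860/detect with urllib.request,
# sending the returned string as the Content-Type header
content_type, data = build_multipart("file", "your_image.jpg", b"\xff\xd8fake-jpeg")
```

Note the boundary must appear in the `Content-Type` header, which is why curl's `-F` sets that header itself.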
Base URL: `http://localhost:5000`

| Endpoint | Method | Description |
|---|---|---|
| `/enhance` | POST | Enhance underwater image |
| `/batch-enhance` | POST | Process multiple images |
Base URL: `http://localhost:3001`

| Endpoint | Method | Description |
|---|---|---|
| `/api/upload` | POST | Upload and process images |
| `/api/profile` | GET/POST | Manage operator profiles |
| `/api/logs` | GET | Retrieve system logs |
```
DeepVision/
├── Client/                      # React frontend
│   ├── src/
│   │   ├── components/          # Reusable UI components
│   │   ├── pages/               # Page components
│   │   ├── context/             # React context providers
│   │   └── main.jsx             # Application entry point
│   ├── public/                  # Static assets
│   └── package.json
├── Server/                      # Node.js backend
│   ├── controllers/             # Request handlers
│   ├── models/                  # Database models
│   ├── routes/                  # API routes
│   ├── utils/                   # Utility functions
│   └── server.js                # Server entry point
├── object_detection/            # YOLO object detection service
│   ├── app.py                   # FastAPI application
│   ├── model.py                 # YOLO model wrapper
│   ├── train.py                 # Training script
│   ├── checkpoints/             # Model weights
│   ├── dataset/                 # Training data
│   └── requirements.txt
├── under-water_imaging-main/    # Image enhancement service
│   ├── app.py                   # Flask application
│   ├── model.py                 # UNet model definition
│   ├── train.py                 # Training script
│   ├── checkpoints/             # Model weights
│   └── requirements.txt
└── README.md
```
We welcome contributions! Please follow these steps:
- Fork the repository
- Create a feature branch: `git checkout -b feature/amazing-feature`
- Commit your changes: `git commit -m 'Add amazing feature'`
- Push to the branch: `git push origin feature/amazing-feature`
- Open a Pull Request
- Follow the existing code style
- Add tests for new features
- Update documentation as needed
- Ensure all services start without errors
```bash
# Frontend tests
cd Client
npm test

# Backend tests
cd Server
npm test

# Python tests
cd object_detection
python -m pytest

cd under-water_imaging-main
python -m pytest
```

This project is licensed under the MIT License - see the LICENSE file for details.
- Ultralytics for the YOLO implementation
- PyTorch team for the deep learning framework
- React and Vite teams for the frontend tools
- FastAPI and Flask teams for the Python web frameworks
For support and questions:
- 📧 Email: support@deepvision.ai
- 💬 Discord: DeepVision Community
- 📖 Documentation: Wiki
- 🐛 Issues: GitHub Issues

Made with ❤️ for underwater exploration and safety
