ClauseAI

ClauseAI is a document processing and querying application that leverages AI-powered tools for extracting metadata, vectorizing content, and providing intelligent query responses. It combines the power of OpenAI models and Qdrant to create a seamless document management system.

Installation

Follow these steps to set up and run the ClauseAI project locally:

Prerequisites

Python 3.10 or higher
Virtual environment manager (optional but recommended)

Steps

Clone the Repository:
```
git clone <repository_url>
cd ClauseAI
```

Set up a Virtual Environment:

python -m venv virtual
source virtual/bin/activate   # On Windows, use virtual\Scripts\activate

Install Dependencies:
```
pip install -r requirements.txt
```

Set Environment Variables: Create a .env file in the root directory with the following content:

QDRANT_URL=<your_qdrant_url>
QDRANT_API_KEY=<your_qdrant_api_key>
OPENAI_API_KEY=<your_openai_api_key>

Run the Application:
```
streamlit run workflow.py
```

Workflow

ClauseAI consists of two main functionalities:

1. Document Processing

Upload a PDF document.
Convert the document to Markdown format and extract metadata.
Generate vector embeddings for the document content and store them in Qdrant.
Extract entities using GPT-4 for metadata enrichment.

2. Query Document

Select a processed document by its ID.
Query the document using two mechanisms:
- Qdrant: Fetch the most relevant context chunks.
- LLM: Refine the Qdrant output using OpenAI GPT-4 for a natural-language response.

Utilities

PDF to Markdown Conversion: Extracts textual content and metadata from uploaded PDF documents.
Vectorization: Converts document content into vector embeddings using OpenAI embeddings and stores them in Qdrant for efficient querying.
Entity Extraction: Uses GPT-4 to identify and extract key entities in the document.
Intelligent Querying: Combines Qdrant's vector search and GPT-4's natural language understanding to deliver detailed query responses.

Contributing

We welcome contributions to ClauseAI! Please fork the repository and create a pull request with your changes.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.devcontainer		.devcontainer
handler		handler
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
workflow.py		workflow.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ClauseAI

Installation

Prerequisites

Steps

Workflow

1. Document Processing

2. Query Document

Utilities

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

BlackDevil559/ClauseAI

Folders and files

Latest commit

History

Repository files navigation

ClauseAI

Installation

Prerequisites

Steps

Workflow

1. Document Processing

2. Query Document

Utilities

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages