KnowIt (UWB 2025 Hackathon - Security Track)

KnowIt Backend

Submission for UW Saves The World Hackathon – April 27, 2025

This repository contains the backend service for KnowIt, a Chrome extension that assesses article reliability and detects phishing in Gmail. The service exposes two HTTP endpoints—/evaluate and /phishing—that the frontend calls to power its functionality.

Features

Article Reliability Evaluation
- Scrapes and parses online articles using newspaper3k.
- Sends cleaned text to the OpenAI API to compute a reliability score and suggest alternative sources.
Gmail Phishing Detection
- Receives raw email content and uses OpenAI to classify phishing risk (numeric score + explanation).
Caching
- cache_manager.py prevents repeated LLM calls for the same input, improving response time and reducing API usage.

Tech Stack & Dependencies

Python 3.8+
Flask – HTTP server framework
Flask-CORS – Cross-origin support for frontend requests
OpenAI Python SDK – LLM integration
newspaper3k – Article scraping & text extraction requirements.txt](file-service://file-WKu11AZhCuLbTnmZspmZoj)

Install dependencies from requirements.txt:

   pip install -r requirements.txt

Environment Variables

Before running, set your OpenAI API key:

export OPENAI_API_KEY="sk-…"

API Endpoints

POST /evaluate • Purpose: Compute a reliability score (0–100%) and suggest alternative article URLs. • Request Body (JSON):

{ "content": "" }

•	Response (JSON):

{ "score": 72, "alternatives": [ "https://reliable-source-1.example", "https://reliable-source-2.example" ] }

POST /phishing • Purpose: Analyze an email’s content for phishing risk. • Request Body (JSON):

{ "content": "<raw email HTML/text>" }

•	Response (JSON):

{ "phishingsense": 2, "explanation": "The email uses urgent language, unfamiliar sender, and suspicious links." }

Installation & Setup

1.	Clone this repo:

git clone https://github.com/yourorg/KnowIt-Backend.git cd KnowIt-Backend

2.	Install dependencies:

pip install -r requirements.txt

3.	Export your OpenAI key:

export OPENAI_API_KEY="sk-…"

Running the Service

Start the Flask server on port 4999 (must match the frontend):

python main.py

By default, CORS is enabled so the Chrome extension can call these endpoints directly.

Example Requests

Evaluate an article

curl -X POST http://localhost:4999/evaluate
-H "Content-Type: application/json"
-d '{"content":""}'

Check for phishing

curl -X POST http://localhost:4999/phishing
-H "Content-Type: application/json"
-d '{"content":"<paste raw email HTML/text>"}'

Future Work

Persist cache to disk or Redis for cross-instance sharing.
Add rate limiting to protect the OpenAI API key.
Improve prompt engineering for more nuanced scoring.
Deploy to a cloud service (e.g., AWS Lambda + API Gateway).

License

Released under the MIT License. See LICENSE for details.

Technical Table of Contents

Features
Directory Structure
Prerequisites
Installation
Configuration
Running the Server
API Endpoints
- GET /health
- POST /evaluate
Example Usage (cURL)
Code Overview
Troubleshooting

Features

HTML‐to‐JSON Extraction: Uses OpenAI to parse web‑scraped article HTML into four structured fields:
- topic (string)
- claims (list of strings)
- data (list of strings)
- intent (string)
Reliability Scoring: Rates each article on a 1–5 reliability scale with an explanation.
CORS‐Enabled: Ready for integration with a Chrome extension or other frontends.

Directory Structure

project-root/
├── main.py           # Flask application and endpoint definitions
├── llm_agent.py      # OpenAIService class encapsulating LLM calls
└── README.md         # This documentation

Prerequisites

Python 3.8+ (tested on 3.12)
pip
An OpenAI API key

Installation

Clone the repo:

git clone https://github.com/your-org/reliably.git
cd reliably

Install dependencies:
```
pip install -r requirements.txt
```

Configuration

Set your OpenAI key in environment or directly in code (not recommended for production):

export OPENAI_API_KEY="sk-...your-key..."

In main.py, modify the instantiation if you prefer environment variables:

from llm_agent import OpenAIService
import os

llm_client = OpenAIService(
    api_key=os.getenv('OPENAI_API_KEY')
)

Running the Server

python main.py
# or use flask CLI:
# export FLASK_APP=main.py
# flask run --host=0.0.0.0 --port=4999

By default, the service listens on port 4999.

API Endpoints

GET /health

Simple health check.

Request: GET http://<host>:4999/health
Response: 200 OK with JSON { "status": "healthy" }

POST /evaluate

Analyze article HTML and score reliability.

Endpoint: POST http://<host>:4999/evaluate
Headers: Content-Type: application/json

Request Body:

{
  "html_content": "<h1>Title</h1><p>Article body...</p>"
}

Response (200 OK):

{
  "topic": "...",
  "claims": ["..."],
  "data": ["..."],
  "intent": "...",
  "reliability": {
    "score": 1—5,
    "explanation": "..."
  }
}

Error Codes:
- 400 for malformed JSON or missing html_content
- 500 for LLM extraction/parsing errors

Example Usage (cURL)

curl -i \
  -X POST http://localhost:4999/evaluate \
  -H "Content-Type: application/json" \
  -d '{"html_content":"<h1>News</h1><p>Scientists confirmed discovery...</p>"}'

Use -i to see headers and status codes. Pipe to jq for pretty JSON.

Code Overview

`main.py`

Sets up Flask, CORS, and two routes:
- /health (GET): basic status
- /evaluate (POST): calls OpenAIService to extract & score

`llm_agent.py`

OpenAIService: encapsulates all OpenAI ChatCompletion calls:
- extract_article_content(html_content) → dict with four fields
- score_reliability(info) → dict with {score, explanation}

This separation keeps your Flask routes clean and makes LLM logic reusable.

Troubleshooting

500 errors typically means the LLM output didn’t parse. Check the raw log in your console for JSON decode errors.
CORS issues in the browser: restrict origins in CORS(app, …) to your extension’s ID or domain.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github/workflows		.github/workflows
__pycache__		__pycache__
cache		cache
.DS_Store		.DS_Store
README.md		README.md
cache_manager.py		cache_manager.py
llm_agent.py		llm_agent.py
llm_agent_local.py		llm_agent_local.py
main.py		main.py
main_local.py		main_local.py
reliability_scoring.py		reliability_scoring.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

KnowIt (UWB 2025 Hackathon - Security Track)

KnowIt Backend

📋 Table of Contents

Features

Tech Stack & Dependencies

Environment Variables

API Endpoints

Installation & Setup

Running the Service

Example Requests

Evaluate an article

Check for phishing

Future Work

License

Technical Table of Contents

Features

Directory Structure

Prerequisites

Installation

Configuration

Running the Server

API Endpoints

GET /health

POST /evaluate

Example Usage (cURL)

Code Overview

main.py

llm_agent.py

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`main.py`

`llm_agent.py`

Packages