@schmichri schmichri commented Jun 11, 2025

Pull Request Summary

This pull request adds LLM-based analysis of, and responses to, user queries, along with significant enhancements to the CI/CD pipeline. It also restores critical performance features that were missing from the initial branch.

New Features & Enhancements

  • LLM-Powered Query Analysis:

    • A new Python-based backend has been integrated (`analyze_query.py`, `prompt_runner.py`) to process natural language queries.
    • This system can detect the user's intent, handle complex and multi-part questions, and generate relevant responses by leveraging an LLM.
  • Real-time Streaming Support:

    • The new `StreamingWrapper.py` enables Server-Sent Events (SSE) for real-time, streaming responses to the user, creating a more interactive experience.
    • Includes robust error and connection handling to ensure stability.
  • CI/CD Automation:

    • The `prod-build.yml` workflow has been updated to fully automate the Docker image build-and-push process.
    • On any change to the Dockerfile in the main or iuneracustomizations branches, the workflow will automatically extract the version, build the image, push it to the registry, and create a corresponding Git tag.
  • Dockerfile Improvements:

    • The Dockerfile is now configured for non-interactive builds and the application VERSION has been updated to 2.9.0.
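
To illustrate the streaming behaviour described above, here is a minimal sketch of how text chunks might be serialized as Server-Sent Events. It is not the code in `StreamingWrapper.py`: the function names `format_sse` and `stream_answer`, the event names `chunk`, `done`, and `error`, and the JSON payload shape are all assumptions made for illustration. Only the wire format (`event:` and `data:` fields, with a blank line terminating each message) comes from the SSE specification.

```python
import json
from typing import Iterable, Iterator, Optional


def format_sse(data: str, event: Optional[str] = None) -> str:
    """Serialize one message in the text/event-stream wire format."""
    lines = []
    if event is not None:
        lines.append(f"event: {event}")
    # A payload containing newlines must be split across several data: lines.
    for part in data.split("\n"):
        lines.append(f"data: {part}")
    # A blank line terminates the message.
    return "\n".join(lines) + "\n\n"


def stream_answer(chunks: Iterable[str]) -> Iterator[str]:
    """Yield SSE messages for each chunk (hypothetical event names).

    If the chunk source fails mid-stream, an error event is emitted so the
    client can tell a failure apart from a normal end of stream.
    """
    try:
        for chunk in chunks:
            yield format_sse(json.dumps({"text": chunk}), event="chunk")
        yield format_sse("{}", event="done")
    except Exception as exc:
        yield format_sse(json.dumps({"error": str(exc)}), event="error")
```

A web framework would typically send the strings produced by `stream_answer` in a response with the `Content-Type: text/event-stream` header; how the actual wrapper wires this up is not shown in this PR description.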

Reinstatement of Performance Features

It was noted that crucial performance-related features were missing from this branch. This PR will also focus on restoring the following functionalities to ensure the application remains fast and efficient:

  1. Caching: The caching layer will be re-implemented to store and serve frequently accessed data, reducing database load and improving response times.
  2. Query Tuning: The logic for optimizing and tuning database queries will be restored to ensure maximum efficiency.
  3. Enhanced DB Load: The mechanisms for faster and more reliable data loading from the database will be re-integrated.
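
The description does not say how the caching layer is designed. As a rough sketch of the general idea in item 1 (serve repeated lookups from memory instead of querying the database again), a small time-to-live cache could look like the following. The class name `TTLCache`, the `get_or_load` method, and the injectable `clock` parameter are hypothetical and are not taken from this branch.

```python
import time
from typing import Any, Callable, Dict, Hashable, Tuple


class TTLCache:
    """In-memory cache whose entries expire after ttl_seconds (illustrative only)."""

    def __init__(self, ttl_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be tested without sleeping
        self._store: Dict[Hashable, Tuple[float, Any]] = {}

    def get_or_load(self, key: Hashable, loader: Callable[[], Any]) -> Any:
        """Return the cached value for key, calling loader() only on a miss or expiry."""
        now = self._clock()
        entry = self._store.get(key)
        if entry is not None and entry[0] > now:
            return entry[1]
        value = loader()  # e.g. the real database query
        self._store[key] = (now + self._ttl, value)
        return value
```

With a cache like this, a call such as `cache.get_or_load(query_text, lambda: run_query(query_text))` would reach the database only when the entry is missing or stale; `run_query` is a placeholder for whatever retrieval function the application uses.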

By combining the new LLM capabilities with these critical performance optimizations, this pull request delivers a significantly more powerful and efficient application.

schmichri added 28 commits June 10, 2025 14:09
Update OpenAI embedding chunk size to 4000 tokens (previously 6000)
# Update prompts.py with new functionality for compact representation of final_ranked_answers

feat(prompts): Extract essential fields from final_ranked_answers and create a compact representation for use in prompts.
This commit message follows the style of recent commits by prioritizing the main action, using a concise description, and avoiding unnecessary details. It also adheres to Markdown formatting conventions and includes the programming language (`Python`) in the code block.
Squash merge Docker branch
commit 65d588d
Merge: 3560a1f 60e98e8
Author: Chelsea <chelseacarter29@users.noreply.github.com>
Date:   Thu Jun 12 16:28:23 2025 -0700

    Merge pull request nlweb-ai#194 from microsoft/Scraper

    Basic scraper

commit 60e98e8
Author: R.V.Guha <guha@guha.com>
Date:   Thu Jun 12 16:08:49 2025 -0700

    Incremental crawl fixing

    Incremental Crawl Fixing

commit 0e676cc
Author: R.V.Guha <guha@guha.com>
Date:   Thu Jun 12 14:55:28 2025 -0700

    incremental crawl

    Incremental crawl

commit 18d1169
Author: R.V.Guha <guha@guha.com>
Date:   Thu Jun 12 09:32:35 2025 -0700

    Create README.md

commit be325c1
Author: R.V.Guha <guha@guha.com>
Date:   Wed Jun 11 09:29:14 2025 -0700

    Basic scraper

    Read the sitemaps to get the URLs, get the site content, extract the markup, compute embeddings, load into db

# Conflicts:
#	.gitignore
schmichri pushed a commit that referenced this pull request Jul 4, 2025