fix: handle chatty LLM responses in JSON parsing #4525
Merged
kartik-mem0 merged 4 commits into main on Mar 26, 2026
Conversation
Local LLMs (LM Studio, Ollama) often wrap JSON output in conversational
text. The fact retrieval path already has a JSONDecodeError fallback
using extract_json(), but the memory action path did not, causing empty
results. Apply the same fallback pattern to both sync and async paths,
and enhance extract_json() to locate JSON by first '{' / last '}'
boundaries when no code block is found.
Closes #3788
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
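
For context, the fallback pattern described above looks roughly like the following. The wrapper function and the import path are assumptions for illustration (in current mem0 the helpers live in `mem0.memory.utils`); the actual call sites are inline in the diff.

```python
import json
import logging

# Assumed import path; in mem0 these helpers live in mem0.memory.utils.
from mem0.memory.utils import extract_json, remove_code_blocks

logger = logging.getLogger(__name__)


def parse_memory_actions(response: str) -> dict:
    """Illustrative wrapper around the fallback pattern described above."""
    try:
        # Happy path: the model returned clean (possibly fenced) JSON.
        return json.loads(remove_code_blocks(response))
    except json.JSONDecodeError:
        # Chatty local LLM: retry against the original response so
        # extract_json() can apply its own boundary detection.
        logger.warning("Invalid JSON response, falling back to extract_json()")
        return json.loads(extract_json(response))
```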
Introduce extract_json_from_chatty_response() instead of modifying the
existing extract_json(), which is used by 15+ LLM providers and vector
stores. The new function adds {/} boundary detection for responses
without code blocks, while extract_json remains unchanged to avoid
regressions in unrelated code paths.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
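
A minimal sketch of what such a dedicated helper could look like, assuming the same code-block-first behavior as the existing `extract_json()`; the body here is illustrative, not the merged implementation.

```python
import re


def extract_json_from_chatty_response(text: str) -> str:
    """Sketch: fenced-block extraction first, then {/} boundary detection."""
    # Prefer an explicit ```json ... ``` (or bare ```) code block.
    match = re.search(r"```(?:json)?\s*(.*?)\s*```", text, re.DOTALL)
    if match:
        return match.group(1)
    # No code block: slice from the first '{' to the last '}'.
    start, end = text.find("{"), text.rfind("}")
    if start != -1 and end > start:
        return text[start : end + 1]
    # Nothing JSON-like found; return the input unchanged.
    return text
```

Keeping this in a separate function means the 15+ existing `extract_json()` callers never see the new heuristic.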
Local LLMs (LM Studio, Ollama) often wrap JSON output in conversational
text. This caused empty results because json.loads failed on the chatty
text and the error was silently swallowed.
Changes:
- Enhanced extract_json() to locate JSON by first '{' / last '}'
boundaries when no code block is found
- Added JSONDecodeError fallback to memory action parsing in both
sync and async paths (was completely missing)
- Fixed fact retrieval paths to pass original response to extract_json
instead of the already-processed output of remove_code_blocks
Closes #3788
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
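
The third change is easiest to see as a sketch of the fixed fact-retrieval parsing. The function, variable names, and the "facts" key are assumptions based on mem0's fact-retrieval prompt, not copied from the diff.

```python
import json

from mem0.memory.utils import extract_json, remove_code_blocks  # assumed path


def parse_facts(response: str) -> list:
    """After the fix: the fallback receives the ORIGINAL response."""
    cleaned = remove_code_blocks(response)  # no longer overwrites `response`
    try:
        return json.loads(cleaned)["facts"]
    except json.JSONDecodeError:
        # Before the fix, `response = remove_code_blocks(response)` mutated
        # the variable, so this fallback only ever saw the degraded text.
        return json.loads(extract_json(response))["facts"]
```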
kartik-mem0 approved these changes on Mar 26, 2026
Linked Issue
Closes #3788
Description
Local LLMs (LM Studio, Ollama) often wrap JSON output in conversational text (e.g., "Here is the memory extraction: {...} I hope this helps!"). This caused empty results because `json.loads` failed on the chatty text and the error was silently swallowed.

Three changes:

1. Enhanced `extract_json()` to locate JSON by first `{` / last `}` boundaries when no code block is found. This is safe for all existing callers: clean JSON, code-block-wrapped JSON, and arrays all produce identical results, since the heuristic only activates when there is no code block and the text contains `{` / `}`.
2. Added a `JSONDecodeError` fallback to memory action parsing in both the sync and async `_add_to_vector_store` paths. This was completely missing: when the LLM returned chatty text, `json.loads(remove_code_blocks(response))` failed and the outer `except Exception` silently set `new_memories_with_actions = {}`.
3. Fixed the fact retrieval paths to pass the original `response` to `extract_json()` instead of the already-processed output of `remove_code_blocks()`. Previously `response = remove_code_blocks(response)` mutated the variable, so the fallback received degraded input.

Type of Change
Breaking Changes
N/A
Test Coverage
Tests Added
New test file `tests/test_chatty_llm_parsing.py` with 21 tests across three test classes:

- `TestExtractJson` (9 tests): validates `extract_json()` handles chatty responses, including the first-`{` / last-`}` boundary heuristic
- `TestRemoveCodeBlocks` (4 tests): regression tests confirming `<think>` tag handling edge cases
- `TestFallbackChain` (8 tests): validates the full `remove_code_blocks` → `extract_json` pattern as used in `main.py`, covering `<think>` tags with JSON and LM Studio-style responses

All 58 tests pass (21 new + 25 existing memory + 12 LLM provider tests).
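
As an illustration of the fallback-chain tests, a representative case might look like the following. The assertions are a sketch, assuming the enhanced `extract_json()` from this PR and the `mem0.memory.utils` import path.

```python
import json

import pytest

from mem0.memory.utils import extract_json, remove_code_blocks  # assumed path

CHATTY = 'Sure! Here is the result: {"facts": ["likes tea"]} Hope that helps!'


def test_fallback_chain_recovers_chatty_json():
    # Stripping code blocks leaves the chatty text, so json.loads fails...
    with pytest.raises(json.JSONDecodeError):
        json.loads(remove_code_blocks(CHATTY))
    # ...but the boundary-detection fallback recovers the payload.
    assert json.loads(extract_json(CHATTY)) == {"facts": ["likes tea"]}
```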
Checklist