Server changes: load all graphs, don't compute num_relations in stats by karasikov · Pull Request #563 · ratschlab/metagraph

karasikov · 2025-11-06T23:49:46Z

Improvements in server

Computing num_relations takes up to 1 min for very large graphs that are loaded with mmap and not cached.
We no longer report it, so /stats always responds instantly.
Always keep all graphs loaded, so that they don't need to be reloaded for every query.
Don't store copied annotation labels, which may create a large overhead for indexes with many labels.
Fixed catching of exceptions thrown in worker threads, so the server now reliably handles incorrect requests in the multi-graph regime.
Logging and other minor improvements.

adamant-pwn

Thanks! Overall it looks nice and is an improvement, but I don't really like the heavy use of .first and .second. I think it's generally good practice to use named structures rather than std::pair / std::tuple, except maybe for function return values that are immediately unpacked with structured bindings.

I would prefer to change .first and .second into structured bindings that would give them names, or in this particular case, I also think we can "painlessly" integrate the new graphs_cache map into existing indexes map, as we almost exclusively access it using all stored values in indexes[name] anyway.
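
To make the readability point concrete, here is a minimal sketch of the two styles side by side. The `IndexMap` alias and both function names are hypothetical and only loosely modeled on the `indexes` map discussed here; this is not code from server.cpp.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in: index name -> list of (graph file, annotation file).
using IndexMap = std::map<std::string, std::vector<std::pair<std::string, std::string>>>;

// Style the review objects to: the reader has to remember what
// .first and .second mean at every use site.
std::string describe_with_pair_members(const IndexMap &indexes) {
    std::string out;
    for (const auto &entry : indexes) {
        for (const auto &files : entry.second) {
            out += entry.first + ": " + files.first + " + " + files.second + "\n";
        }
    }
    return out;
}

// Same loops with C++17 structured bindings: every component gets a name.
std::string describe_with_bindings(const IndexMap &indexes) {
    std::string out;
    for (const auto &[name, file_pairs] : indexes) {
        for (const auto &[graph_fname, anno_fname] : file_pairs) {
            out += name + ": " + graph_fname + " + " + anno_fname + "\n";
        }
    }
    return out;
}
```

Both functions produce the same string; only the second one documents what each pair component holds.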

adamant-pwn · 2025-11-20T11:32:43Z

metagraph/src/cli/server.cpp

+    size_t num_server_threads = std::max(1u, get_num_threads());
+    set_num_threads(0);
+
+    std::unordered_map<std::pair<std::string, std::string>, std::unique_ptr<AnnotatedDBG>> graphs_cache;


I feel like an extra map could be unnecessary here, given that all actual use cases are iterating over indexes[name] and accessing them in that order. Maybe make

struct index { std::string graph_fname; std::string anno_fname; std::unique_ptr<AnnotatedDBG> anno_dbg; };

And put it in indexes instead of std::pair<std::string, std::string>? In any case, I think a struct with named fields is better than std::pair / std::tuple, outside of some specific cases.

The purpose of the two maps is to let multiple different names share the same (graph, anno) pair without duplicating it in memory. It's a weird case, but it happens in tests.

Ah, alright, that's annoying to deal with. And making a custom hasher for in-place struct is a pain too. Let's just give names to .first and .second wherever we use them then?
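
The two-map arrangement discussed in this thread can be sketched roughly as follows. This is an illustration under assumptions, not the PR's code: `FakeAnnotatedDBG`, `FilePairHash`, `Indexes`, `GraphsCache`, and `preload` are hypothetical names, and the snippet quoted in the review does not show which hasher the real `graphs_cache` uses. What is certain is that the standard library provides no `std::hash` specialization for `std::pair`, so some hasher has to be supplied.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <memory>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical placeholder for AnnotatedDBG; the real class lives in metagraph.
struct FakeAnnotatedDBG {
    std::string graph_fname;
    std::string anno_fname;
};

using FilePair = std::pair<std::string, std::string>;

// std::hash is not specialized for std::pair, so an unordered_map keyed by a
// pair needs a user-provided hasher. This combiner is illustrative only.
struct FilePairHash {
    std::size_t operator()(const FilePair &p) const {
        const std::size_t h1 = std::hash<std::string>{}(p.first);
        const std::size_t h2 = std::hash<std::string>{}(p.second);
        return h1 ^ (h2 + 0x9e3779b9 + (h1 << 6) + (h1 >> 2));
    }
};

// name -> list of (graph file, annotation file)
using Indexes = std::unordered_map<std::string, std::vector<FilePair>>;
// (graph file, annotation file) -> loaded object, shared by all names
using GraphsCache = std::unordered_map<FilePair, std::unique_ptr<FakeAnnotatedDBG>, FilePairHash>;

// Load every distinct (graph, annotation) pair exactly once, no matter how
// many index names refer to it.
GraphsCache preload(const Indexes &indexes) {
    GraphsCache cache;
    for (const auto &[name, file_pairs] : indexes) {
        for (const auto &[graph_fname, anno_fname] : file_pairs) {
            auto &slot = cache[FilePair(graph_fname, anno_fname)];
            if (!slot) {
                slot = std::make_unique<FakeAnnotatedDBG>(
                    FakeAnnotatedDBG{graph_fname, anno_fname});
            }
        }
    }
    return cache;
}
```

With two names pointing at the same files, the cache ends up with a single entry for that pair, which is the deduplication the second map exists for.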

metagraph/src/cli/server.cpp

Copilot

Pull Request Overview

This PR optimizes the server performance by preloading and caching all graphs to avoid repeated loading, removes the expensive num_relations computation from the /stats endpoint, and improves exception handling in multi-threaded worker contexts. It also includes various logging improvements with consistent [Server] prefixes and better error visibility.

Key Changes:

All graphs are now preloaded into a cache (graphs_cache) and kept in memory to eliminate reload overhead for queries
Worker thread exceptions are now properly captured and rethrown via std::exception_ptr
The /stats endpoint no longer computes num_relations, making it return instantly even for large mmap'd graphs
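
The capture-and-rethrow pattern named in the key changes can be sketched as below. This is a generic illustration built on plain `std::thread`, not the server's actual threading code; `run_all` is a hypothetical helper, and the choice to keep only the first exception is an assumption made for the sketch.

```cpp
#include <cassert>
#include <exception>
#include <functional>
#include <mutex>
#include <stdexcept>
#include <string>
#include <thread>
#include <vector>

// Run every task on its own thread. If any task throws, rethrow the first
// captured exception on the calling thread once all workers have joined.
void run_all(const std::vector<std::function<void()>> &tasks) {
    std::exception_ptr first_error;
    std::mutex error_mutex;

    std::vector<std::thread> workers;
    workers.reserve(tasks.size());
    for (const auto &task : tasks) {
        workers.emplace_back([&task, &first_error, &error_mutex]() {
            try {
                task();
            } catch (...) {
                // An exception escaping a std::thread calls std::terminate,
                // so it has to be captured inside the worker.
                std::lock_guard<std::mutex> lock(error_mutex);
                if (!first_error)
                    first_error = std::current_exception();
            }
        });
    }
    for (auto &worker : workers)
        worker.join();

    // Back on the calling thread, the error can be handled normally.
    if (first_error)
        std::rethrow_exception(first_error);
}
```

The caller can then wrap `run_all` in an ordinary `try`/`catch`, for example to turn a `std::invalid_argument` from a bad request into an error response instead of a crash.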

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

File	Description
metagraph/src/cli/server_utils.cpp	Changed exception types to `std::invalid_argument`, improved error logging with warn level and request IDs
metagraph/src/cli/server.cpp	Implemented graph caching with preloading, fixed worker thread exception handling, updated stats endpoint to use cache, reduced request timeout to 15 minutes

metagraph/src/cli/server.cpp

adamant-pwn

LGTM 👍

karasikov added 18 commits November 6, 2025 23:44

don't compute the number of relations in server stats

51023e7

always load all graphs at server start

7c4baf0

reduced timeout to 15 min

1653f29

report k and graph mode in stats when they're the same for all graphs

8d72461

minor

045683c

wait longer

3e0a09a

wait longer for more graphs

122fcbd

moved graphs loading to the if statement

c52dc98

no need to store label names now

bcb7e35

cleanup

74cd9a6

cleanup

accfd8b

add prefix [Server] in logs

4da4f3e

minor

78fcac4

catch exceptions in worker threads

c4f5e89

minor

943b182

print warnings where error is due to bad requests and is not critical

76999e5

try

95bb4c7

without std::optional

9c60f87

karasikov force-pushed the mk/server branch from bd26955 to 9c60f87 November 20, 2025 03:05

karasikov requested a review from adamant-pwn November 20, 2025 03:06

adamant-pwn requested changes Nov 20, 2025

adamant-pwn reviewed Nov 20, 2025

minor

bd27fb1

karasikov requested a review from adamant-pwn November 20, 2025 13:29

adamant-pwn requested a review from Copilot November 20, 2025 13:33

Copilot started reviewing on behalf of adamant-pwn November 20, 2025 13:34

Copilot finished reviewing on behalf of adamant-pwn November 20, 2025 13:36

Copilot AI reviewed Nov 20, 2025

adamant-pwn approved these changes Nov 20, 2025

karasikov merged commit 48a1af9 into master Nov 20, 2025
147 of 148 checks passed

karasikov deleted the mk/server branch November 20, 2025 21:30

karasikov merged 19 commits into master from mk/server
