API support for count/coord indexes, parallel graph search, and intermediate query representation by thomastzhou · Pull Request #358 · ratschlab/metagraph

thomastzhou · 2021-11-29T18:49:45Z

New features:

Support for --count-kmers and --query-coords in API and server.
New default coordinate behaviour: runs of consecutive coordinates are collapsed into a range. All coordinates can be shown using the --verbose-coords query flag.
API-level parallel requests with MultiGraphClient.
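
As a rough illustration of the new default coordinate behaviour, here is a minimal Python sketch that collapses runs of consecutive coordinates into ranges. The function names `collapse_runs` and `format_runs` and the `start-end` string format are assumptions made for this example; they are not taken from the PR's code.

```python
def collapse_runs(coords):
    """Collapse integer coordinates into (start, end) runs.

    A run is extended only while each coordinate is exactly one greater
    than the previous one; anything else starts a new run.
    """
    runs = []
    for c in coords:
        if runs and c == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], c)   # extend the current run
        else:
            runs.append((c, c))           # start a new run
    return runs


def format_runs(runs):
    """Render runs as 'start-end', or as a single number for length-1 runs."""
    return [str(s) if s == e else f"{s}-{e}" for s, e in runs]


print(format_runs(collapse_runs([3, 4, 5, 9, 10, 20])))
```

Showing every coordinate, as the verbose flag does, would simply skip the collapsing step.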

The backend has been rewritten to use an intermediate query representation instead of always returning string output (which the server then had to re-parse into JSON). This also allows for more flexible server JSON responses.
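
The idea of an intermediate representation can be sketched in Python as follows: results are kept as structured objects, and the text and JSON outputs are both rendered from them, so nothing has to be parsed back out of a string. The class names, field names, and output layouts below are all hypothetical and only illustrate the design; the actual backend is C++ and may be structured differently.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class LabelMatch:
    # Hypothetical fields for one matched label.
    label: str
    kmer_count: int
    coords: list = field(default_factory=list)


@dataclass
class SeqResult:
    # Hypothetical container for the result of one query sequence.
    seq_id: str
    matches: list


def to_text(result):
    """Render a tab-separated line per matched label (CLI-style output)."""
    lines = []
    for m in result.matches:
        parts = [result.seq_id, m.label, str(m.kmer_count)]
        if m.coords:
            parts.append(":".join(m.coords))
        lines.append("\t".join(parts))
    return "\n".join(lines)


def to_json(result):
    """Render the same structure as JSON (server-style output)."""
    return json.dumps(asdict(result), sort_keys=True)


r = SeqResult("q1", [LabelMatch("sampleA", 3, ["0-2"])])
print(to_text(r))
print(to_json(r))
```

Adding a field to the JSON response then only touches the structure and its JSON renderer, not the text format.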

thomastzhou · 2021-11-29T18:54:24Z

I believe these failed checks are related to AppleClang 13 and not to the changes: this is the first commit that has been checked with 13 (previous ones used 12_4).

On macOS Monterey with AppleClang 13, even the master branch fails to build, so I built these with g++ 11 instead.

karasikov · 2021-11-30T00:49:06Z

> I believe these failed checks are related to AppleClang 13 and not to the changes: this is the first commit that has been checked with 13 (previous ones used 12_4).
>
> On macOS Monterey with AppleClang 13, even the master branch fails to build, so I built these with g++ 11 instead.

Feel free to adapt main.yml

thomastzhou · 2021-11-30T10:52:41Z

I've opened issue #359 and outlined a way to work around this in the workflow. It depends on whether you want to support AppleClang 13 now or point users to another compiler for the time being.

karasikov

Please apply similar stylistic changes in the rest of the PR.

metagraph/src/cli/query.hpp

metagraph/src/cli/query.cpp

metagraph/src/cli/query.hpp

karasikov · 2021-12-05T18:27:10Z

> I've opened issue #359 and outlined a way to work around this in the workflow. It depends on whether you want to support AppleClang 13 now or point users to another compiler for the time being.

I fixed that. Please merge master into this branch to pull the fix.

karasikov

The diff still shows changes that have already been merged to master, which means the commits from master haven't been merged into this branch. Can you do that?

git fetch
git checkout tz/count_api
git merge origin/master

metagraph/src/cli/config/config.cpp

metagraph/api/python/metagraph/client.py

karasikov · 2021-12-10T14:50:27Z

metagraph/src/cli/query.cpp

+    for (const auto &coords : tuples) {
+        if (coords.empty()) continue;
+
+        // TODO: Look for example where row_tuples are multiple values?


What do you mean?

oh, never mind. I see what you mean...

@hmusta What format would you suggest for printing them succinctly?
(Suppose we have a single sequence with repeats, so each k-mer may have multiple coordinates.)
Maybe <position in query>-<start_coord>-<end_coord>:..., or do you know of other tools that output something similar?

You can think of the case where the indexed sequence is AAA...A and the query is AAA.
So essentially every k-mer gets all the coordinates.

The current implementation will only print one coordinate range, which is incorrect. So this should be fixed. But we should agree on the desired output format first.
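
One possible way to handle the repeat case is sketched below in Python: every coordinate of every k-mer either extends a run that ended one coordinate earlier on the previous k-mer or starts a new run, and each run is reported as a (position in query, start_coord, end_coord) triple, following the format floated above. The function name and the choice of output triples are assumptions for illustration; this is not the fix that was eventually committed.

```python
def collapse_repeat_coords(tuples):
    """tuples[i] holds all coordinates of the i-th query k-mer.

    Returns sorted (query_pos, start_coord, end_coord) triples, one per
    maximal run in which the query position and the coordinate both
    advance by one from k-mer to k-mer.
    """
    finished = []
    open_runs = {}  # last coordinate of a run -> (query_pos, start_coord)
    for i, coords in enumerate(tuples):
        next_open = {}
        for c in set(coords):
            # Extend the run that ended at c - 1 on the previous k-mer,
            # or start a new run at this query position.
            next_open[c] = open_runs.pop(c - 1, (i, c))
        # Runs that this k-mer did not extend are complete.
        finished.extend((q, s, e) for e, (q, s) in open_runs.items())
        open_runs = next_open
    finished.extend((q, s, e) for e, (q, s) in open_runs.items())
    return sorted(finished)


# Indexed sequence AAAAA with k = 3 has the k-mer AAA at coordinates 0, 1, 2.
# The query AAAA consists of two k-mers, and each one gets all three coordinates.
print(collapse_repeat_coords([[0, 1, 2], [0, 1, 2]]))
```

For this example, the sketch reports four runs rather than a single range, which is the behaviour the comment above asks for.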

- Made coord/count flags client options instead of default
- Error thrown if coord/count flags tried with unsupported queries
- Empty alignment attempts default to empty result for now (#369)
- Integration tests for coord/count API queries

karasikov · 2022-01-08T13:32:44Z

metagraph/api/python/metagraph/client.py

+        executor = ThreadPoolExecutor(max_workers=num_processes)
+
+        # Populate async results dict with concurrent.futures.Future instances
        for name, graph_client in self.graphs.items():
-            # TODO: do this async
-            result[name] = graph_client.align(sequence, min_exact_match,
-                                              max_alternative_alignments,
-                                              max_num_nodes_per_seq_char)
+            futures[name] = executor.submit(graph_client.align, sequence, min_exact_match,
+                                            max_alternative_alignments,
+                                            max_num_nodes_per_seq_char)

-        return result
+        print(f'Made {len(self.graphs)} requests with {num_processes} threads...')
+
+        # Shutdown executor but do not stop futures
+        executor.shutdown(wait=False)
+        return futures


Do we really benefit from returning Futures instead of just always waiting for everything to finish and returning the actual dictionary?
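
To make the trade-off in this question concrete, here is a small self-contained Python sketch of both options using `concurrent.futures`. `fake_align` is a hypothetical stand-in for a per-graph request such as `graph_client.align`, and the two wrapper functions are illustrative names, not part of the client.

```python
from concurrent.futures import ThreadPoolExecutor


def fake_align(name, sequence):
    # Hypothetical stand-in for a request sent to one graph server.
    return f"{name}:{sequence[::-1]}"


def align_futures(graph_names, sequence, num_threads=2):
    """Submit one request per graph and return the Futures immediately."""
    executor = ThreadPoolExecutor(max_workers=num_threads)
    futures = {name: executor.submit(fake_align, name, sequence)
               for name in graph_names}
    # Tasks that were already submitted keep running; no new ones are accepted.
    executor.shutdown(wait=False)
    return futures


def align_blocking(graph_names, sequence, num_threads=2):
    """Send the same requests, wait for all of them, return plain results."""
    with ThreadPoolExecutor(max_workers=num_threads) as executor:
        futures = {name: executor.submit(fake_align, name, sequence)
                   for name in graph_names}
        return {name: f.result() for name, f in futures.items()}


futures = align_futures(["g1", "g2"], "ACGT")
print({name: f.result() for name, f in futures.items()})
print(align_blocking(["g1", "g2"], "ACGT"))
```

Returning Futures lets the caller start processing the first finished graph while others are still running, at the cost of every caller having to call `.result()`; the blocking variant keeps the return type identical to the single-threaded version.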

thomastzhou added 3 commits November 16, 2021 09:00

Rewrote query code to use intermediate representation and --expand-coords flag

74850fa

Change casts to be compatible with older compilers

f3364f4

Made format more consistent and fixed last bugs, reverted label count

5948d48

thomastzhou changed the title ~~Tz/count api~~ API support for count/coord indexes and intermediate query representation Nov 30, 2021

karasikov requested changes Dec 2, 2021

thomastzhou added 5 commits December 6, 2021 22:09

Rewrote query code to use intermediate representation and --expand-coords flag

c6cb823

Change casts to be compatible with older compilers

64bbe26

Made format more consistent and fixed last bugs, reverted label count

cd7437c

Rebase and requested style changes

d8db656

Merge branch 'tz/count_api' of https://github.com/ratschlab/metagraph into tz/count_api

73739a9

karasikov requested changes Dec 9, 2021

metagraph/src/cli/config/config.cpp (outdated)

karasikov added 2 commits December 10, 2021 13:09

cleanup

1b5c483

cleanup

cf32901

karasikov reviewed Dec 10, 2021

metagraph/api/python/metagraph/client.py

karasikov reviewed Dec 10, 2021

karasikov and others added 11 commits December 10, 2021 17:42

fixed collapsing of coordinates with repeats

eba16bd

Merge remote-tracking branch 'origin/master' into tz/count_api

a74eba3

cleanup

9d4c004

Search with alignment limit alternative alignments to 1

dd2763e

Parallel multi graph requests using thread pool

7eed2fc

Restore coding comment

bad7959

Make parallel graph search return Futures

088c24e

Ensure regular API integration tests do not use parallel search

49d2b5b

Integration tests for parallel search API client

72dd829

Count API integration test fix

1928421

thomastzhou added 4 commits December 21, 2021 11:31

Count API integration test fix

1928421

Merge branch 'tz/async_client' into tz/count_api

c1ad726

minor

0d327ec

Merge remote-tracking branch 'origin/master' into tz/count_api

56d9d33

thomastzhou changed the title ~~API support for count/coord indexes and intermediate query representation~~ API support for count/coord indexes, parallel graph search, and intermediate query representation Dec 21, 2021

thomastzhou and others added 4 commits December 21, 2021 14:29

Collapsed coord ranges test example (WIP)

4e3c4c4

Minor

0b0ccd2

fix

1e39042

collapse_coords tests and brief docs

e7a45b1

karasikov reviewed Jan 8, 2022

karasikov added 2 commits January 8, 2022 15:19

pass

fd154d6

minor

2cbe571

karasikov merged commit a74911c into master Jan 9, 2022

karasikov deleted the tz/count_api branch January 9, 2022 13:41
