Add stats for time spent fetching data while searching snapshots #51866
Conversation
This commit builds on elastic#51637, adding tracking of the total time spent fetching data from the blob store and a linear regression model for these fetches.
Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)
I'm not sure the linear regression is actually useful here. Since we're tracking stats on a file-by-file basis, essentially every fetch will be the same size, which stops us from building a meaningful model of the fetch time as a function of size. Maybe we just want to track the total took time? Possibly min and max too?
I tend to agree with you; unless we implement different range sizes per file (which we could do easily) I'm not sure the linear regression is very useful. Maybe we could reuse the existing counters and track the total/min/max took times as you suggested.
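A minimal sketch of the kind of counter being discussed, tracking the count plus total/min/max took times in nanoseconds. The class name `TimedCounter` and its exact shape are illustrative assumptions, not the PR's actual implementation; `LongAdder`/`LongAccumulator` are used so updates stay cheap under concurrent fetches:

```java
import java.util.concurrent.atomic.LongAccumulator;
import java.util.concurrent.atomic.LongAdder;

// Hypothetical sketch: a counter that records how many fetches happened
// and the total/min/max time they took, safe for concurrent updates.
public final class TimedCounter {
    private final LongAdder count = new LongAdder();
    private final LongAdder totalNanos = new LongAdder();
    private final LongAccumulator minNanos = new LongAccumulator(Math::min, Long.MAX_VALUE);
    private final LongAccumulator maxNanos = new LongAccumulator(Math::max, Long.MIN_VALUE);

    public void add(long durationNanos) {
        count.increment();
        totalNanos.add(durationNanos);
        minNanos.accumulate(durationNanos);
        maxNanos.accumulate(durationNanos);
    }

    public long count()      { return count.sum(); }
    public long totalNanos() { return totalNanos.sum(); }
    public long minNanos()   { return minNanos.get(); }
    public long maxNanos()   { return maxNanos.get(); }
}
```

Compared with a linear-regression model, this keeps only aggregate statistics, which is enough when every fetch is roughly the same size.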
DaveCTurner
left a comment
Updated following discussions on other channels; this is ready for a proper review now.
  public Map<String, DirectoryFactory> getDirectoryFactories() {
      return Map.of(SearchableSnapshotRepository.SNAPSHOT_DIRECTORY_FACTORY_KEY,
-         SearchableSnapshotRepository.newDirectoryFactory(repositoriesService::get, cacheService::get));
+         SearchableSnapshotRepository.newDirectoryFactory(repositoriesService::get, cacheService::get, System::nanoTime));
Using System::nanoTime since we need finer resolution than ThreadPool::relativeTimeInNanos offers.
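Passing `System::nanoTime` into the factory rather than calling it directly lets tests substitute a deterministic clock. A hedged sketch of that pattern (the `FetchTimer` class is a hypothetical illustration, not code from the PR):

```java
import java.util.function.LongSupplier;

// Hypothetical sketch: timing an operation with an injected nanosecond clock.
// In production the clock would be System::nanoTime; tests can pass a fake.
final class FetchTimer {
    private final LongSupplier nanoClock;

    FetchTimer(LongSupplier nanoClock) {
        this.nanoClock = nanoClock;
    }

    // Runs the fetch and returns its duration in nanoseconds.
    long time(Runnable fetch) {
        final long startNanos = nanoClock.getAsLong();
        fetch.run();
        return nanoClock.getAsLong() - startNanos;
    }
}
```

With a fake clock that returns scripted values, the measured duration becomes fully deterministic, which is hard to achieve when `System.nanoTime()` is called directly inside the code under test.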
This commit builds on #51637, adding tracking of the total time spent fetching
data from the blob store.
Relates #50999.