updates to locustfile by m-goggins · Pull Request #394 · CDCgov/RecordLinker

m-goggins · 2025-05-30T17:02:37Z

Description

This PR updates the locustfile for load testing to:

- Check whether the MPI has been seeded, based on user input
- Add a flag for the number of records to link, to better control the size of the load test
- Loop through the original data (1.5M records), randomly choose a record, and send it to the link endpoint, repeating until the number of records to link has been reached

Related Issues

Closes #393

Additional Notes

If you have better ideas for randomly selecting records from the original data, I'm all ears; I'm worried the current approach will make seeding artificially slow. Right now, the original_data file is too large to load into memory for random indexing, so the task loops through the records and randomly decides whether each one should be linked. Since it seems unlikely that we'd test all 1.5M records, the records near the start of the file would be tested more heavily. This probably doesn't matter for load testing, but I wanted to point it out.
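For readers without the diff open, the selection logic described above can be sketched roughly as follows. This is a minimal illustration, not the code in tests/load/locustfile.py: the function name `select_records` and the parameter names `records_to_link` and `link_probability` are hypothetical stand-ins for the flags mentioned in this PR.

```python
import random
from typing import Iterable, Iterator, Optional


def select_records(
    records: Iterable[dict],
    records_to_link: int,
    link_probability: float = 0.5,
    rng: Optional[random.Random] = None,
) -> Iterator[dict]:
    """Stream through records, yielding each one with probability link_probability.

    Hypothetical helper: stops once records_to_link records have been yielded,
    so nothing beyond the current record is ever held in memory.
    """
    rng = rng or random.Random()
    selected = 0
    for record in records:
        if selected >= records_to_link:
            return  # limit reached; the rest of the stream is never read
        if rng.random() < link_probability:
            selected += 1
            yield record


# Usage: pick 10 records from a lazily generated stream of 1,000.
picked = list(select_records(({"id": i} for i in range(1000)), records_to_link=10))
```

Because the loop stops at the limit, the selected records always come from the front of the file, which is the bias noted above.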

The other option I considered (but ruled out) was to build the list of records to link during the seeding process: when looping through clusters_iter in the on_start section, you could also randomize whether a cluster gets added to a record_to_link list. However, we'd likely run into the same issue of the list being too large to hold in memory. We could write the list to a file and then read from it in the link task, but that would only remove line 85 (if random.random() < 0.5).
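One alternative that is not part of this PR, offered only as a sketch: reservoir sampling (Algorithm R) draws a uniform sample of k records in a single pass while holding only k records in memory. The function name `reservoir_sample` is hypothetical, and the sketch assumes the number of records to link is small enough to keep in memory.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(
    stream: Iterable[T], k: int, rng: Optional[random.Random] = None
) -> List[T]:
    """Return up to k items chosen uniformly at random from stream (Algorithm R)."""
    rng = rng or random.Random()
    reservoir: List[T] = []
    for i, item in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Item i replaces a random slot with probability k / (i + 1).
            j = rng.randint(0, i)  # randint is inclusive on both ends
            if j < k:
                reservoir[j] = item
    return reservoir


# Usage: sample 5 items uniformly from a stream of 10,000.
sample = reservoir_sample(range(10_000), k=5)
```

The trade-off is that this reads the whole file once before any record is linked, so it removes the bias toward early records but not the cost of scanning all 1.5M rows.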

Curious to hear other ideas!

codecov · 2025-05-30T17:05:30Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.51%. Comparing base (4ee75ca) to head (3a8d048).
Report is 1 commit behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #394   +/-   ##
=======================================
  Coverage   98.51%   98.51%           
=======================================
  Files          33       33           
  Lines        1948     1948           
=======================================
  Hits         1919     1919           
  Misses         29       29

ericbuckley

Looking good, you can just pin mypy to 1.15.0 in pyproject.toml to get around the type checking issue.

m-goggins added 2 commits May 30, 2025 09:49

updates to locustfile

4e8701a

update file

8014f7e

m-goggins added 2 commits May 30, 2025 10:28

try with diff version mypy

7a1a484

go back go back

62b9c42

m-goggins marked this pull request as ready for review May 30, 2025 17:37

m-goggins requested review from bamader, ericbuckley and johanna-skylight as code owners May 30, 2025 17:37

ericbuckley reviewed May 30, 2025

Comment thread pyproject.toml

Comment thread tests/load/locustfile.py Outdated

Comment thread tests/load/locustfile.py Outdated

Comment thread tests/load/locustfile.py Outdated

m-goggins added 2 commits June 2, 2025 08:51

pin mypy to 1.15.0

bbdc574

add argument for probability to link

8a2b093

m-goggins requested a review from ericbuckley June 2, 2025 16:05

ericbuckley reviewed Jun 2, 2025

Comment thread tests/load/locustfile.py Outdated

ericbuckley approved these changes Jun 2, 2025

m-goggins and others added 10 commits June 2, 2025 13:44

remove reference to seld.seeded

c045f58

ensure that locust task stops when hitting record_to_link

1cbce12

fixes for pytest 8.4.0

4231480

clean up

81db14f

remove lru_cache in favor of fixture session caching

724e174

update linking tests to use a copy of the default algorithm before modifying

61cbded

pinning mypy to 1.15.0

8b5c7f2

ignore type checker error on different return type

cca82e0

Merge branch 'fix/caching-default-algo-fixture' of https://github.com/CDCgov/RecordLinker into local-load-testing

9358265

Merge branch 'main' of https://github.com/CDCgov/RecordLinker into local-load-testing

3a8d048

m-goggins merged commit f6ef4bf into main Jun 3, 2025
15 checks passed

m-goggins deleted the local-load-testing branch June 3, 2025 16:57

m-goggins merged 16 commits into main from local-load-testing

2 participants
