Improve performance of query string parsing by bdraco · Pull Request #1493 · aio-libs/yarl

bdraco · 2025-04-03T06:31:26Z

What do these changes do?

When looking at the performance regression in #1492, I noticed that much of the time was spent in parse_qsl, however we do not need all the complexity of parse_sql since we use a very limited subset of the functionality.

We can write a faster version using the unquoter built-in to yarl with very little code.

Are there changes in behavior for the user?

no

Related issue number

n/a

Checklist

I think the code is well written
Unit tests for the changes exist
Documentation reflects the changes

When looking at the performance regression in #1492, I noticed that much of the time was spent in parse_qsl, however we do not need all the complexity of parse_sql since we use a very limited subset of the functionality. We can write a faster version using the unquoter built-in to yarl with very little code.

codspeed-hq · 2025-04-03T06:40:14Z

CodSpeed Performance Report

Merging #1493 will improve performances by 29.13%

_{Comparing query_string_parsing (563933c) with master (74f79ae)}

Summary

⚡ 3 improvements
✅ 98 untouched benchmarks

Benchmarks breakdown

	Benchmark	`BASE`	`HEAD`	Change
⚡	`test_parse_query_uncached[long]`	29.8 ms	25.3 ms	+17.81%
⚡	`test_parse_query_uncached[short]`	2.6 ms	2 ms	+29.13%
⚡	`test_update_query_string`	752.4 µs	621.3 µs	+21.1%

codecov · 2025-04-03T06:40:15Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.92%. Comparing base (74f79ae) to head (563933c).
Report is 44 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #1493   +/-   ##
=======================================
  Coverage   98.92%   98.92%           
=======================================
  Files          32       32           
  Lines        6055     6070   +15     
  Branches      363      365    +2     
=======================================
+ Hits         5990     6005   +15     
  Misses         62       62           
  Partials        3        3

Flag	Coverage Δ
CI-GHA	`98.92% <100.00%> (+<0.01%)`	⬆️
MyPy	`98.07% <100.00%> (+<0.01%)`	⬆️
OS-Linux	`98.78% <100.00%> (+<0.01%)`	⬆️
OS-Windows	`98.82% <100.00%> (+<0.01%)`	⬆️
OS-macOS	`98.56% <100.00%> (+<0.01%)`	⬆️
Py-3.10.11	`98.53% <100.00%> (+<0.01%)`	⬆️
Py-3.10.16	`98.74% <100.00%> (+<0.01%)`	⬆️
Py-3.11.11	`98.74% <100.00%> (+<0.01%)`	⬆️
Py-3.11.9	`98.53% <100.00%> (+<0.01%)`	⬆️
Py-3.12.9	`98.74% <100.00%> (+<0.01%)`	⬆️
Py-3.13.2	`98.74% <100.00%> (+<0.01%)`	⬆️
Py-3.9.13	`98.49% <100.00%> (+<0.01%)`	⬆️
Py-3.9.21	`98.70% <100.00%> (+<0.01%)`	⬆️
Py-pypy7.3.16	`98.69% <100.00%> (+<0.01%)`	⬆️
Py-pypy7.3.19	`98.71% <100.00%> (+<0.01%)`	⬆️
VM-macos-latest	`98.56% <100.00%> (+<0.01%)`	⬆️
VM-ubuntu-latest	`98.78% <100.00%> (+<0.01%)`	⬆️
VM-windows-latest	`98.82% <100.00%> (+<0.01%)`	⬆️
pytest	`98.78% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

asvetlov · 2025-04-04T08:45:36Z

Good idea!
I suspect all functions from urllib.parse could be replaced with faster ones.
The only big question is the compatibility; URL parsing is not a trivial thing.

bdraco · 2025-04-04T08:52:24Z

Good idea!
I suspect all functions from urllib.parse could be replaced with faster ones.
The only big question is the compatibility; URL parsing is not a trivial thing.

We are down to very few urllib.parse calls left. The one in

yarl/yarl/_quoters.py

Line 32 in f73f47e

return "".join(c if c.isprintable() else quote(c) for c in s)

could probably be replaced as well. That one is probably a bit more tricky since I'm not sure any of the quoters are directly compatible with urllib.parse.quote. However I don't expect human_quote is called frequently so I haven't tried to optimize it so much.

This reverts commit c110d6a.

changelog

7a81f86

bdraco marked this pull request as ready for review April 3, 2025 18:36

psf-chronographer bot added the bot:chronographer:provided There is a change note present in this PR label Apr 3, 2025

bdraco added 5 commits April 5, 2025 12:41

Merge branch 'master' into query_string_parsing

b4f650e

tune

cc23bab

tune

c110d6a

Revert "tune"

c795271

This reverts commit c110d6a.

remove duplicate comment

563933c

bdraco merged commit 3bf614a into master Apr 5, 2025
48 of 50 checks passed

bdraco deleted the query_string_parsing branch April 5, 2025 23:36

bdraco mentioned this pull request Apr 6, 2025

Add direct coverage for unquoter with plus=True #1497

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve performance of query string parsing#1493

Improve performance of query string parsing#1493
bdraco merged 7 commits intomasterfrom
query_string_parsing

bdraco commented Apr 3, 2025 •

edited

Loading

Uh oh!

codspeed-hq bot commented Apr 3, 2025 •

edited

Loading

Uh oh!

codecov bot commented Apr 3, 2025 •

edited

Loading

Uh oh!

asvetlov commented Apr 4, 2025

Uh oh!

bdraco commented Apr 4, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

bdraco commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What do these changes do?

Are there changes in behavior for the user?

Related issue number

Checklist

Uh oh!

codspeed-hq bot commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CodSpeed Performance Report

Merging #1493 will improve performances by 29.13%

Summary

Benchmarks breakdown

Uh oh!

codecov bot commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

asvetlov commented Apr 4, 2025

Uh oh!

bdraco commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bdraco commented Apr 3, 2025 •

edited

Loading

codspeed-hq bot commented Apr 3, 2025 •

edited

Loading

codecov bot commented Apr 3, 2025 •

edited

Loading

bdraco commented Apr 4, 2025 •

edited

Loading