gh-125346: Fix decoding with non-standard Base64 alphabet by serhiy-storchaka · Pull Request #141128 · python/cpython

serhiy-storchaka · 2025-11-06T10:07:19Z

The "+" and "/" characters are no longer recognized as the part of the Base64 alphabet in base64.urlsafe_b64decode() and base64.b64decode() the altchars argument that does not contain them.

Issue: Base64 decode with altchars still accepts non-altchars #125346

The "+" and "/" characters are no longer recognized as the part of the Base64 alphabet in base64.urlsafe_b64decode() and base64.b64decode() the altchars argument that does not contain them.

sethmlarson · 2025-11-06T14:26:32Z

@serhiy-storchaka Thanks for this, can you link this PR to either the original issue or a new issue for tracking purposes.

sethmlarson

I'm really concerned about the subtle breakages that this change could cause, especially because the default behavior is to throw away characters that aren't in the current alphabet. If the default behavior was to raise an error I would feel better about this change.

Makes me wonder if we should target validate=True with this behavior change (because IMO, the silent dropping of invalid characters is in itself a concerning behavior) and then long-term move to having validate be enabled by default?

Lib/test/test_base64.py

serhiy-storchaka

This worries me too. We can keep the old behavior but emit a warning if characters + or / occur in Base64 data with the alternative alphabet.

But urlsafe_b64decode() does not have the validate parameter.

Lib/test/test_base64.py

sethmlarson · 2025-11-07T14:51:50Z

But urlsafe_b64decode() does not have the validate parameter.

I wonder if for urlsafe_b64decode() it is okay to error out on bad characters as the function name is more clear that this is for a specific base64 alphabet?

serhiy-storchaka · 2025-11-07T17:19:31Z

But is not passing altchars to b64decode() also makes it clear that it uses a different alphabet?

sethmlarson · 2025-11-12T20:11:07Z

@serhiy-storchaka Yes maybe you're right that using altchars is enough that users should be expecting the alphabet to completely shift. I did some poking around with libraries that use altchars with b64decode() and found comments that confirm your thinking. Maybe that's a vote for moving to always erroring in case a non-alphabet value is supplied?

serhiy-storchaka · 2025-11-22T17:07:35Z

If this is only for 3.15, then making validate=True by default will make the code more reliable by default.

This reverts commit db32b32.

…ternative alphabet is used (pythonGH-141128) Emit a warning in base64.urlsafe_b64decode() and base64.b64decode() when the "+" or "/" characters occur in the Base64 data with alternative alphabet if they are not the part of the alternative alphabet. It is a DeprecationWarning in the strict mode (will be error) and a FutureWarning in non-strict mode (will be ignored).

mayeut · 2026-03-14T08:51:58Z

Sorry to comment on a merged pull request but I'd rather ask the question here before opening an issue:

The altchars len validation moved from a simple assert to a ValueError. While I think it makes sense it's not being documented as being changed. Should it be ?
This also makes the check in the encode function to now behave differently (still the simple assert) which lacks consistency:

cpython/Lib/base64.py

Line 61 in 77c06f3

assert len(altchars) == 2, repr(altchars)

Does either of this concerns make sense ? If so I'll open a new issue.

gpshead · 2026-03-15T06:10:47Z

feel free to open a new PR referencing the same issue for little cleanup followups like that. We don't need to document ValueError or document that one as that is a normal error response to an API not being used quite right.

pythongh-141061: Fix decoding with non-standard Base64 alphabet

591127b

The "+" and "/" characters are no longer recognized as the part of the Base64 alphabet in base64.urlsafe_b64decode() and base64.b64decode() the altchars argument that does not contain them.

serhiy-storchaka requested a review from sethmlarson November 6, 2025 10:07

serhiy-storchaka added needs backport to 3.13 bugs and security fixes needs backport to 3.14 bugs and security fixes labels Nov 6, 2025

bedevere-app bot added the awaiting core review label Nov 6, 2025

sethmlarson reviewed Nov 6, 2025

View reviewed changes

Lib/test/test_base64.py Outdated Show resolved Hide resolved

Fix the issue number.

5c1e8d4

serhiy-storchaka changed the title ~~gh-141061: Fix decoding with non-standard Base64 alphabet~~ gh-125346: Fix decoding with non-standard Base64 alphabet Nov 6, 2025

bedevere-app bot mentioned this pull request Nov 6, 2025

Base64 decode with altchars still accepts non-altchars #125346

Closed

serhiy-storchaka added 3 commits November 6, 2025 17:47

Remove unrelated changes.

b0d5877

Merge branch 'main' into b64decode-altchars

15b39c5

Only emit a warning if validate=False.

414e4ac

serhiy-storchaka commented Nov 6, 2025

View reviewed changes

Lib/test/test_base64.py Outdated Show resolved Hide resolved

serhiy-storchaka marked this pull request as draft November 7, 2025 08:20

bedevere-app bot removed the awaiting core review label Nov 7, 2025

serhiy-storchaka added 3 commits November 13, 2025 23:42

Merge branch 'main' into b64decode-altchars

f5d1932

Merge branch 'main' into b64decode-altchars

0c8fe51

Make validate=True by default in base64.b64decode().

db32b32

serhiy-storchaka removed needs backport to 3.13 bugs and security fixes needs backport to 3.14 bugs and security fixes labels Nov 22, 2025

serhiy-storchaka marked this pull request as ready for review November 22, 2025 17:07

serhiy-storchaka requested a review from AA-Turner as a code owner November 22, 2025 17:07

bedevere-app bot added the awaiting core review label Nov 22, 2025

Merge branch 'main' into b64decode-altchars

2b75653

serhiy-storchaka added 2 commits January 18, 2026 11:12

Revert "Make validate=True by default in base64.b64decode()."

e9db343

This reverts commit db32b32.

Always only emit a warning.

220fc4e

sethmlarson approved these changes Jan 20, 2026

View reviewed changes

sethmlarson requested review from gpshead and zooba January 20, 2026 22:06

serhiy-storchaka merged commit 9060b4a into python:main Jan 21, 2026
47 checks passed

bedevere-app bot removed the awaiting core review label Jan 21, 2026

serhiy-storchaka deleted the b64decode-altchars branch January 21, 2026 07:42

Viicos mentioned this pull request Feb 6, 2026

test_base64url[Base64UrlBytes-bytes-alphabet-vanilla] throws a warning with Python 3.15 pydantic/pydantic#12778

Closed

1 task

cdce8p mentioned this pull request Mar 31, 2026

testAllBase64Features_librt fails with Python 3.14.4 python/mypy#21120

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gh-125346: Fix decoding with non-standard Base64 alphabet#141128

gh-125346: Fix decoding with non-standard Base64 alphabet#141128
serhiy-storchaka merged 11 commits intopython:mainfrom
serhiy-storchaka:b64decode-altchars

serhiy-storchaka commented Nov 6, 2025 •

edited by bedevere-app bot

Loading

Uh oh!

sethmlarson commented Nov 6, 2025

Uh oh!

sethmlarson left a comment

Uh oh!

Uh oh!

serhiy-storchaka left a comment

Uh oh!

Uh oh!

sethmlarson commented Nov 7, 2025

Uh oh!

serhiy-storchaka commented Nov 7, 2025

Uh oh!

sethmlarson commented Nov 12, 2025

Uh oh!

serhiy-storchaka commented Nov 22, 2025

Uh oh!

Uh oh!

mayeut commented Mar 14, 2026

Uh oh!

gpshead commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

serhiy-storchaka commented Nov 6, 2025 • edited by bedevere-app bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sethmlarson commented Nov 6, 2025

Uh oh!

sethmlarson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

serhiy-storchaka left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sethmlarson commented Nov 7, 2025

Uh oh!

serhiy-storchaka commented Nov 7, 2025

Uh oh!

sethmlarson commented Nov 12, 2025

Uh oh!

serhiy-storchaka commented Nov 22, 2025

Uh oh!

Uh oh!

mayeut commented Mar 14, 2026

Uh oh!

gpshead commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

serhiy-storchaka commented Nov 6, 2025 •

edited by bedevere-app bot

Loading