[3.15] gh-152415: Exercise curses non-ASCII tests under 8-bit locale encodings (GH-152416) by serhiy-storchaka · Pull Request #152453 · python/cpython

serhiy-storchaka · 2026-06-27T20:09:42Z

The non-ASCII tests only exercised what the runner's locale could encode (in practice UTF-8). Add 8-bit-encoding cases to the character and string I/O tests, each guarded by the existing encodability check: ASCII, a character common to the Latin encodings ('é'), and ones distinctive to a single encoding (byte 0xA4 is '¤' in ISO-8859-1, '€' in ISO-8859-15, 'є' in KOI8-U). Run the whole suite under different locales to cover them; unrepresentable cases skip.

Read each written character back with in_wch() or instr() rather than inch(), which on a wide build returns the low byte of the code point instead of the locale-encoded byte and so mangles a non-ASCII character of an 8-bit locale. This lets the int-argument cases cover '€'/'є', and adds matching coverage for the str argument.

insch() with an int byte > 127 is checked only for Latin-1: on a wide build ncurses winsch stores a printable byte directly as a code point instead of decoding it through the locale.
(cherry picked from commit 003d362)

Issue: Extend curses tests to cover non-ASCII characters under 8-bit locales #152415

…ocale encodings (pythonGH-152416) The non-ASCII tests only exercised what the runner's locale could encode (in practice UTF-8). Add 8-bit-encoding cases to the character and string I/O tests, each guarded by the existing encodability check: ASCII, a character common to the Latin encodings ('é'), and ones distinctive to a single encoding (byte 0xA4 is '¤' in ISO-8859-1, '€' in ISO-8859-15, 'є' in KOI8-U). Run the whole suite under different locales to cover them; unrepresentable cases skip. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * pythongh-152415: Verify character output round-trips in test_output_character Read each written character back with in_wch() or instr() rather than inch(), which on a wide build returns the low byte of the code point instead of the locale-encoded byte and so mangles a non-ASCII character of an 8-bit locale. This lets the int-argument cases cover '€'/'є', and adds matching coverage for the str argument. insch() with an int byte > 127 is checked only for Latin-1: on a wide build ncurses winsch stores a printable byte directly as a code point instead of decoding it through the locale. (cherry picked from commit 003d362) Co-authored-by: Serhiy Storchaka <storchaka@gmail.com> Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

miss-islington-app · 2026-06-27T20:40:17Z

Thanks @serhiy-storchaka for the PR 🌮🎉.. I'm working now to backport this PR to: 3.13, 3.14.
🐍🍒⛏🤖

bedevere-app · 2026-06-27T20:40:39Z

GH-152456 is a backport of this pull request to the 3.14 branch.

bedevere-app · 2026-06-27T20:40:50Z

GH-152457 is a backport of this pull request to the 3.13 branch.

…encodings (GH-152416) (GH-152453) (GH-152457) The non-ASCII tests only exercised what the runner's locale could encode (in practice UTF-8). Add 8-bit-encoding cases to the character and string I/O tests, each guarded by the existing encodability check: ASCII, a character common to the Latin encodings ('é'), and ones distinctive to a single encoding (byte 0xA4 is '¤' in ISO-8859-1, '€' in ISO-8859-15, 'є' in KOI8-U). Run the whole suite under different locales to cover them; unrepresentable cases skip. * gh-152415: Verify character output round-trips in test_output_character Read each written character back with in_wch() or instr() rather than inch(), which on a wide build returns the low byte of the code point instead of the locale-encoded byte and so mangles a non-ASCII character of an 8-bit locale. This lets the int-argument cases cover '€'/'є', and adds matching coverage for the str argument. insch() with an int byte > 127 is checked only for Latin-1: on a wide build ncurses winsch stores a printable byte directly as a code point instead of decoding it through the locale. (cherry picked from commit 003d362) (cherry picked from commit a75aa41) Co-authored-by: Serhiy Storchaka <storchaka@gmail.com> Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>

…encodings (GH-152416) (GH-152453) (GH-152456) The non-ASCII tests only exercised what the runner's locale could encode (in practice UTF-8). Add 8-bit-encoding cases to the character and string I/O tests, each guarded by the existing encodability check: ASCII, a character common to the Latin encodings ('é'), and ones distinctive to a single encoding (byte 0xA4 is '¤' in ISO-8859-1, '€' in ISO-8859-15, 'є' in KOI8-U). Run the whole suite under different locales to cover them; unrepresentable cases skip. * gh-152415: Verify character output round-trips in test_output_character Read each written character back with in_wch() or instr() rather than inch(), which on a wide build returns the low byte of the code point instead of the locale-encoded byte and so mangles a non-ASCII character of an 8-bit locale. This lets the int-argument cases cover '€'/'є', and adds matching coverage for the str argument. insch() with an int byte > 127 is checked only for Latin-1: on a wide build ncurses winsch stores a printable byte directly as a code point instead of decoding it through the locale. (cherry picked from commit 003d362) (cherry picked from commit a75aa41) Co-authored-by: Serhiy Storchaka <storchaka@gmail.com> Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>

bedevere-app Bot added the tests Tests in the Lib/test dir label Jun 27, 2026

bedevere-app Bot mentioned this pull request Jun 27, 2026

Extend curses tests to cover non-ASCII characters under 8-bit locales #152415

Closed

bedevere-app Bot added the skip news label Jun 27, 2026

bedevere-app Bot mentioned this pull request Jun 27, 2026

gh-152415: Exercise curses non-ASCII tests under 8-bit locale encodings #152416

Merged

bedevere-app Bot added the awaiting core review label Jun 27, 2026

serhiy-storchaka enabled auto-merge (squash) June 27, 2026 20:10

serhiy-storchaka added needs backport to 3.13 bugs and security fixes needs backport to 3.14 bugs and security fixes labels Jun 27, 2026

serhiy-storchaka merged commit a75aa41 into python:3.15 Jun 27, 2026
59 of 60 checks passed

bedevere-app Bot removed the awaiting core review label Jun 27, 2026

bedevere-app Bot removed the needs backport to 3.14 bugs and security fixes label Jun 27, 2026

bedevere-app Bot removed the needs backport to 3.13 bugs and security fixes label Jun 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[3.15] gh-152415: Exercise curses non-ASCII tests under 8-bit locale encodings (GH-152416)#152453

[3.15] gh-152415: Exercise curses non-ASCII tests under 8-bit locale encodings (GH-152416)#152453
serhiy-storchaka merged 1 commit into
python:3.15from
serhiy-storchaka:backport-003d362-3.15

serhiy-storchaka commented Jun 27, 2026 •

edited by bedevere-app Bot

Loading

Uh oh!

Uh oh!

miss-islington-app Bot commented Jun 27, 2026

Uh oh!

bedevere-app Bot commented Jun 27, 2026

Uh oh!

bedevere-app Bot commented Jun 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Uh oh!

Conversation

serhiy-storchaka commented Jun 27, 2026 • edited by bedevere-app Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

miss-islington-app Bot commented Jun 27, 2026

Uh oh!

bedevere-app Bot commented Jun 27, 2026

Uh oh!

bedevere-app Bot commented Jun 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

serhiy-storchaka commented Jun 27, 2026 •

edited by bedevere-app Bot

Loading