ext/mbstring: Update to Unicode 16 #15898

Ayesh · 2024-09-15T19:03:52Z

Updates UCD to Unicode 16.0 (released September 2024).

Previously: 0fdffc1, #7502, #14680

Unicode 16 adds several new scripts and case folding rules. However, the existing ucgendat script can still parse the updated data files.

This also adds a couple of test cases to make sure the new rules for East Asian Wide characters and case folding work correctly. These tests fail on Unicode 15.1 and older because those versions do not contain those rules.
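To illustrate why such tests depend on the Unicode data version, here is a minimal Python sketch that reports the East Asian Width class and case folding behaviour of two characters using whatever Unicode data the running interpreter ships. It is not the PR's test code: the `probe` helper is hypothetical, and the two code points (a Yijing hexagram symbol and a Garay capital letter) are assumptions chosen as plausible examples of characters affected by Unicode 16, not necessarily the ones used in the mbstring tests.

```python
import unicodedata

# Illustrative code points (assumptions, not taken from the PR's tests).
HEXAGRAM = "\u4dc0"            # a Yijing hexagram symbol
GARAY_CAPITAL = "\U00010d50"   # a capital letter in the Garay script


def probe(ch: str) -> dict:
    """Report width class and case folding for one character,
    according to the interpreter's bundled Unicode data."""
    return {
        "codepoint": f"U+{ord(ch):04X}",
        "east_asian_width": unicodedata.east_asian_width(ch),
        # True when case folding maps the character to something else.
        "casefold_changes": ch.casefold() != ch,
    }


if __name__ == "__main__":
    # The results for the two characters depend on this version.
    print("Unicode data version:", unicodedata.unidata_version)
    for ch in (HEXAGRAM, GARAY_CAPITAL):
        print(probe(ch))
```

Running the same script against interpreters built with different Unicode data versions should show different results for these characters, which is the same reason the new mbstring tests are expected to fail on Unicode 15.1 and older.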

alexdowad · 2024-09-15T20:22:40Z

Thanks very much!
Any comment from @youkidearitai?

youkidearitai

I looked over the Unicode 16.0 changes. Looks good to me.
Thank you very much.

Ayesh · 2024-09-16T03:47:47Z

Thank you for approving this @alexdowad @youkidearitai.
I adjusted our notes in the UPGRADING and NEWS files as well.

alexdowad · 2024-09-16T03:59:39Z

@youkidearitai Shall I merge?

youkidearitai · 2024-09-16T04:05:00Z

@alexdowad Yes, please!

youkidearitai · 2024-09-17T01:08:20Z

@alexdowad Did you find any problems? If not, shall I merge it instead?

alexdowad · 2024-09-17T01:26:12Z

Very sorry, I just got occupied with other things and didn't complete this task.

alexdowad · 2024-09-17T01:26:41Z

CI failure for WINDOWS_X64_ZTS is spurious.

alexdowad · 2024-09-17T01:32:29Z

Just to make very sure everything is OK, I downloaded the UCD files for Unicode 16.0.0 and re-ran ucgendat.php. Same results as this PR.

alexdowad · 2024-09-17T01:39:31Z

This is really odd... when I fetch this commit and merge it into master locally, I don't see the added entry in NEWS. 😕 Fixing that up manually...

Ayesh · 2024-09-17T01:39:56Z

Thank you @alexdowad. You are right, the changes are purely the result of running the script. We had some issues with the script for Unicode 15.1, but 16.0 had no problems.

alexdowad · 2024-09-17T01:41:24Z

Thanks very much, @Ayesh... this is now landed on master.

Ayesh · 2024-09-17T01:41:47Z

Thank you @youkidearitai @alexdowad 🙏.

Ayesh requested review from alexdowad and youkidearitai as code owners September 15, 2024 19:03

github-actions bot added the Extension: mbstring label Sep 15, 2024

youkidearitai approved these changes Sep 16, 2024

Ayesh force-pushed the unicode-16 branch from 3c6a957 to bdc19e8 September 16, 2024 03:46

alexdowad closed this Sep 17, 2024

Ayesh deleted the unicode-16 branch September 17, 2024 01:42