Fix Unicode-related tests

* test/lisp/international/mule-tests.el
(mule-cmds-tests--ucs-names-missing-names): Update no-name regions
of codepoints to Unicode 17.0.

* lisp/international/mule-cmds.el (ucs-names): Fix comments.

* admin/notes/unicode: Update instructions.
This commit is contained in:
Eli Zaretskii 2025-09-13 17:52:30 +03:00
parent 125b3588c9
commit fd5d35407a
3 changed files with 14 additions and 3 deletions

View file

@ -102,6 +102,14 @@ Specifically, the values of 'ucs-normalize-composition-exclusions' and
'check-range", defined at the beginning of ucs-normalize.el, should be
verified against the latest Unicode data files.
Run the ucs-names test:
make -C test lisp/international/mule-tests
If it fails, the exclusion ranges of codepoints in
'mule-cmds-tests--ucs-names-missing-names' may need to be updated to the
added Unicode codepoints.
Next, test normalization functions against NormalizationTests.txt,
in the top-level directory run:

View file

@ -3105,12 +3105,14 @@ on encoding."
(#x16100 . #x16139)
;; (#x1613A . #x167FF) unused
(#x16800 . #x16F9F)
(#x16FE0 . #x16FF1)
(#x16FE0 . #x16FF6)
;; (#x17000 . #x187FF) Tangut Ideographs
;; (#x18800 . #x18AFF) Tangut Components
;; (#x18B00 . #x18CFF) Khitan Small Script
;; (#x18D00 . #x18D1E) Tangut Ideograph Supplement
;; (#x18D80 . #x18DFF) Tangut Components
;; (#x18D1F . #x18D7F) unused
;; (#x18D80 . #x18DF2) Tangut Components
;; (#x18DF3 . #x18DFF) unused
(#x1AFF0 . #x1B122)
;; (#x1B123 . #x1B131) unused
(#x1B132 . #x1B132)

View file

@ -69,7 +69,8 @@
(dotimes (u (1+ (max-char 'ucs)))
(when-let* ((name (get-char-code-property u 'name)))
(when (and (not (<= #xD800 u #xDFFF))
(not (<= #x18800 u #x18AFF))
(not (<= #x18800 u #x18D1E))
(not (<= #x18D80 u #x18DF2))
(not (char-from-name name)))
(push (format "%X" u) code-points))))
(setq code-points (nreverse code-points))