edit/crates/unicode-gen
Leonard Hecker a704f4236c wip
2025-11-05 16:45:33 +01:00
..
src Move arena & helpers into their own crate 2025-11-05 16:45:32 +01:00
Cargo.toml wip 2025-11-05 16:45:33 +01:00
README.md Move arena & helpers into their own crate 2025-11-05 16:45:32 +01:00

Grapheme Table Generator

This tool processes Unicode Character Database (UCD) XML files to generate efficient, multi-stage trie lookup tables for properties relevant to terminal applications:

  • Grapheme cluster breaking rules
  • Line breaking rules (optional)
  • Character width properties

Usage

  • Download ucd.nounihan.grouped.zip
  • Run some equivalent of:
    grapheme-table-gen --lang=rust --extended --no-ambiguous --line-breaks path/to/ucd.nounihan.grouped.xml
    
  • Place the result in src/unicode/tables.rs