Methods
Constants
UNICODE_VERSION | = | RbConfig::CONFIG["UNICODE_VERSION"] |
The |
Instance Public methods
compose(codepoints) Link
Compose decomposed characters to the composed form.
decompose(type, codepoints) Link
Decompose composed characters to the decomposed form.
default_normalization_form() Link
default_normalization_form=(_) Link
tidy_bytes(string, force = false) Link
Replaces all ISO-8859-1 or CP1252 characters by their UTF-8 equivalent resulting in a valid UTF-8 string.
Passing true
will forcibly tidy all bytes, assuming that the string's encoding is entirely CP1252 or ISO-8859-1.