-
Strings and patterns may contain characters that have an ordinal
value larger than 255.
-
Identifiers within a Perl program may contain Unicode alphanumeric
characters.
-
Regular expressions match characters and not bytes.
-
Character classes in regular expressions match characters and not
bytes.
-
Named Unicode properties and block ranges may be used as character
classes with the \p and \P
constructs.
-
\X matches any extended Unicode sequence.
-
tr// matches characters instead of bytes.
-
Case translation operators use the Unicode case translation tables
when provided character input.
-
Most operators that deal with positions or lengths in a string switch
to using character positions.
-
pack( ) and unpack( ) do not
change.
-
Bit operators work on characters.
-
scalar reverse( ) reverses characters and not
bytes.