I read the following in PHP Manual > Language Reference > Types: Details of the String Type:
Given that PHP does not dictate a specific encoding for strings, one might wonder how string literals are encoded. For instance, is the string "
á" equivalent to "\xE1" (ISO-8859-1), "\xC3\xA1" (UTF-8, C form), "\x61\xCC\x81" (UTF-8, D form) or any other possible representation?
What does "UTF-8, C form" and "UTF-8, D form" mean - are they two versions of UTF-8?
UTF-8 C form and UTF-8 D form are two alternate ways of encoding the same Unicode code points in UTF-8, with C form using a single code unit for characters that can be represented in ASCII, and D form using two code units for all characters. Example: