String
From GNUpdf
[edit]
Text strings
The text string type is used to store encoded text using one of the following CCS:
| CCS | description |
|---|---|
| PDFDocEncoding | Unibyte encoding of a character set defined in the PDF standard. The character set contain the ISO Latin 1 characters. |
| UTF-16BE | Multibyte encoding of the unicode/ISO-10646 character set. A big-endian UTF16 mark precedes the data in the encoded string (U+FEFF or 254d, 255d). |
[edit]
ASCII strings
An ASCII string contain text encoded using a unibyte encoding of the ASCII (ISO 646) character set. Byte strings
Byte strings are string objects whose content is interpreted as a sequence of unsigned chars (octects). The values encoded in the octects may not be characters.
If the values encoded in the octects are in fact characters, the used encoding is unknown.



