Hello @all,
I'm currently getting into details about PDF. My current problem is, that I have a PDF
with exactly one font. This font is CMAP font with ToUnicode encoding.
I've got the CMAP table - all in all 0xb7 entries mapping from the hex-string in (...)Tj
to the appropriate Unicode.
Example:
0 : 0xfffd
1 : "A" (as a Unicode)
2 : "B" (as a Unicode)
..
.
The uncompressed (...)Tj has than the form e.g. 0x00 0x01 0x00 0x02 0x00 0x02 - reading "ABB"
Everything seems fine .... until the code 0x00 0x5c 0x62 comes up. I really don't know how to handle
these "backslash" codes. What has to be done in such case. Trying to map just the 0x62 to Unicode
doesnt' give the correct character.
Please help!
Thank you!!