Quantcast
Channel: Adobe Community : Popular Discussions - PDF Language and Specifications
Viewing all articles
Browse latest Browse all 46145

Decoding (...)Tj - what todo with "backslash" in hex encoded ToUnicode indices

$
0
0

Hello @all,

 

I'm currently getting into details about PDF. My current problem is, that I have a PDF

with exactly one font. This font is CMAP font with ToUnicode encoding.

 

I've got the CMAP table - all in all 0xb7 entries mapping from the hex-string in (...)Tj

to the appropriate Unicode.

 

Example:

 

0 : 0xfffd

1 : "A" (as a Unicode)

2 : "B" (as a Unicode)

..

.

 

The uncompressed (...)Tj has than the form e.g. 0x00 0x01 0x00 0x02 0x00 0x02 - reading "ABB"

Everything seems fine .... until the code 0x00 0x5c 0x62 comes up. I really don't know how to handle

these "backslash" codes. What has to be done in such case. Trying to map just the 0x62 to Unicode

doesnt' give the correct character.

 

Please help!

 

Thank you!!


Viewing all articles
Browse latest Browse all 46145

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>