Quantcast
Channel: Adobe Community : Popular Discussions - PDF Language and Specifications
Viewing all articles
Browse latest Browse all 46145

Pdf text extract problem with CID font and Identity-H

$
0
0

Hi all,

 

Iam facing some big problem with text extraction from pdf file.

Currently iam using congviews pdf2xl text extraction tool.

About 95% of the text extract correcly but few charaters showing box some ? and some dotted circle mark.

 

Font Used:

 

ArialUnicodeMS(Embedded Subset)

Type:(True Type (CID)

Encoding:Identity-H

 

TimesNewRomanPSMT

Type:True Type

Sample.jpgEncoding:ANSI

ActualFont:TimesNewRomanPSMT

ActualFontType:TrueType

 

Anyone please help me to overcome this.

 

 

Regards

Gilbert.X


Viewing all articles
Browse latest Browse all 46145

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>