For some time I've noticed that PDF files generated by PrinceXML that contain Chinese characters don't play perfectly well with Acrobat XI (the only version I have). That is, the text copied from Acrobat is garbled (see the attachment), but other viewers like SumatraPDF and Foxit Reader work fine.
Not all Chinese fonts have this issue. A specific Chinese font may work well in one CSS setting and fail in another, which is quite annoying. Some Chinese fonts (e.g. Microsoft Yahei) are stabler than others.
Unfortunately I fail to produce a minimal example. A reproducible example can be found in the attachment.
zh-cn.pdf is generated by:
The "Source Han Serif CN" font is from this project. Direct download: https://github.com/Pal3love/Source-Han-TrueType/releases/download/2.004-2.001-1.002-R/SourceHanSerifCN.zip.
copied-Acrobat_XI.txt and copied-SumatraPDF_3.3.3.txt contain text copied from Acrobat XI and SumatraPDF, respectively.
Thank you very much!
Not all Chinese fonts have this issue. A specific Chinese font may work well in one CSS setting and fail in another, which is quite annoying. Some Chinese fonts (e.g. Microsoft Yahei) are stabler than others.
Unfortunately I fail to produce a minimal example. A reproducible example can be found in the attachment.
zh-cn.pdf is generated by:
prince -s foo.css http://utf8everywhere.org/zh-cn -o zh-cn.pdf
The "Source Han Serif CN" font is from this project. Direct download: https://github.com/Pal3love/Source-Han-TrueType/releases/download/2.004-2.001-1.002-R/SourceHanSerifCN.zip.
copied-Acrobat_XI.txt and copied-SumatraPDF_3.3.3.txt contain text copied from Acrobat XI and SumatraPDF, respectively.
Thank you very much!
Edited by yyang