I'm having difficulty with Prince-rendered PDFs when the HTML <title> contains an 8-bit character. Here's an HTML page that displays the word "Résumé":
This converts correctly with Prince, creating a PDF that contains ""Résumé".
However, if you add an 8-bit character into the HTML title, the word Résumé gets rendered in the PDF as "Résumé".
Is this a bug? Am I doing something wrong with encodings? Thanks.
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html lang="en" dir="ltr">
<head>
<title>X</title>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
</head>
<body>
<p>Résumé</p>
</body>
</html>
This converts correctly with Prince, creating a PDF that contains ""Résumé".
$ /usr/bin/prince --input=html good.html -o good.pdf
However, if you add an 8-bit character into the HTML title, the word Résumé gets rendered in the PDF as "Résumé".
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html lang="en" dir="ltr">
<head>
<title>Xé</title>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
</head>
<body>
<p>Résumé</p>
</body>
</html>
Is this a bug? Am I doing something wrong with encodings? Thanks.