Character entities are a method of including arbitrary characters in XML documents by referencing their UNICODE number rather than writing them directly.
This can be convenient when you wish to include characters such as smart quotes, symbols or mathematical operators that do not exist in the character encoding that you are using for your XML documents (such as ASCII or ISO-8859-1 "Latin-1").
Here is an example of including smart quotes in an XML document using character entities:
XML
<p>
These are “smart quotes” to be precise.
</p>
Output
The tables below are a quick reference for some commonly requested UNICODE characters; for everything else please consult the UNICODE code charts.
À | À | A grave |
Á | Á | A acute |
 |  | A circumflex |
à | à | A tilde |
Ä | Ä | A dieresis |
Å | Å | A ring |
Ç | Ç | C cedilla |
È | È | E grave |
É | É | E acute |
Ê | Ê | E circumflex |
Ë | Ë | E dieresis |
Ì | Ì | I grave |
Í | Í | I acute |
Î | Î | I circumflex |
Ï | Ï | I dieresis |
Ñ | Ñ | N tilde |
Ò | Ò | O grave |
Ó | Ó | O acute |
Ô | Ô | O circumflex |
Õ | Õ | O tilde |
Ö | Ö | O dieresis |
Ø | Ø | O slash |
Ù | Ù | U grave |
Ú | Ú | U acute |
Û | Û | U circumflex |
Ü | Ü | U dieresis |
Ÿ | Ÿ | Y dieresis |
à | à | a grave |
á | á | a acute |
â | â | a circumflex |
ã | ã | a tilde |
ä | ä | a dieresis |
å | å | a ring |
ç | ç | c cedilla |
è | è | e grave |
é | é | e acute |
ê | ê | e circumflex |
ë | ë | e dieresis |
ì | ì | i grave |
í | í | i acute |
î | î | i circumflex |
ï | ï | i dieresis |
ı | ı | dotless i |
ñ | ñ | n tilde |
ò | ò | o grave |
ó | ó | o acute |
ô | ô | o circumflex |
õ | õ | o tilde |
ö | ö | o dieresis |
ø | ø | o slash |
ù | ù | u grave |
ú | ú | u acute |
û | û | u circumflex |
ü | ü | u dieresis |
ÿ | ÿ | y dieresis |
• | • | bullet |
¢ | ¢ | cent |
© | © | copyright |
¤ | ¤ | currency |
† | † | dagger |
° | ° | degree |
÷ | ÷ | divide |
‡ | ‡ | double dagger |
ƒ | ƒ | florin |
⁄ | ⁄ | fraction |
¡ | ¡ | inverted exclamation |
¿ | ¿ | inverted question |
µ | µ | micro |
· | · | middle dot |
¬ | ¬ | not |
ª | ª | ordfeminine |
º | º | ordmasculine |
¶ | ¶ | paragraph |
‰ | ‰ | perthousand |
± | ± | plusminus |
® | ® | registered |
§ | § | section |
£ | £ | sterling |
™ | ™ | trademark |
¥ | ¥ | yen |
‘ | ‘ | left single quote |
’ | ’ | right single quote |
“ | “ | left double quote |
” | ” | right double quote |
‹ | ‹ | left single guillemet |
› | › | right single guillemet |
« | « | left double guillemet |
» | » | right double guillemet |
‚ | ‚ | base single quote |
„ | „ | base double quote |
´ | ´ | acute |
˘ | ˘ | breve |
ˇ | ˇ | caron |
¸ | ¸ | cedilla |
^ | ^ | circumflex |
¨ | ¨ | dieresis |
˙ | ˙ | dot accent |
˝ | ˝ | double acute |
¯ | ¯ | macron |
˛ | ˛ | ogonek |
˚ | ˚ | ring |
˜ | ˜ | tilde |
… | … | ellipsis |
— | — | em dash |
– | – | en dash |
fi | fi | fi ligature |
fl | fl | fl ligature |
  | non-breaking space | |
Æ | Æ | AE |
æ | æ | ae |
Œ | Œ | OE |
œ | œ | oe |
ß | ß | sharp s |