Supplementary: Common HTML entities used for typography
By Ben Henick · 26 Sep, 2008
Introduction
There are a number of HTML entities that come in handy when there’s a need for first-rate typesetting. Many of those listed in Table 1 are useful only when used in foreign language copy (and copy written in specific dialects of English), so context should be taken into account before the choice is made to use them.
For the sake of portability, Unicode entity references should be reserved for use in documents certain to be written in the UTF-8 or UTF-16 character sets. In all other cases, the alphanumeric references should be used.
| Character(s) | Literal(s) | Alphanumeric value(s) | Unicode value(s) | Prefer to |
|---|---|---|---|---|
| Cent (currency) | ¢ | ¢ |
¢ |
|
| Pound (currency) | £ | £ |
£ |
|
| Section 1 | § | § |
§ |
|
| Copyright | © | © |
© |
(c) |
| Guillemets 2 | « » | « » |
« » |
" |
| Registered trademark | ® | ® |
® |
(R) |
| Degree(s) | ° | ° |
° |
|
| Plus/minus | ± | ± |
± |
+/- |
| Pilcrow (paragraph) 3 | ¶ | ¶ |
¶ |
|
| Middle dot 4 | · | · |
· |
|
| Fractional half 5 | ½ | ½ |
¼ |
1/2 |
| En dash 6, 7 | – | – |
– |
- for ranges |
| Em (long) dash 7, 8 | — | — |
— |
- enclosed by spaces, or -- |
| Single quotes 9, 10 | ‘ ’ | ‘ ’ |
‘ ’ |
' or ' |
| Single low quote 11 | ‚ | ‚ |
‚ |
' or comma |
| Double quotes 9 | “ ” | “ ” |
“ ” |
", ", '', or `` |
| Double low quote 11 | „ | „ |
„ |
" or ,, |
| Single & double daggers | † ‡ | † ‡ |
† ‡ |
* and ** |
| Bullet | • | • |
• |
* |
| Ellipsis 12 | … | … |
… |
... |
| Prime & double prime 13 | ′ ″ | ′ ″ |
′ ″ |
', '', ', ", minutes:seconds elapsed |
| Euro sign | € | € |
€ |
|
| Trademark | ™ | ™ |
™ |
(tm) |
| Almost equal to | ≈ | ≈ |
≈ |
~ |
| Not equal to | ≠ | ≠ |
≠ |
!= |
| Less/greater than or equal to | ≤ ≥ | ≤ ≥ |
≤ ≥ |
<= or >= |
| Less/greater than | < > | < > |
> < |
Table 1: HTML entities useful for proper typesetting, listed in order by decimal Unicode position.
Note that guillemets are used for quotes in certain European languages (such as French and Norsk); in these situations, you should always use q elements instead.
HTML entity usage notes
- Citations of statute law, eg, “29 USC § 794 (d),” are the matter most likely to reference this character.
- Guillemets often enclose the names of stories, songs, films, public accommodations (eg, «Rick’s Café Americain»), and popular toponyms in European languages, particularly those of the Romance sub-family. They are also used for quotes in certain European languages (such as French and Norsk); in these situations, you should always use
qelements instead. - The pilcrow, used to mark the beginning of paragraphs that might otherwise be ambiguous, is useful when setting teaser copy. The print distribution of Rolling Stone magazine has often used such an approach. In technical writing, it might also be useful for marking an orphaned first line of a paragraph.
¶ Paragraphs marked with this symbol will most often be assigned a display value of inline, which will be explained in the introduction to the CSS layout model.
−/−). However, it should always be distinguished from a hyphen (-), which is used to separate the parts of an ad hoc compound word.letter-spacing or text-align properties.About the author
Ben Henick has been building Web sites in one capacity or another since September 1995, when he took on his first Web project as an academic volunteer. Since then, most of his work has been done on a freelance basis.
Ben is a generalist; his skillset touches on nearly every aspect of site design and development, from CSS and HTML, to design and copywriting, to PHP/MySQL and JavaScript/Ajax.
He lives in Lawrence, Kansas, with three computers and zero television sets. You can read more about him and his work at henick.net.
