1 | <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
|
---|
2 | <html>
|
---|
3 | <!-- Copyright (C) 2022 Richard Stallman and Free Software Foundation, Inc.
|
---|
4 |
|
---|
5 | (The work of Trevis Rothwell and Nelson Beebe has been assigned or
|
---|
6 | licensed to the FSF.)
|
---|
7 |
|
---|
8 | Permission is granted to copy, distribute and/or modify this document
|
---|
9 | under the terms of the GNU Free Documentation License, Version 1.3 or
|
---|
10 | any later version published by the Free Software Foundation; with the
|
---|
11 | Invariant Sections being "GNU General Public License," with the
|
---|
12 | Front-Cover Texts being "A GNU Manual," and with the Back-Cover
|
---|
13 | Texts as in (a) below. A copy of the license is included in the
|
---|
14 | section entitled "GNU Free Documentation License."
|
---|
15 |
|
---|
16 | (a) The FSF's Back-Cover Text is: "You have the freedom to copy and
|
---|
17 | modify this GNU manual. Buying copies from the FSF supports it in
|
---|
18 | developing GNU and promoting software freedom." -->
|
---|
19 | <!-- Created by GNU Texinfo 6.7, http://www.gnu.org/software/texinfo/ -->
|
---|
20 | <head>
|
---|
21 | <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
|
---|
22 | <title>Characters (GNU C Language Manual)</title>
|
---|
23 |
|
---|
24 | <meta name="description" content="Characters (GNU C Language Manual)">
|
---|
25 | <meta name="keywords" content="Characters (GNU C Language Manual)">
|
---|
26 | <meta name="resource-type" content="document">
|
---|
27 | <meta name="distribution" content="global">
|
---|
28 | <meta name="Generator" content="makeinfo">
|
---|
29 | <link href="index.html" rel="start" title="Top">
|
---|
30 | <link href="Symbol-Index.html" rel="index" title="Symbol Index">
|
---|
31 | <link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
|
---|
32 | <link href="Lexical-Syntax.html" rel="up" title="Lexical Syntax">
|
---|
33 | <link href="Whitespace.html" rel="next" title="Whitespace">
|
---|
34 | <link href="English.html" rel="prev" title="English">
|
---|
35 | <style type="text/css">
|
---|
36 | <!--
|
---|
37 | a.summary-letter {text-decoration: none}
|
---|
38 | blockquote.indentedblock {margin-right: 0em}
|
---|
39 | div.display {margin-left: 3.2em}
|
---|
40 | div.example {margin-left: 3.2em}
|
---|
41 | div.lisp {margin-left: 3.2em}
|
---|
42 | kbd {font-style: oblique}
|
---|
43 | pre.display {font-family: inherit}
|
---|
44 | pre.format {font-family: inherit}
|
---|
45 | pre.menu-comment {font-family: serif}
|
---|
46 | pre.menu-preformatted {font-family: serif}
|
---|
47 | span.nolinebreak {white-space: nowrap}
|
---|
48 | span.roman {font-family: initial; font-weight: normal}
|
---|
49 | span.sansserif {font-family: sans-serif; font-weight: normal}
|
---|
50 | ul.no-bullet {list-style: none}
|
---|
51 | -->
|
---|
52 | </style>
|
---|
53 |
|
---|
54 |
|
---|
55 | </head>
|
---|
56 |
|
---|
57 | <body lang="en">
|
---|
58 | <span id="Characters"></span><div class="header">
|
---|
59 | <p>
|
---|
60 | Next: <a href="Whitespace.html" accesskey="n" rel="next">Whitespace</a>, Previous: <a href="English.html" accesskey="p" rel="prev">English</a>, Up: <a href="Lexical-Syntax.html" accesskey="u" rel="up">Lexical Syntax</a> [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Symbol-Index.html" title="Index" rel="index">Index</a>]</p>
|
---|
61 | </div>
|
---|
62 | <hr>
|
---|
63 | <span id="Characters-1"></span><h3 class="section">5.2 Characters</h3>
|
---|
64 | <span id="index-character-set"></span>
|
---|
65 | <span id="index-Unicode"></span>
|
---|
66 |
|
---|
67 |
|
---|
68 | <p>GNU C source files are usually written in the
|
---|
69 | <a href="https://en.wikipedia.org/wiki/ASCII">ASCII</a> character set, which
|
---|
70 | was defined in the 1960s for English. However, they can also include
|
---|
71 | Unicode characters represented in the
|
---|
72 | <a href="https://en.wikipedia.org/wiki/UTF-8">UTF-8</a> multibyte encoding.
|
---|
73 | This makes it possible to represent accented letters such as ‘<samp>á</samp>’,
|
---|
74 | as well as other scripts such as Arabic, Chinese, Cyrillic, Hebrew,
|
---|
75 | Japanese, and Korean.<a id="DOCF1" href="#FOOT1"><sup>1</sup></a>
|
---|
76 | </p>
|
---|
77 | <p>In C source code, non-ASCII characters are valid in comments, in wide
|
---|
78 | character constants (see <a href="Wide-Character-Constants.html">Wide Character Constants</a>), and in string
|
---|
79 | constants (see <a href="String-Constants.html">String Constants</a>).
|
---|
80 | </p>
|
---|
81 | <p>Another way to specify non-ASCII characters in constants (character or
|
---|
82 | string) and identifiers is with an escape sequence starting with
|
---|
83 | backslash, specifying the intended Unicode character. (See <a href="Unicode-Character-Codes.html">Unicode Character Codes</a>.) This specifies non-ASCII characters without
|
---|
84 | putting a real non-ASCII character in the source file itself.
|
---|
85 | </p>
|
---|
86 | <p>C accepts two-character aliases called <em>digraphs</em> for certain
|
---|
87 | characters. See <a href="Digraphs.html">Digraphs</a>.
|
---|
88 | </p>
|
---|
89 | <div class="footnote">
|
---|
90 | <hr>
|
---|
91 | <h4 class="footnotes-heading">Footnotes</h4>
|
---|
92 |
|
---|
93 | <h5><a id="FOOT1" href="#DOCF1">(1)</a></h3>
|
---|
94 | <p>On some obscure systems, GNU C uses
|
---|
95 | UTF-EBCDIC instead of UTF-8, but that is not worth describing in this
|
---|
96 | manual.</p>
|
---|
97 | </div>
|
---|
98 |
|
---|
99 |
|
---|
100 |
|
---|
101 | </body>
|
---|
102 | </html>
|
---|