source: public/doc/gnu-c/Characters.html@ 02598c2

Last change on this file since 02598c2 was 02598c2, checked in by Mikhail Kirillov <w96k@…>, on Oct 6, 2022 at 12:36:29 PM

Add gnu-c

  • Property mode set to 100644
File size: 4.4 KB
Line 
1<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
2<html>
3<!-- Copyright (C) 2022 Richard Stallman and Free Software Foundation, Inc.
4
5(The work of Trevis Rothwell and Nelson Beebe has been assigned or
6licensed to the FSF.)
7
8Permission is granted to copy, distribute and/or modify this document
9under the terms of the GNU Free Documentation License, Version 1.3 or
10any later version published by the Free Software Foundation; with the
11Invariant Sections being "GNU General Public License," with the
12Front-Cover Texts being "A GNU Manual," and with the Back-Cover
13Texts as in (a) below. A copy of the license is included in the
14section entitled "GNU Free Documentation License."
15
16(a) The FSF's Back-Cover Text is: "You have the freedom to copy and
17modify this GNU manual. Buying copies from the FSF supports it in
18developing GNU and promoting software freedom." -->
19<!-- Created by GNU Texinfo 6.7, http://www.gnu.org/software/texinfo/ -->
20<head>
21<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
22<title>Characters (GNU C Language Manual)</title>
23
24<meta name="description" content="Characters (GNU C Language Manual)">
25<meta name="keywords" content="Characters (GNU C Language Manual)">
26<meta name="resource-type" content="document">
27<meta name="distribution" content="global">
28<meta name="Generator" content="makeinfo">
29<link href="index.html" rel="start" title="Top">
30<link href="Symbol-Index.html" rel="index" title="Symbol Index">
31<link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
32<link href="Lexical-Syntax.html" rel="up" title="Lexical Syntax">
33<link href="Whitespace.html" rel="next" title="Whitespace">
34<link href="English.html" rel="prev" title="English">
35<style type="text/css">
36<!--
37a.summary-letter {text-decoration: none}
38blockquote.indentedblock {margin-right: 0em}
39div.display {margin-left: 3.2em}
40div.example {margin-left: 3.2em}
41div.lisp {margin-left: 3.2em}
42kbd {font-style: oblique}
43pre.display {font-family: inherit}
44pre.format {font-family: inherit}
45pre.menu-comment {font-family: serif}
46pre.menu-preformatted {font-family: serif}
47span.nolinebreak {white-space: nowrap}
48span.roman {font-family: initial; font-weight: normal}
49span.sansserif {font-family: sans-serif; font-weight: normal}
50ul.no-bullet {list-style: none}
51-->
52</style>
53
54
55</head>
56
57<body lang="en">
58<span id="Characters"></span><div class="header">
59<p>
60Next: <a href="Whitespace.html" accesskey="n" rel="next">Whitespace</a>, Previous: <a href="English.html" accesskey="p" rel="prev">English</a>, Up: <a href="Lexical-Syntax.html" accesskey="u" rel="up">Lexical Syntax</a> &nbsp; [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Symbol-Index.html" title="Index" rel="index">Index</a>]</p>
61</div>
62<hr>
63<span id="Characters-1"></span><h3 class="section">5.2 Characters</h3>
64<span id="index-character-set"></span>
65<span id="index-Unicode"></span>
66
67
68<p>GNU C source files are usually written in the
69<a href="https://en.wikipedia.org/wiki/ASCII">ASCII</a> character set, which
70was defined in the 1960s for English. However, they can also include
71Unicode characters represented in the
72<a href="https://en.wikipedia.org/wiki/UTF-8">UTF-8</a> multibyte encoding.
73This makes it possible to represent accented letters such as &lsquo;<samp>á</samp>&rsquo;,
74as well as other scripts such as Arabic, Chinese, Cyrillic, Hebrew,
75Japanese, and Korean.<a id="DOCF1" href="#FOOT1"><sup>1</sup></a>
76</p>
77<p>In C source code, non-ASCII characters are valid in comments, in wide
78character constants (see <a href="Wide-Character-Constants.html">Wide Character Constants</a>), and in string
79constants (see <a href="String-Constants.html">String Constants</a>).
80</p>
81<p>Another way to specify non-ASCII characters in constants (character or
82string) and identifiers is with an escape sequence starting with
83backslash, specifying the intended Unicode character. (See <a href="Unicode-Character-Codes.html">Unicode Character Codes</a>.) This specifies non-ASCII characters without
84putting a real non-ASCII character in the source file itself.
85</p>
86<p>C accepts two-character aliases called <em>digraphs</em> for certain
87characters. See <a href="Digraphs.html">Digraphs</a>.
88</p>
89<div class="footnote">
90<hr>
91<h4 class="footnotes-heading">Footnotes</h4>
92
93<h5><a id="FOOT1" href="#DOCF1">(1)</a></h3>
94<p>On some obscure systems, GNU C uses
95UTF-EBCDIC instead of UTF-8, but that is not worth describing in this
96manual.</p>
97</div>
98
99
100
101</body>
102</html>
Note: See TracBrowser for help on using the repository browser.