1 | <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
|
---|
2 | <html>
|
---|
3 | <!-- Copyright (C) 2022 Richard Stallman and Free Software Foundation, Inc.
|
---|
4 |
|
---|
5 | (The work of Trevis Rothwell and Nelson Beebe has been assigned or
|
---|
6 | licensed to the FSF.)
|
---|
7 |
|
---|
8 | Permission is granted to copy, distribute and/or modify this document
|
---|
9 | under the terms of the GNU Free Documentation License, Version 1.3 or
|
---|
10 | any later version published by the Free Software Foundation; with the
|
---|
11 | Invariant Sections being "GNU General Public License," with the
|
---|
12 | Front-Cover Texts being "A GNU Manual," and with the Back-Cover
|
---|
13 | Texts as in (a) below. A copy of the license is included in the
|
---|
14 | section entitled "GNU Free Documentation License."
|
---|
15 |
|
---|
16 | (a) The FSF's Back-Cover Text is: "You have the freedom to copy and
|
---|
17 | modify this GNU manual. Buying copies from the FSF supports it in
|
---|
18 | developing GNU and promoting software freedom." -->
|
---|
19 | <!-- Created by GNU Texinfo 6.7, http://www.gnu.org/software/texinfo/ -->
|
---|
20 | <head>
|
---|
21 | <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
|
---|
22 | <title>UTF-8 String Constants (GNU C Language Manual)</title>
|
---|
23 |
|
---|
24 | <meta name="description" content="UTF-8 String Constants (GNU C Language Manual)">
|
---|
25 | <meta name="keywords" content="UTF-8 String Constants (GNU C Language Manual)">
|
---|
26 | <meta name="resource-type" content="document">
|
---|
27 | <meta name="distribution" content="global">
|
---|
28 | <meta name="Generator" content="makeinfo">
|
---|
29 | <link href="index.html" rel="start" title="Top">
|
---|
30 | <link href="Symbol-Index.html" rel="index" title="Symbol Index">
|
---|
31 | <link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
|
---|
32 | <link href="Constants.html" rel="up" title="Constants">
|
---|
33 | <link href="Unicode-Character-Codes.html" rel="next" title="Unicode Character Codes">
|
---|
34 | <link href="String-Constants.html" rel="prev" title="String Constants">
|
---|
35 | <style type="text/css">
|
---|
36 | <!--
|
---|
37 | a.summary-letter {text-decoration: none}
|
---|
38 | blockquote.indentedblock {margin-right: 0em}
|
---|
39 | div.display {margin-left: 3.2em}
|
---|
40 | div.example {margin-left: 3.2em}
|
---|
41 | div.lisp {margin-left: 3.2em}
|
---|
42 | kbd {font-style: oblique}
|
---|
43 | pre.display {font-family: inherit}
|
---|
44 | pre.format {font-family: inherit}
|
---|
45 | pre.menu-comment {font-family: serif}
|
---|
46 | pre.menu-preformatted {font-family: serif}
|
---|
47 | span.nolinebreak {white-space: nowrap}
|
---|
48 | span.roman {font-family: initial; font-weight: normal}
|
---|
49 | span.sansserif {font-family: sans-serif; font-weight: normal}
|
---|
50 | ul.no-bullet {list-style: none}
|
---|
51 | -->
|
---|
52 | </style>
|
---|
53 |
|
---|
54 |
|
---|
55 | </head>
|
---|
56 |
|
---|
57 | <body lang="en">
|
---|
58 | <span id="UTF_002d8-String-Constants"></span><div class="header">
|
---|
59 | <p>
|
---|
60 | Next: <a href="Unicode-Character-Codes.html" accesskey="n" rel="next">Unicode Character Codes</a>, Previous: <a href="String-Constants.html" accesskey="p" rel="prev">String Constants</a>, Up: <a href="Constants.html" accesskey="u" rel="up">Constants</a> [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Symbol-Index.html" title="Index" rel="index">Index</a>]</p>
|
---|
61 | </div>
|
---|
62 | <hr>
|
---|
63 | <span id="UTF_002d8-String-Constants-1"></span><h3 class="section">12.8 UTF-8 String Constants</h3>
|
---|
64 | <span id="index-UTF_002d8-String-Constants"></span>
|
---|
65 |
|
---|
66 | <p>Writing ‘<samp>u8</samp>’ immediately before a string constant, with no
|
---|
67 | intervening space, means to represent that string in UTF-8 encoding as
|
---|
68 | a sequence of bytes. UTF-8 represents ASCII characters with a single
|
---|
69 | byte, and represents non-ASCII Unicode characters (codes 128 and up)
|
---|
70 | as multibyte sequences. Here is an example of a UTF-8 constant:
|
---|
71 | </p>
|
---|
72 | <div class="example">
|
---|
73 | <pre class="example">u8"A cónstàñt"
|
---|
74 | </pre></div>
|
---|
75 |
|
---|
76 | <p>This constant occupies 13 bytes plus the terminating null,
|
---|
77 | because each of the accented letters is a two-byte sequence.
|
---|
78 | </p>
|
---|
79 | <p>Concatenating an ordinary string with a UTF-8 string conceptually
|
---|
80 | produces another UTF-8 string. However, if the ordinary string
|
---|
81 | contains character codes 128 and up, the results cannot be relied on.
|
---|
82 | </p>
|
---|
83 |
|
---|
84 |
|
---|
85 |
|
---|
86 | </body>
|
---|
87 | </html>
|
---|