source: public/doc/gnu-c/UTF_002d8-String-Constants.html@ 02598c2

Last change on this file since 02598c2 was 02598c2, checked in by Mikhail Kirillov <w96k@…>, on Oct 6, 2022 at 12:36:29 PM

Add gnu-c

  • Property mode set to 100644
File size: 3.8 KB
Line 
1<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
2<html>
3<!-- Copyright (C) 2022 Richard Stallman and Free Software Foundation, Inc.
4
5(The work of Trevis Rothwell and Nelson Beebe has been assigned or
6licensed to the FSF.)
7
8Permission is granted to copy, distribute and/or modify this document
9under the terms of the GNU Free Documentation License, Version 1.3 or
10any later version published by the Free Software Foundation; with the
11Invariant Sections being "GNU General Public License," with the
12Front-Cover Texts being "A GNU Manual," and with the Back-Cover
13Texts as in (a) below. A copy of the license is included in the
14section entitled "GNU Free Documentation License."
15
16(a) The FSF's Back-Cover Text is: "You have the freedom to copy and
17modify this GNU manual. Buying copies from the FSF supports it in
18developing GNU and promoting software freedom." -->
19<!-- Created by GNU Texinfo 6.7, http://www.gnu.org/software/texinfo/ -->
20<head>
21<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
22<title>UTF-8 String Constants (GNU C Language Manual)</title>
23
24<meta name="description" content="UTF-8 String Constants (GNU C Language Manual)">
25<meta name="keywords" content="UTF-8 String Constants (GNU C Language Manual)">
26<meta name="resource-type" content="document">
27<meta name="distribution" content="global">
28<meta name="Generator" content="makeinfo">
29<link href="index.html" rel="start" title="Top">
30<link href="Symbol-Index.html" rel="index" title="Symbol Index">
31<link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
32<link href="Constants.html" rel="up" title="Constants">
33<link href="Unicode-Character-Codes.html" rel="next" title="Unicode Character Codes">
34<link href="String-Constants.html" rel="prev" title="String Constants">
35<style type="text/css">
36<!--
37a.summary-letter {text-decoration: none}
38blockquote.indentedblock {margin-right: 0em}
39div.display {margin-left: 3.2em}
40div.example {margin-left: 3.2em}
41div.lisp {margin-left: 3.2em}
42kbd {font-style: oblique}
43pre.display {font-family: inherit}
44pre.format {font-family: inherit}
45pre.menu-comment {font-family: serif}
46pre.menu-preformatted {font-family: serif}
47span.nolinebreak {white-space: nowrap}
48span.roman {font-family: initial; font-weight: normal}
49span.sansserif {font-family: sans-serif; font-weight: normal}
50ul.no-bullet {list-style: none}
51-->
52</style>
53
54
55</head>
56
57<body lang="en">
58<span id="UTF_002d8-String-Constants"></span><div class="header">
59<p>
60Next: <a href="Unicode-Character-Codes.html" accesskey="n" rel="next">Unicode Character Codes</a>, Previous: <a href="String-Constants.html" accesskey="p" rel="prev">String Constants</a>, Up: <a href="Constants.html" accesskey="u" rel="up">Constants</a> &nbsp; [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Symbol-Index.html" title="Index" rel="index">Index</a>]</p>
61</div>
62<hr>
63<span id="UTF_002d8-String-Constants-1"></span><h3 class="section">12.8 UTF-8 String Constants</h3>
64<span id="index-UTF_002d8-String-Constants"></span>
65
66<p>Writing &lsquo;<samp>u8</samp>&rsquo; immediately before a string constant, with no
67intervening space, means to represent that string in UTF-8 encoding as
68a sequence of bytes. UTF-8 represents ASCII characters with a single
69byte, and represents non-ASCII Unicode characters (codes 128 and up)
70as multibyte sequences. Here is an example of a UTF-8 constant:
71</p>
72<div class="example">
73<pre class="example">u8&quot;A cónstàñt&quot;
74</pre></div>
75
76<p>This constant occupies 13 bytes plus the terminating null,
77because each of the accented letters is a two-byte sequence.
78</p>
79<p>Concatenating an ordinary string with a UTF-8 string conceptually
80produces another UTF-8 string. However, if the ordinary string
81contains character codes 128 and up, the results cannot be relied on.
82</p>
83
84
85
86</body>
87</html>
Note: See TracBrowser for help on using the repository browser.