UnicodeToUtf8 (Unicode function): Difference between revisions

Latest revision as of 20:10, 6 November 2012

Translate to UTF-8 (Unicode class)

UnicodeToUtf8 converts a Unicode string to a UTF-8 Longstring byte stream.

Syntax

%string = unicode:UnicodeToUtf8[( [InsertBOM= boolean])]

Syntax terms

%string	A `String` or `Longstring` variable to receive the method object string translated to a UTF-8 byte stream.
unicode	A `Unicode` string.
`InsertBOM`	The optional, name required, `InsertBOM` argument is a Boolean: If its value is `True`, the "Byte Order Mark" (U+FEFF, encoded as X'EFBBBF') is inserted at the start of the output stream. If its value is `False`, the default, no Byte Order Mark is inserted.

Exceptions

UnicodeToUtf8 can throw the following exception:

CharacterTranslationException: If the method encounters a translation problem, properties of the exception object may indicate the location and type of problem.

Usage notes

UnicodeToUtf8 is available as of Sirius Mods Version 7.3.

Examples

In the following fragment, UnicodeToUtf8 is used to show how the Unicode U+B2 character (superscript 2) is represented in UTF-8. Appending the StringToHex method is useful for viewing the hex values of characters that do not have displayable EBCDIC equivalents.

%u unicode initial('²':U) print %u:UnicodeToUtf8:stringToHex

The result is:

C2B2

@@ Line 11: / Line 11: @@
 <td>A <var>Unicode</var> string.</td></tr>
 <tr><th><var>InsertBOM</var></th>
-<td>The optional (<var>[[Methods#Named parameters|NameRequired]]</var>) <var>InsertBOM</var> argument is a [[Boolean enumeration|Boolean]]:<ul><li>If its value is <code>True</code>, the "Byte Order Mark" (U+FEFF, encoded as X'EFBBBF') is inserted at the start of the output stream.<li>If its value is <code>False</code>, the default, no Byte Order Mark is inserted.</ul></td></tr>
+<td>The optional, [[Notation conventions for methods#Named parameters|name required]], <var>InsertBOM</var> argument is a [[Boolean enumeration|Boolean]]:
+<ul>
+<li>If its value is <code>True</code>, the "Byte Order Mark" (U+FEFF, encoded as X'EFBBBF') is inserted at the start of the output stream.
+<li>If its value is <code>False</code>, the default, no Byte Order Mark is inserted.
+</ul></td></tr>
 </table>
@@ Line 22: / Line 26: @@
 ==Usage notes==
-<ul><li><var>UnicodeToUtf8</var> is available as of <var class="product">[[Sirius Mods]]</var> Version 7.3.</ul>
+<ul>
+<li><var>UnicodeToUtf8</var> is available as of <var class="product">Sirius Mods</var> Version 7.3.
+</ul>
 ==Examples==
-<ol><li>
 In the following fragment, <var>UnicodeToUtf8</var> is used to show how the <var>Unicode</var> U+B2 character (superscript 2) is represented in UTF-8.  Appending the <var>StringToHex</var> method is useful for viewing the hex values of characters that do not have displayable EBCDIC equivalents.
-<p class="code">%u unicode initial('&amp;amp;#xB2;':[[U (String function)|U]])
+<p class="code">%u unicode initial('&amp;#xB2;':[[U (String function)|U]])
 print %u:UnicodeToUtf8:[[StringToHex (String function)|stringToHex]]
 </p>
 The result is:
 <p class="output">C2B2
-</p></ol>
+</p>
 ==See also==

Float class String class Unicode class	List of Float methods List of String methods List of Unicode methods List of Intrinsic methods	Float methods syntax String methods syntax Unicode methods syntax
Notation conventions for methods

UnicodeToUtf8 (Unicode function): Difference between revisions

Latest revision as of 20:10, 6 November 2012

Contents

Syntax

Syntax terms

Exceptions

Usage notes

Examples

See also

Navigation menu

UnicodeToUtf8 (Unicode function): Difference between revisions

Latest revision as of 20:10, 6 November 2012

Syntax

Syntax terms

Exceptions

Usage notes

Examples

See also

Navigation menu

Search