Utf8ToUnicode (String function): Difference between revisions

Latest revision as of 19:38, 13 April 2016

Convert a UTF-8 Longstring bytestream to Unicode (String class)

The Utf8ToUnicode intrinsic function converts a UTF-8 string to Unicode.

Syntax

%unicode = string:Utf8ToUnicode[( [AllowUntranslatable= boolean])] Throws CharacterTranslationException

Syntax terms

%unicode	A Unicode string variable to receive the method object `string` translated to `Unicode`.
string	The method object `string` that is presumed to contain a UTF-8 byte stream.
`AllowUntranslatable`	This argument indicates whether this function will store values into the target Unicode string if they cannot be translated to EBCDIC. This value defaults to `True`, which means that such values would be allowed. If this argument is set to `False`, a Unicode value not translatable to EBCDIC produces a CharacterTranslationException exception. Unless there is a compelling reason to do otherwise, it is best to use the default value of `True`.

Exceptions

Utf8ToUnicode can throw the following exception:

`CharacterTranslationException`	If the method encounters a translation problem, properties of the exception object may indicate the location and type of problem.

Examples

In the following fragment, Utf8ToUnicode converts a hexadecimal input to a single Unicode character. In case the Unicode character translates to an EBCDIC character that cannot be displayed, the CharacterEncode option of UnicodeToEbcdic causes the output of a hexadecimal character reference.

%u unicode %u = 'E284A2':X:Utf8ToUnicode print %u:unicodeToEbcdic(CharacterEncode=true)

The result of the above fragment is the character reference for the trademark character:

™

@@ Line 7: / Line 7: @@
 <table class="syntaxTable">
 <tr><th>%unicode</th>
-<td>A string variable to receive the method object <var class="term">string</var> translated to <var>Unicode</var>.</td></tr>
+<td>A Unicode string variable to receive the method object <var class="term">string</var> translated to <var>Unicode</var>.</td></tr>
 <tr><th>string</th>
 <td>The method object <var class="term">string</var> that is presumed to contain a UTF-8 byte stream.</td></tr>
 <tr><th><var>AllowUntranslatable</var></th>
-<td></td>
+<td>This argument indicates whether this function will store values into the target Unicode string if they cannot be translated to EBCDIC. This value defaults to <var>True</var>, which means that such values would be allowed. If this argument is set to <var>False</var>, a Unicode value not translatable to EBCDIC produces a [[CharacterTranslationException class|CharacterTranslationException exception]].
+<p>
+Unless there is a compelling reason to do otherwise, it is best to use the default value of <var>True</var>.</p></td></tr>
 </table>
@@ Line 17: / Line 21: @@
 <var>Utf8ToUnicode</var> can throw the following exception:
-<dl>
+<table class="noBorder">
-<dt><var>[[CharacterTranslationException_class|CharacterTranslationException]]</var>
+<tr><th><var>[[CharacterTranslationException_class|CharacterTranslationException]]</var> </th>
-<dd>If the method encounters a translation problem, properties of the exception object may indicate the location and type of problem.
+<td>If the method encounters a translation problem, properties of the exception object may indicate the location and type of problem. </td></tr>
-</dl>
+</table>
-==Usage notes==
-<ul><li><var>Utf8ToUnicode</var> is available as of <var class="product">[[Sirius Mods|"Sirius Mods"]]</var> Version 7.3.</ul>
 ==Examples==
-<ol><li>
+In the following fragment, <var>Utf8ToUnicode</var> converts a hexadecimal input to a single <var>Unicode</var> character. In case the <var>Unicode</var> character translates to an EBCDIC character that cannot be displayed, the <var>CharacterEncode</var> option of <var>[[UnicodeToEbcdic (Unicode function)|UnicodeToEbcdic]]</var> causes the output of a hexadecimal character reference.
-In the following fragment, <var>Utf8ToUnicode</var> converts a hexadecimal input to a single <var>Unicode</var> character. In case the <var>Unicode</var> character translates to an EBCDIC character that cannot be displayed, the CharacterEncode option of <var>[[UnicodeToEbcdic (Unicode function)|UnicodeToEbcdic]]</var> causes the output of a hexadecimal character reference.
 <p class="code">%u unicode
 %u = 'E284A2':[[X (String function)|X]]:Utf8ToUnicode
@@ Line 33: / Line 33: @@
 </p>
 The result of the above fragment is the character reference for the trademark character:
-<p class="output">&amp;amp;#x2122;
+<p class="output">&amp;#x2122;
-</p></ol>
+</p>
 ==See also==
-<ul><li><var>[[Unicode#UTF-8_and_UTF-16|Utf8 and Utf16]]</var> has more information about UTF-8 conversions.
+<ul>
-<li><var>[[UnicodeToUtf8_(Unicode_function)|UnicodeToUtf8]]</var> converts a <var>Unicode</var> string to a UTF-8 <var>Longstring</var> byte stream.
+<li>[[Unicode#UTF-8 and UTF-16|Utf8 and Utf16]] has more information about UTF-8 conversions. </li>
-<li><var>[[Utf16ToUnicode (String function)|Utf16ToUnicode]]</var> converts a UTF-16 byte stream to <var>Unicode</var>.
+<li><var>[[UnicodeToUtf8 (Unicode function)|UnicodeToUtf8]]</var> converts a <var>Unicode</var> string to a UTF-8 <var>Longstring</var> byte stream. </li>
+<li><var>[[Utf16ToUnicode (String function)|Utf16ToUnicode]]</var> converts a UTF-16 byte stream to <var>Unicode</var>. </li>
 </ul>
 {{Template:String:Utf8ToUnicode footer}}

Float class String class Unicode class	List of Float methods List of String methods List of Unicode methods List of Intrinsic methods	Float methods syntax String methods syntax Unicode methods syntax
Notation conventions for methods

Utf8ToUnicode (String function): Difference between revisions

Latest revision as of 19:38, 13 April 2016

Contents

Syntax

Syntax terms

Exceptions

Examples

See also

Navigation menu

Utf8ToUnicode (String function): Difference between revisions

Latest revision as of 19:38, 13 April 2016

Syntax

Syntax terms

Exceptions

Examples

See also

Navigation menu

Search