Utf8ToUnicode (String function)

From m204wiki

Revision as of 00:21, 13 April 2011

Convert a UTF-8 Longstring bytestream to Unicode (String class)

The Utf8ToUnicode intrinsic function converts a UTF-8 string to Unicode.

Syntax

%unicode = string:Utf8ToUnicode[( [AllowUntranslatable= boolean])] Throws CharacterTranslationException

Syntax terms

%unicode
A string variable to receive the method object string translated to Unicode.
string
The method object string that is presumed to contain a UTF-8 byte stream.
AllowUntranslatable

Exceptions

Utf8ToUnicode can throw the following exception:

CharacterTranslationException
If the method encounters a translation problem, properties of the exception object may indicate the location and type of problem.
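As a rough analogy only, the following Python sketch shows the kind of information a UTF-8 decoder can report when it meets invalid input: the byte offset of the problem and a short reason. It is not SOUL code and does not use CharacterTranslationException; the function name describe_utf8_problem is hypothetical, and Python's UnicodeDecodeError attributes (start, reason) should not be taken as the property names of the SOUL exception, which this page does not list.

```python
def describe_utf8_problem(data: bytes) -> str:
    """Return 'ok' for valid UTF-8, else the offset and reason of the first problem.

    Illustration only: Python's UnicodeDecodeError stands in for the idea of an
    exception object that carries the location and type of a translation problem.
    """
    try:
        data.decode("utf-8")  # strict decoding: raises on malformed input
    except UnicodeDecodeError as err:
        # err.start is the zero-based byte offset where decoding failed
        return f"offset {err.start}: {err.reason}"
    return "ok"


# Valid three-byte sequence for U+2122
print(describe_utf8_problem(bytes.fromhex("E284A2")))
# 'A' followed by a truncated three-byte sequence: the problem starts at offset 1
print(describe_utf8_problem(bytes.fromhex("41E284")))
```

The second call reports offset 1 because the ASCII byte X'41' decodes cleanly and the incomplete sequence begins immediately after it.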

Usage notes

Examples

  1. In the following fragment, Utf8ToUnicode converts the three-byte UTF-8 sequence X'E284A2' to a single Unicode character (U+2122, the trademark sign). Because that character may not translate to a displayable EBCDIC character, the CharacterEncode option of UnicodeToEbcdic causes it to be output as a hexadecimal character reference.

    %u unicode
    %u = 'E284A2':X:Utf8ToUnicode
    print %u:UnicodeToEbcdic(CharacterEncode=true)

    The result of the above fragment is the character reference for the trademark character:

    &#x2122;
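The byte-level arithmetic behind this example can be cross-checked outside SOUL. The Python sketch below decodes the same hexadecimal input as UTF-8 and renders the result as a hexadecimal character reference. The function name utf8_hex_to_char_refs is hypothetical, and the rule used here (encode every non-ASCII character) is a simplification chosen for illustration; it is not a description of how UnicodeToEbcdic decides which characters to encode.

```python
def utf8_hex_to_char_refs(hex_bytes: str) -> str:
    """Decode a hex string as UTF-8, then write each non-ASCII character
    as an XML-style hexadecimal character reference."""
    # bytes.fromhex turns 'E284A2' into the three bytes E2 84 A2;
    # strict UTF-8 decoding raises UnicodeDecodeError on malformed input.
    text = bytes.fromhex(hex_bytes).decode("utf-8")
    return "".join(
        ch if ord(ch) < 0x80 else f"&#x{ord(ch):X};"  # simplification: all non-ASCII
        for ch in text
    )


result = utf8_hex_to_char_refs("E284A2")
print(result)  # prints &#x2122;
```

The three bytes E2 84 A2 carry the bit pattern 1110xxxx 10xxxxxx 10xxxxxx, whose payload bits 0010 000100 100010 form the code point X'2122'.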

See also