UnicodeToUtf8 (Unicode function): Difference between revisions

From m204wiki
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
Line 1: Line 1:
<span style="font-size:120%; color:black"><b>Unicode string converted to UTF-8 byte stream</b></span>
{{Template:Unicode:UnicodeToUtf8 subtitle}}
[[Category:Intrinsic Unicode methods|UnicodeToUtf8 function]]
[[Category:Intrinsic Unicode methods|UnicodeToUtf8 function]]
[[Category:Intrinsic methods]]
[[Category:Intrinsic methods]]
Line 10: Line 10:


The UnicodeToUtf8 function is available as of version 7.3 of the ''Sirius Mods''.
The UnicodeToUtf8 function is available as of version 7.3 of the ''Sirius Mods''.
===Syntax===
==Syntax==
   %utf8Stream = unicode:UnicodeToUtf8([InsertBOM=bool])])
   %utf8Stream = unicode:UnicodeToUtf8([InsertBOM=bool])])
<dl>
<dl>
Line 21: Line 21:
<dd>The optional (name required) InsertBOM argument is a Boolean:
<dd>The optional (name required) InsertBOM argument is a Boolean:
<ul>
<ul>
<li>If its value is <tt>True</tt>, the &ldquo;Byte Order
<li>If its value is <tt>True</tt>, the "Byte Order
Mark&rdquo; (U+FEFF, encoded as X'EFBBBF') is inserted at the start of the output
Mark" (U+FEFF, encoded as X'EFBBBF') is inserted at the start of the output
stream.
stream.
<li>If its value is <tt>False</tt>, the default,
<li>If its value is <tt>False</tt>, the default,
Line 38: Line 38:
See [[CharacterTranslationException exception class]].
See [[CharacterTranslationException exception class]].
</dl>
</dl>
===Usage Notes===
==Usage notes==
<ul>
<ul>
<li>For more information about UTF-8 conversions, see [[Unicode#UTF-8 and UTF-16]].
<li>For more information about UTF-8 conversions, see [[Unicode#UTF-8 and UTF-16]].
Line 46: Line 46:
converts a UTF-8 Longstring byte stream to Unicode.
converts a UTF-8 Longstring byte stream to Unicode.
</ul>
</ul>
===Examples===
==Examples==


In the following fragment, UnicodeToUtf8 is used to show how the
In the following fragment, UnicodeToUtf8 is used to show how the

Revision as of 04:09, 19 January 2011

Translate to UTF-8 (Unicode class)

This function converts a Unicode string to a UTF-8 Longstring byte stream.

The UnicodeToUtf8 function is available as of version 7.3 of the Sirius Mods.

Syntax

  %utf8Stream = unicode:UnicodeToUtf8([InsertBOM=bool])])
%utf8Stream
A String or Longstring variable to receive the method object string translated to a UTF-8 byte stream.
unicode
A Unicode string.
InsertBOM=bool
The optional (name required) InsertBOM argument is a Boolean:
  • If its value is True, the "Byte Order Mark" (U+FEFF, encoded as X'EFBBBF') is inserted at the start of the output stream.
  • If its value is False, the default, no Byte Order Mark is inserted.

Exceptions

This function can throw the following exception:

CharacterTranslationException
If the method encounters a translation problem, properties of the exception object may indicate the location and type of problem. See CharacterTranslationException exception class.

Usage notes

Examples

In the following fragment, UnicodeToUtf8 is used to show how the Unicode U+B2 character (superscript 2) is represented in UTF-8. Appending the StringToHex method is useful for viewing the hex values of characters that do not have displayable EBCDIC equivalents.

The U constant function and StringToHex function are used in the example.

    %u Unicode Initial('&#xB2;':U)
    Print %u:UnicodeToUtf8:StringToHex

The result is:

    C2B2