ParseLines (Stringlist subroutine)

From m204wiki
Jump to navigation Jump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Parse delimited string, appending to this Stringlist (Stringlist class)


This method is a subroutine used to parse a delimited string into a Stringlist object. Generally, as the name of the method suggests, the string would contain line-end character delimited lines.

Syntax

sl:ParseLines( string, [delims], [StripTrailingNull= boolean])

Syntax terms

sl A Stringlist object; the parsed lines are appended as items to sl.
string The string to be parsed.
delims An, optional, list of one or more delimiters to be used as line delimiters. If specified, the first character in the delimiter list is the separator character for the delimiter list itself and the remainder are the delimiters to be used to separate the string being parsed. If this optional argument is not specified, the delimiter list used contains X'0D', X'25', and X'0D25' (which are the EBCDIC carriage-return, line-feed, and carriage-return/line-feed characters respectively), and it should handle any line-end delimited data from an ASCII host that is been translated to ASCII.
StripTrailingNull This name required argument, StripTrailingNull, is a boolean value that indicates whether a trailing null line should be stripped. StripTrailingNull is an optional argument that defaults to true, which results in a trailing null line being stripped.

Usage notes

  • All errors in ParseLines result in request cancellation.
  • The default for trailing null-line stripping for ParseLines is based on the fact that most ASCII applications end the last line of a file with the line-end character(s). As such, not stripping the trailing null-line would create a Stringlist item where a line was not intended, so it would likely be a bit of an annoyance. Similarly, the Createlines method also ends the output string with the line delimiter by default.

    Still, the default trailing null-line handling is necessarily the most "correct" in some absolute sense. Specifically, if a ParseLines operation is used to load a Stringlist, the result of a Createlines with the same delimiter character would produce an output different from the input to ParseLines, if the original input did not end with a delimiter — a delimiter would be added. Using False for both the ParseLines StripTrailingNull parameter and the Createlines AddTrailingDelimiter parameter would ensure reversability of these two methods.

Examples

  1. The following loads the names of three popular sandwich contents onto separate Stringlist items:

    %contents is object stringList %contents = new ... %contents:parseLines('Bacon/Lettuce/Tomato', ' /')

    The space before the slash character is the delimiter-set delimiter character, that is, the character used to separate multiple delimiters, should there be more than one. As this example illustrates, the ParseLines method can be used for general-purpose parsing, although this was not really its intent and it is somewhat limited for this purpose. Its main use is expected to be in parsing text received into a string via $Web_Input_Content, $Web_File_Content, or $Web_Output_Content. Accordingly, the default delimiter set for the ParseLines method is X'0D', X'25', and X'0D25' (which are the EBCDIC carriage-return, line-feed, and carriage-return/line-feed characters respectively).

  2. In the following example, the contents of the file being uploaded via form field MYFILE are moved to a Stringlist:

    %contents is object stringList %myfile is longstring ... %myfile = $web_file_content('MYFILE', ,'text') ... %contents = new %contents:parseLines(%myfile)

    It would be possible (though probably awkward) to process the file in its native ASCII format, although the ASCII line-feed character is different from the EBCDIC line-feed character (interestingly, carriage return is the same), so a different delimiter set would have to be used:

    %asciiLinend is string len 32 static - initial('/' with $x2c('0D') with - '/' with $x2c('0A') with - '/' with $x2c('0A0D')) %contents is object stringList %myfile is longstring ... %myfile = $web_file_content('MYFILE') ... %contents = new %contents:parseLines(%myfile, %asciiLinend)

  3. As with the default delimiter set, one delimiter can sometimes be contained in another. That is, for example, the carriage-return and line-feed are part of the carriage-return/line-feed delimiter. In such cases, the longest matching delimiter is used regardless of the order in which the delimiters are specified. For example, consider the following statement:

    %list:parseLines('ab/+/cd+/ef+gh', ' /+/ +/ +')

    The resulting Stringlist would contain items 'ab', 'cd', 'ef', and 'gh', respectively. The character used to separate the delimiters is irrelevant, other than the obvious fact that it can't appear in any of the delimiters.

  4. The default behavior of ParseLines is to strip a trailing null line. The following would add three items to the Stringlist: 'a', 'b', and 'c'.

    %list:parseLines('a/b/c/', ' /')

    This would produce results indistinguishable from:

    %list:ParseLines('a/b/c', ' /')

    If the distinction between these two cases is important, the StripTrailingNull argument to ParseLines should be set to False:

    %list:ParseLines('a/b/c/', ' /', stripTrailingNull=false)

See also