RegexSubset (Stringlist function): Difference between revisions
m (1 revision) |
m (1 revision) |
(No difference)
|
Revision as of 21:23, 7 February 2011
Create subset of Stringlist that matches a regex (Stringlist class)
This method returns a Stringlist that is a subset of the method Stringlist. The subset contains copies of all the items in the method Stringlist that are matched by a specified regex. Information about the regular expression matching rules observed is provided in "Regex processing rules".
RegexSubset accepts one required and four optional arguments, and it returns a Stringlist.
Syntax
%subsetList = sl:RegexSubset( regex, [Options= string], [Status= %output], - [StartCol= number], [EndCol= number]) Throws InvalidRegex
Syntax terms
%subsetList | A Stringlist that contains the sl items matched by regex. | ||||||||
---|---|---|---|---|---|---|---|---|---|
sl | A Stringlist object. | ||||||||
regex | A string that is interpreted as a regular expression and is applied to the sl method Stringlist items to determine whether the regex finds a match. | ||||||||
Options | This is an optional, but nameRequired, parameter supplying a string of single letter options, which may be specified in uppercase or lowercase, in any combination, and blank separated or not as you prefer. For more information about these options, see "Common regex options".
| ||||||||
Status | The Status argument (name required) is optional; if specified, it is set to an integer code. These values are possible:
| ||||||||
StartCol | The StartCol argument (name required) is an optional number that specifies the starting column of the range of columns in which the matched string must be located. If specified, number must be greater than or equal to 1 and less than or equal to the EndCol argument value. If the argument is omitted, its default value is 1. If you specify a number value that is greater than the length of a particular sl item, the regex is matched against the empty string for that item. | ||||||||
EndCol | The EndCol argument (name required) is an optional number that specifies the ending column of the range of columns in which the matched string must be located. If specified, number must be greater than or equal to 1, and greater than or equal to the StartCol argument value, and less than or equal to the lesser of 6124 or the length of sl item. If the EndCol argument is omitted, its default value is 6124. If the EndCol argument is omitted and a sl item exceeds 6124 bytes, the request is cancelled. |
Usage notes
- All errors in RegexSubset, including invalid argument(s) result in request cancellation.
- It is strongly recommended that you protect your environment from regular expression processing demands on PDL and STBL space by setting, say,
UTABLE LPDLST 3000
andUTABLE LSTBL 9000
. See "User Language programming considerations". - The regex matching is limited to the first 6124 bytes of each item, but a matched item is copied in its entirety to the output subset.
- Prior to copying matched items to %subsetList, any preexisting contents of that Stringlist are deleted.
- For information about additional methods and $functions that support regular expressions, see "Regex Processing".
- RegexSubset is available as of "sirius Mods" Version 6.9.
Examples
- In the following code fragment, RegexSubset is applied to the method Stringlist
%sl
to find the%sl
items that are matched by the regex%\([a-z]*\)
. The regex is designed to find items that contain shared methods whose class names contain only upper and lowercase letters.... %sl = new text to %sl b %doc is object xmlDoc %(daemon):getInputObject(%doc) %doc:selectSingleNode('/outer/inner'):addAttribute('foo','bar') %(daemon):returnObject(%doc) end
end text
%regex = '%\([a-z]*\)' %opt='i' %sl2 = %sl:RegexSubset (%regex, Options=%opt, Status=%st)
If (%st EQ 0) then
Print 'Status from RegexSubset is ' %st
Else
Print %regex ' matches the following items:'
End If For %i from 1 to %sl2:Count
Print 'Matching item ' %i ' is: ' %sl2:Item(%i)
End For
...
This code would print the following:
%\([a-z]*\) matches the following items: Matching item 1 is: %(daemon):getInputObject(%doc) Matching item 2 is: %(daemon):returnObject(%doc)