RegexMatch (String function): Difference between revisions
m (match syntax diagram to revised template; fix tags and links) |
m (Apply Dave's comments) |
||
Line 1: | Line 1: | ||
{{Template:String:RegexMatch subtitle}} | {{Template:String:RegexMatch subtitle}} | ||
The <var>RegexMatch</var> [[Intrinsic classes|intrinsic]] function determines whether a given pattern (regular expression, or "regex") matches within a given string according to the | The <var>RegexMatch</var> [[Intrinsic classes|intrinsic]] function determines whether a given pattern (regular expression, or "regex") matches within a given string according to the <var>[[Regex_processing#Regex_rules|"rules"]]</var> of regular expression matching. | ||
==Syntax== | ==Syntax== | ||
Line 14: | Line 14: | ||
<td>A string that is interpreted as a regular expression and is applied to the method object <var class="term">string</var> to determine whether the regex matches <var class="term">string</var>.</td></tr> | <td>A string that is interpreted as a regular expression and is applied to the method object <var class="term">string</var> to determine whether the regex matches <var class="term">string</var>.</td></tr> | ||
<tr><th>Options</th> | <tr><th>Options</th> | ||
<td>This is an optional, but <var class="term">nameRequired</var>, parameter supplying a string of single letter options, which may be specified in uppercase or lowercase, in any combination, and blank separated or not as you prefer. For more information about these options, see <var>[[Regex_processing#Common_regex_options|Common regex options]]</var>. | <td>This is an optional, but <var class="term">nameRequired</var>, parameter supplying a string of single letter options, which may be specified in uppercase or lowercase, in any combination, and blank separated or not as you prefer. For more information about these options, see <var>[[Regex_processing#Common_regex_options|"Common regex options"]]</var>. | ||
<table class="syntaxNested"> | <table class="syntaxNested"> | ||
<tr><th>I</th> | <tr><th>I</th> | ||
Line 23: | Line 23: | ||
<td>Multi-line mode: let anchor characters match end-of-line indicators <b><i>wherever</i></b> the indicator appears in the input string. <var class="term">M</var> mode is ignored if <var class="term">C</var> (XML Schema) mode is specified.</td></tr> | <td>Multi-line mode: let anchor characters match end-of-line indicators <b><i>wherever</i></b> the indicator appears in the input string. <var class="term">M</var> mode is ignored if <var class="term">C</var> (XML Schema) mode is specified.</td></tr> | ||
<tr><th>C</th> | <tr><th>C</th> | ||
<td>Do the match according to <var>[[Regex_processing#XML_Schema_mode|XML Schema regex rules]]</var>. Each <var class="term">regex</var> is implicitly anchored at the beginning and end, and no characters serve as anchors.</td></tr> | <td>Do the match according to <var>[[Regex_processing#XML_Schema_mode|"XML Schema regex rules"]]</var>. Each <var class="term">regex</var> is implicitly anchored at the beginning and end, and no characters serve as anchors.</td></tr> | ||
</table></td></tr> | </table></td></tr> | ||
<tr><th>CaptureList</th><td></td></tr> | <tr><th>CaptureList</th><td></td></tr> | ||
Line 36: | Line 36: | ||
==Usage notes== | ==Usage notes== | ||
<ul><li>It is strongly recommended that you protect your environment from regular expression processing demands on PDL and STBL space by setting, say, <code>UTABLE LPDLST 3000</code> and <code>UTABLE LSTBL 9000</code>. See <var>[[Regex_processing#User_Language_programming_considerations|User Language programming considerations]]</var>. | <ul><li>It is strongly recommended that you protect your environment from regular expression processing demands on PDL and STBL space by setting, say, <code>UTABLE LPDLST 3000</code> and <code>UTABLE LSTBL 9000</code>. See <var>[[Regex_processing#User_Language_programming_considerations|"User Language programming considerations"]]</var>. | ||
<li>For information about additional methods that support regular expressions, see <var>[[Regex_processing|Regex Processing]]</var>. | <li>For information about additional methods that support regular expressions, see <var>[[Regex_processing|"Regex Processing"]]</var>. | ||
<li><var>RegexMatch</var> is something of a misnomer. It does not determine if a string matches a regular expression, it determines if a string <b><i>contains</i></b> a substring that matches a regular expression. <var>RegexMatch</var> behaves more like a matching method if the regular expression is "anchored" (begins with a caret ('<code>ˆ</code>') and ends with a dollar sign ('<code>$</code>')), or if the C option indicates XML Schema mode. | <li><var>RegexMatch</var> is something of a misnomer. It does not determine if a string matches a regular expression, it determines if a string <b><i>contains</i></b> a substring that matches a regular expression. <var>RegexMatch</var> behaves more like a matching method if the regular expression is "anchored" (begins with a caret ('<code>ˆ</code>') and ends with a dollar sign ('<code>$</code>')), or if the C option indicates XML Schema mode. | ||
<li><var>RegexMatch</var> is available as of <var class="product">[[Sirius Mods]]</var> Version 7.2.</ul> | <li><var>RegexMatch</var> is available as of <var class="product">[[Sirius Mods]]</var> Version 7.2.</ul> | ||
Line 70: | Line 70: | ||
==See also== | ==See also== | ||
<ul><li>For details of the <var>printtext</var> statement, please see <var>[[Intrinsic classes#printtext|printText]]</var></ul> | <ul><li>For details of the <var>printtext</var> statement, please see <var>[[Intrinsic classes#printtext|printText]]</var>.</ul> | ||
{{Template:String:RegexMatch footer}} | {{Template:String:RegexMatch footer}} |
Revision as of 22:41, 3 February 2011
Position after match of regex (String class)
The RegexMatch intrinsic function determines whether a given pattern (regular expression, or "regex") matches within a given string according to the "rules" of regular expression matching.
Syntax
%number = string:RegexMatch( regex, [Options= string], - [CaptureList= stringlist]) Throws InvalidRegex
Syntax terms
%number | A variable to return the position of the character after the last character matched, or a zero if no characters in the method object string match the regular expression. | ||||||||
---|---|---|---|---|---|---|---|---|---|
string | The input string, to which the regular expression regex is applied. | ||||||||
regex | A string that is interpreted as a regular expression and is applied to the method object string to determine whether the regex matches string. | ||||||||
Options | This is an optional, but nameRequired, parameter supplying a string of single letter options, which may be specified in uppercase or lowercase, in any combination, and blank separated or not as you prefer. For more information about these options, see "Common regex options".
| ||||||||
CaptureList |
Exceptions
RegexMatch can throw the following exceptions:
- InvalidRegex
- If the regex parameter does not contain a valid regular expression. The exception object indicates the position of the character in the regex parameter where it was determined that the regular expression is invalid, and a description of the nature of the error.
Usage notes
- It is strongly recommended that you protect your environment from regular expression processing demands on PDL and STBL space by setting, say,
UTABLE LPDLST 3000
andUTABLE LSTBL 9000
. See "User Language programming considerations". - For information about additional methods that support regular expressions, see "Regex Processing".
- RegexMatch is something of a misnomer. It does not determine if a string matches a regular expression, it determines if a string contains a substring that matches a regular expression. RegexMatch behaves more like a matching method if the regular expression is "anchored" (begins with a caret ('
ˆ
') and ends with a dollar sign ('$
')), or if the C option indicates XML Schema mode. - RegexMatch is available as of Sirius Mods Version 7.2.
Examples
- The following example tests whether the regex '\*bc?[5-8]' contains a substring that matches 'a*b6'.
begin %rc float %regex longstring %string longstring %regex = '\*bc?[5-8]' %string = 'a\*b6' %rc = %string:regexmatch(%regex) if %rc then printText '{%regex}' matches '{%string}' else printText '{%regex}' does not match '{%string}' end if end
The regex matches the input string; the example result is:
'\*bc?[5-8]' matches 'a\*b6'
This regex demonstrates the following:
- To match a string, a regex pattern must merely "fit" a substring of the string.
- Metacharacters, in this case star ('
*
'), must be escaped. - An optional character ('
c?
') may fail to find a match, but this does not prevent the success of the overall match. - The character class range ('
[5-8]
') matches the '6
' in the input string.
See also
- For details of the printtext statement, please see printText.