Symbols for Regular Expressions

^         Start of line
$         End of line
n?        Zero or one occurrence of the character ‘n’
n*        Zero or more occurrences of the character ‘n’
n+        One or more occurrences of the character ‘n’
n{2}      Exactly two occurrences of ‘n’
n{2,}     Two or more occurrences of ‘n’
n{2,4}    From 2 to 4 occurrences of ‘n’
.         Any single character (in most engines, anything except a newline)
()        Parentheses to group expressions
(.*)      Zero or more occurrences of any character, i.e. anything
(n|a)     Either ‘n’ or ‘a’
[1-6]     Any single digit from 1 to 6
[c-h]     Any single lower case letter from c to h
[D-M]     Any single upper case letter from D to M
[^a-z]    Any single character EXCEPT a lower case letter from a to z
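
The table can be checked against a few sample strings. The sketch below assumes Python's re module (the post does not name a regex engine), and the sample strings and the helper names check and run_cases are made up for illustration.

```python
import re

# (pattern, sample text, whether a match is expected)
CASES = [
    (r"^ab", "abc", True),              # ^ anchors at the start of the line
    (r"bc$", "abc", True),              # $ anchors at the end of the line
    (r"^an?a$", "aa", True),            # n? allows zero 'n'
    (r"^an*a$", "annna", True),         # n* allows any number of 'n'
    (r"^an+a$", "aa", False),           # n+ needs at least one 'n'
    (r"^an{2}a$", "anna", True),        # exactly two 'n'
    (r"^an{2,}a$", "ana", False),       # one 'n' is too few
    (r"^an{2,4}a$", "annnnna", False),  # five 'n' is too many
    (r"^a.c$", "a-c", True),            # . stands for any single character
    (r"^(n|a)$", "a", True),            # either 'n' or 'a'
    (r"^[1-6]$", "7", False),           # 7 is outside 1-6
    (r"^[c-h]$", "d", True),            # d lies between c and h
    (r"^[D-M]$", "d", False),           # ranges are case sensitive
    (r"^[^a-z]$", "Q", True),           # anything but a lower case letter
]


def check(pattern, text):
    """Return True if the pattern matches somewhere in text."""
    return re.search(pattern, text) is not None


def run_cases():
    """Return the cases whose result differs from the expectation."""
    return [(p, t) for p, t, expected in CASES if check(p, t) != expected]


if __name__ == "__main__":
    print(run_cases())
```

Each pattern is wrapped in ^ and $ so that the quantifier has to account for the whole sample string; without the anchors, a pattern such as n{2,4} would also be found inside a longer run of ‘n’.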

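Pitfall: the ^ symbol acts as an EXCEPT rule only when it is the very first character inside a range. The negation covers everything listed in the range, including the ^ symbol itself if it appears again later in the range. Anywhere else inside a range, ^ is an ordinary ^ character.
Outside a range, ^ at the start of the expression means “start of line”. How a ^ in the middle of a pattern is treated depends on the engine (some treat it as a regular ^ symbol, others still as an anchor), but it never negates what follows. In other words, you cannot deny a word with ^undesired_word or a group with ^(undesired_phrase); those patterns match lines that begin with the word or phrase.
Excluding a whole word needs a different tool, such as a negative lookahead in engines that support one, or inverting the result of the match in the calling code.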

[_4^a-zA-Z]    Any single character that is an underscore, the digit 4, the ^ symbol, or any letter, lower or upper case
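
The different roles of ^ can be seen side by side in a short sketch. It assumes Python's re module; the sample strings are invented, and the negative lookahead (?!...) at the end is a feature of Python and PCRE-style engines that the table above does not cover, shown only as one possible way to exclude a word.

```python
import re

# ^ as the first character inside brackets negates the whole class.
not_lower = re.findall(r"[^a-z]", "ab^C")

# ^ later inside brackets is an ordinary character to match.
literal_caret = re.findall(r"[_4^a-zA-Z]", "4^_x!7")

# A second ^ inside a negated class is excluded like everything else listed.
neither_a_nor_caret = re.findall(r"[^a^]", "a^b")

# ^ at the start of the expression is an anchor, not a negation:
# this matches lines that BEGIN with "cat".
starts_with_cat = re.search(r"^cat", "cat food") is not None
cat_later = re.search(r"^cat", "my cat") is not None

# Assumption: the engine supports negative lookahead.
# This matches only lines that do NOT begin with "cat".
no_cat_ok = re.search(r"^(?!cat).*$", "dog food") is not None
no_cat_rejected = re.search(r"^(?!cat).*$", "cat food") is not None

if __name__ == "__main__":
    print(not_lower, literal_caret, neither_a_nor_caret)
    print(starts_with_cat, cat_later, no_cat_ok, no_cat_rejected)
```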

?, +, * and the {} counts can be appended not only to a single character, but also to a group () or a range [].
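
A brief sketch of counts applied to a group and to a range, again assuming Python's re module and made-up sample strings. re.fullmatch is used so that the whole string has to fit the pattern.

```python
import re


def whole(pattern, text):
    """Return True if the pattern matches the entire text."""
    return re.fullmatch(pattern, text) is not None


group_plus_ok = whole(r"(ab)+", "ababab")   # the group "ab" repeated three times
group_plus_bad = whole(r"(ab)+", "aba")     # trailing "a" is not a full "ab"
range_count_ok = whole(r"[0-9]{3}", "042")  # exactly three digits
range_count_bad = whole(r"[0-9]{3}", "42")  # only two digits
group_optional = whole(r"(na)?", "")        # an optional group may be absent
range_star = whole(r"[a-c]*x", "abcabx")    # any run of a, b or c, then x

if __name__ == "__main__":
    print(group_plus_ok, group_plus_bad, range_count_ok,
          range_count_bad, group_optional, range_star)
```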

Therefore,
^.{2}[a-z]{1,2}_?[0-9]*([1-6]|[a-f])[^1-9]{2}a+$
would mean:

^.{2}          = A line beginning with any two characters,
[a-z]{1,2}     = followed by either 1 or 2 lower case letters,
_?             = followed by an optional underscore,
[0-9]*         = followed by zero or more digits,
([1-6]|[a-f])  = followed by either a digit between 1 and 6 OR a lower case letter between a and f,
[^1-9]{2}      = followed by any two characters except digits between 1 and 9 (0 is possible),
a+$            = followed by one or more occurrences of ‘a’ at the end of the line.
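
The breakdown above can be tried out as follows. This is a sketch assuming Python's re module; the sample strings are invented, and re.VERBOSE is used only so that each part of the pattern can carry its own comment.

```python
import re

# The expression from the post, written compactly.
COMPACT = re.compile(r"^.{2}[a-z]{1,2}_?[0-9]*([1-6]|[a-f])[^1-9]{2}a+$")

# The same expression, one part per line (whitespace and comments are
# ignored by the engine in VERBOSE mode).
ANNOTATED = re.compile(r"""
    ^.{2}           # any two characters at the start of the line
    [a-z]{1,2}      # one or two lower case letters
    _?              # an optional underscore
    [0-9]*          # zero or more digits
    ([1-6]|[a-f])   # a digit from 1 to 6 or a lower case letter from a to f
    [^1-9]{2}       # two characters that are not digits from 1 to 9
    a+$             # one or more 'a' at the end of the line
""", re.VERBOSE)

# Hypothetical sample lines.
SAMPLES = [
    "XYab_12340Zaa",   # matches: XY, ab, _, 123, 4, 0Z, aa
    "--qf00a",         # matches: --, q, (no _), (no digits), f, 00, a
    "XYab_123495aa",   # no match: 9 and 5 are digits from 1 to 9
    "XYab_12340Z",     # no match: the line does not end in 'a'
]


def group_or_none(regex, text):
    """Return what the parenthesised group captured, or None if no match."""
    m = regex.search(text)
    return m.group(1) if m else None


captures = [group_or_none(ANNOTATED, s) for s in SAMPLES]

if __name__ == "__main__":
    for sample, captured in zip(SAMPLES, captures):
        print(sample, "->", captured)
```

In the first sample the digits are greedy at first, so [0-9]* initially takes 12340; the engine then backtracks until 4 is left over for the ([1-6]|[a-f]) group and 0Z for [^1-9]{2}.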


11.Oct.08 PHP, Programming

