Regex Pattern To Exlude Specific String
I have a string that have this format: some text septembar 1989  And I'm using this regex find the month and year part 
(?!=b\.)(.*?\b\d{4}\b) and I
(?!=b\.)(.*?\b\d{4}\b) and I
Solution 1:
Try this
<br/?>([^<]+)\d{4}
[^<] means match anything except an opening tag which is what you want.
Solution 2:
I written a simple code, may be you find helpful unto some extend:
import re
defgetDate(str):
 m = re.match("[\<br>]*[\w\s]*\<br>([\w\s]*[12][0-9]{3})",str);
 return m.group(1)
print getDate("some text <br>dec 1989<br>");
print getDate("<br> some text <br>septembar 1989<br>");
print getDate("grijesh chuahan <br>feb 2009<br>");
Output:
dec1989septembar1989feb2009Solution 3:
import re
ss = 'dfgqeg<br>some text <br>septembar 1989<br>'
reg = re.compile('<br(?: /)?>''(?!.+?<br(?: /)?>.+?<br(?: /)?>)''(.+?\d{4})''<br(?: /)?>')
print reg.search(ss).group(1)
.
- '<br(?: /)?>'catches- <br>and- <br />occurrences
.
- '(?!.+?<br(?: /)?>.+?<br(?: /)?>)'is a look-ahead assertion, it verifies that after the position where it starts in the analyzed text, there isn't the suite of characters described as a succession of :- .+?any kind of characters, but the- ?orders that this portion must stop as soon as- <br>or- <br />is encountered
- <br>or- <br />
- again any kind of characters stopping before <br>or<br />
- <br>or- <br />
 
Post a Comment for "Regex Pattern To Exlude Specific String"