Sometimes, we want to use Python regular expression with Unicode.
In this article, we’ll look at how to use Python regular expression with Unicode.
How to use Python regular expression with Unicode?
To use Python regular expression with Unicode, we can use the re.compile method.
For instance, we write
myre = re.compile(ur'[\u0000\u0000\u0000\u0000\u0000\u0000]+',
re.UNICODE)
to call re.compile with the Unicode regex string ur'[\u0000\u0000\u0000\u0000\u0000\u0000]+'.
The 2nd argument is re.UNICODE so re.compile will treat the first argument as a Unicode regex string.
Conclusion
To use Python regular expression with Unicode, we can use the re.compile method.