Sometimes, we want to remove emojis from a string in Python.
In this article, we’ll look at how to remove emojis from a string in Python.
How to remove emojis from a string in Python?
To remove emojis from a string in Python, we can create a regex that matches a list of emojis.
For instance, we write
import re
text = u'This is a smiley face \U0001f602'
print(text) # with emoji
def de_emojify(text):
    regex_pattern = re.compile(pattern = "["
        u"\U0001F600-\U0001F64F" 
        u"\U0001F300-\U0001F5FF"
        u"\U0001F680-\U0001F6FF"
        u"\U0001F1E0-\U0001F1FF"
                           "]+", flags = re.UNICODE)
    return regex_pattern.sub(r'', text)
print(de_emojify(text))
to call re.compile with pattern set to a string that matches the character code ranges for emojis.
\U0001F600-\U0001F64F is the code range for emoticons.
\U0001F300-\U0001F5FF is the range for symbols and pictographs.
\U0001F680-\U0001F6FF" is the range for transport and map symbols.
\U0001F1E0-\U0001F1FF is the range for flag emojis in iOS.
Then we call regex_pattern.sub to replace the emojis in text with empty strings.
Conclusion
To remove emojis from a string in Python, we can create a regex that matches a list of emojis.
