InterviewSolution
| 1. |
What is the meaning of Text Normalization in NLP? |
|
Answer» Consider a situation in which we are working with a set of social media posts to detect news events. Social media text can be very different from the language we would see in, say, newspapers. A word may be spelt in multiple ways, such as in shortened forms or with and without hyphens, names are often written in lowercase, and so on. When we are developing NLP tools to work with this kind of data, it is beneficial to obtain a canonical representation of the text that maps all of these variations onto a single form. This is referred to as text normalization. Converting all text to lowercase or uppercase, converting digits to text (e.g., 7 to seven), expanding abbreviations, and so on are some common text normalization steps. |
|
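The normalization steps named in the answer can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a production normalizer: the `normalize` function, the `ABBREVIATIONS` table, and the `DIGIT_WORDS` table are hypothetical names introduced here, and the abbreviation entries are made-up examples. It assumes English text and only spells out single digits.

```python
import re

# Hypothetical abbreviation table, for illustration only.
# A real system would use a much larger, domain-specific list.
ABBREVIATIONS = {"u": "you", "pls": "please", "b4": "before"}

# Single digits only; multi-digit numbers are left unchanged in this sketch.
DIGIT_WORDS = {
    "0": "zero", "1": "one", "2": "two", "3": "three", "4": "four",
    "5": "five", "6": "six", "7": "seven", "8": "eight", "9": "nine",
}


def normalize(text: str) -> str:
    """Map surface variations of a text onto one canonical form."""
    # Step 1: case folding, so "Email" and "EMAIL" become the same string.
    text = text.lower()
    # Step 2: drop hyphens inside words, so "e-mail" and "email" match.
    text = re.sub(r"(?<=\w)-(?=\w)", "", text)
    # Step 3: keep runs of ASCII letters and digits; punctuation is discarded.
    tokens = re.findall(r"[a-z0-9]+", text)
    # Step 4: expand known abbreviations and spell out single digits.
    normalized = []
    for token in tokens:
        if token in ABBREVIATIONS:
            normalized.append(ABBREVIATIONS[token])
        elif token in DIGIT_WORDS:
            normalized.append(DIGIT_WORDS[token])
        else:
            normalized.append(token)
    return " ".join(normalized)


print(normalize("Pls E-mail me B4 7 PM"))
```

With these assumed tables, "e-mail", "E-Mail" and "EMAIL" all map to the same token, which is the point of a canonical representation. Which steps to apply depends on the task: lowercasing, for instance, can hurt if capitalization is needed to recognize names.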