1.

What are the steps involved in preprocessing data for NLP?

Answer»

Common preprocessing steps in NLP pipelines fall into roughly four groups:

  • Preliminaries: Sentence segmentation and word tokenization.
  • Common Steps: Stop word removal, stemming and lemmatization, removing digits and punctuation, lowercasing, etc.
  • Processing Steps: Handling code-mixing, normalization, language detection, transliteration, etc.
  • Advanced Processing: Part-of-speech (POS) tagging, coreference resolution, parsing, etc.
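
To make the first two groups concrete, here is a minimal sketch in plain Python covering sentence segmentation, word tokenization, lowercasing, digit/punctuation removal, and stop word removal. The function names and the tiny stop word list are assumptions made for illustration, and the regex-based splitting is deliberately naive. Stemming, lemmatization, and the advanced steps are left out because they are usually handled by a library such as NLTK or spaCy.

```python
import re

# Tiny illustrative stop word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "and", "to", "in"}

def split_sentences(text):
    """Naive sentence segmentation: split on whitespace that follows ., ! or ?"""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]

def tokenize(sentence):
    """Naive word tokenization: runs of word characters, or single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)

def clean(tokens):
    """Lowercase, then keep only purely alphabetic tokens that are not stop words.

    The isalpha() check drops digits and punctuation in one pass.
    """
    lowered = [t.lower() for t in tokens]
    return [t for t in lowered if t.isalpha() and t not in STOP_WORDS]

def preprocess(text):
    """Return one list of cleaned tokens per sentence."""
    return [clean(tokenize(s)) for s in split_sentences(text)]

if __name__ == "__main__":
    print(preprocess("The cats are running in 2 gardens. Dogs bark!"))
    # [['cats', 'running', 'gardens'], ['dogs', 'bark']]
```

Note that the order of steps matters: lowercasing happens before the stop word check so that "The" matches "the". A rule-based splitter like this will break on abbreviations such as "Dr." and is only meant to show where each step sits in the pipeline.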

