
Data Processing with Optimus
By :

In any text, it is common to find a word in multiple forms. See these, for example:
All these words have the unique root Truck
. The words in the list are called inflections.
The following is a quote from Wikipedia:
In grammar, inflection is the modification of a word to express different grammatical categories such as tense, case, voice, aspect, person, number, gender, and mood. An inflection expresses one or more grammatical categories with a prefix, suffix, or infix, or another internal modification such as a vowel change.
Changing a word from its inflected form to its root form is called word normalization.
In natural language processing, there are two main techniques to achieve this: stemming and lemmatization.
While stemming, we use an algorithm to reduce the word to its stems. This is not the case for lemmatization, in which we use the language's morphological...