It is always necessary to process any dataset or text source before working on it. Creating a text resource so that it can be further analysed and predictions can be made. This section discusses operations such as spell correction, smoothing, tokenization, stemming, and so on.