A study of stemming algorithms in text mining
Abstract
Text mining is a term used as an alternative to knowledge discovery on the Web. A central task in text mining is finding the documents relevant to a user query, and deciding which documents should be retrieved is difficult. The query terms are matched against the terms of each document, and the decision is either binary (retrieved or rejected) or an estimate of the document's relevance. Because terms occur in many morphological variants, stemming is needed to reduce the variants to a common stem, which also reduces the size of the document representation. Stemming therefore decreases both processing time and storage space. This paper surveys stemming algorithms and discusses the factors used to measure a stemmer's efficiency and strength.
Key words: Text Mining, Stemming, Morphological terms, Porter Stemmer
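The matching process described in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the suffix list, the minimum stem length, and the function names `toy_stem`, `stem_terms`, `binary_match`, and `relevance` are assumptions made for illustration, and the suffix stripper is a deliberately crude stand-in rather than the Porter Stemmer.

```python
import re

# Hypothetical suffix list for illustration only; this is NOT the Porter algorithm.
SUFFIXES = ("ing", "ed", "es", "s")


def toy_stem(word):
    """Strip the first matching suffix, keeping a stem of at least three letters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def stem_terms(text):
    """Lowercase the text, split it into alphabetic terms, and return the set of stems."""
    return {toy_stem(term) for term in re.findall(r"[a-z]+", text.lower())}


def binary_match(query, document):
    """Binary decision: retrieve the document if it shares any stem with the query."""
    return bool(stem_terms(query) & stem_terms(document))


def relevance(query, document):
    """Estimated relevance: fraction of query stems that also occur in the document."""
    query_stems = stem_terms(query)
    if not query_stems:
        return 0.0
    return len(query_stems & stem_terms(document)) / len(query_stems)


if __name__ == "__main__":
    doc = "The document was connected"
    print(binary_match("connecting documents", doc))
    print(relevance("connecting documents", doc))
    print(relevance("stemming algorithms", doc))
```

Without stemming, "connecting" and "connected" would be treated as different terms and the query would miss the document; mapping both to one stem is also what shrinks the stored vocabulary.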
License
International Journal of Engineering Technology and Computer Research (IJETCR) by Articles is licensed under a Creative Commons Attribution 4.0 International License.