A subject identification method based on term frequency technique

The analyzing and extracting important information from a text document is crucial and has produced interest in the area of text mining and information retrieval. This process is used in order to notice particularly in the text. Furthermore, on view of the readers that people tend to read almost eve...

全面介紹

Saved in:
書目詳細資料
Main Authors: Jamil, Nurul Syafidah, Ku-Mahamud, Ku Ruhana, Mohamed Din, Aniza, Ahmad, Faudziah, Che Pa, Noraziah, Wan Ishak, Wan Hussain, Din, Roshidi, Ahmad, Farzana Kabir
格式: Article
語言:English
出版: ACCENTS 2017
主題:
在線閱讀:http://repo.uum.edu.my/25538/1/IJACR%207%2030%202017%20%20103%20110.pdf
http://repo.uum.edu.my/25538/
http://doi.org/10.19101/IJACR.2017.730020
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:The analyzing and extracting important information from a text document is crucial and has produced interest in the area of text mining and information retrieval. This process is used in order to notice particularly in the text. Furthermore, on view of the readers that people tend to read almost everything in text documents to find some specific information. However, reading a text document consumes time to complete and additional time to extract information. Thus, classifying text to a subject can guide a person to find relevant information. In this paper, a subject identification method which is based on term frequency to categorize groups of text into a particular subject is proposed. Since term frequency tends to ignore the semantics of a document, the term extraction algorithm is introduced for improving the result of the extracted relevant terms from the text. The evaluation of the extracted terms has shown that the proposed method is exceeded other extraction techniques.