Normalization of noisy texts in Malaysian online reviews

The process of gathering useful information from online messages has increased as more and more people use the Internet and other online applications such as Facebook and Twitter to communicate with each other.One of the problems in processing online messages is the high number of noisy texts that...

Full description

Saved in:
Bibliographic Details
Main Authors: Samsudin, Norlela, Puteh, Mazidah, Hamdan, Abdul Razak, Ahmad Nazri, Mohd Zakree
Format: Article
Language:English
Published: Universiti Utara Malaysia Press 2012
Subjects:
Online Access:http://repo.uum.edu.my/24089/1/JICT%2012%202013%20147%E2%80%93159.pdf
http://repo.uum.edu.my/24089/
http://jict.uum.edu.my/index.php/previous-issues/141-journal-of-information-and-communication-technology-jict-vol-12-2013
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uum.repo.24089
record_format eprints
spelling my.uum.repo.240892018-05-06T23:42:48Z http://repo.uum.edu.my/24089/ Normalization of noisy texts in Malaysian online reviews Samsudin, Norlela Puteh, Mazidah Hamdan, Abdul Razak Ahmad Nazri, Mohd Zakree QA75 Electronic computers. Computer science The process of gathering useful information from online messages has increased as more and more people use the Internet and other online applications such as Facebook and Twitter to communicate with each other.One of the problems in processing online messages is the high number of noisy texts that exist in these messages.Few studies have shown that the noisy texts decreased the result of text mining activities.On the other hand, very few works have investigated on the patterns of noisy texts that are created by Malaysians.In this study, a common noisy terms list and an artificial abbreviations list were created using specific rules and were utilized to select candidates of correct words for a noisy term.Later, the correct term was selected based on a bi-gram words index.The experiments used online messages that were created by the Malaysians.The result shows that normalization of noisy texts using artificial abbreviations list compliments the use of common noisy texts list. Universiti Utara Malaysia Press 2012 Article PeerReviewed application/pdf en http://repo.uum.edu.my/24089/1/JICT%2012%202013%20147%E2%80%93159.pdf Samsudin, Norlela and Puteh, Mazidah and Hamdan, Abdul Razak and Ahmad Nazri, Mohd Zakree (2012) Normalization of noisy texts in Malaysian online reviews. Journal of Information and Communication Technology, 11. pp. 147-159. ISSN 2180-3862 http://jict.uum.edu.my/index.php/previous-issues/141-journal-of-information-and-communication-technology-jict-vol-12-2013
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutionali Repository
url_provider http://repo.uum.edu.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Samsudin, Norlela
Puteh, Mazidah
Hamdan, Abdul Razak
Ahmad Nazri, Mohd Zakree
Normalization of noisy texts in Malaysian online reviews
description The process of gathering useful information from online messages has increased as more and more people use the Internet and other online applications such as Facebook and Twitter to communicate with each other.One of the problems in processing online messages is the high number of noisy texts that exist in these messages.Few studies have shown that the noisy texts decreased the result of text mining activities.On the other hand, very few works have investigated on the patterns of noisy texts that are created by Malaysians.In this study, a common noisy terms list and an artificial abbreviations list were created using specific rules and were utilized to select candidates of correct words for a noisy term.Later, the correct term was selected based on a bi-gram words index.The experiments used online messages that were created by the Malaysians.The result shows that normalization of noisy texts using artificial abbreviations list compliments the use of common noisy texts list.
format Article
author Samsudin, Norlela
Puteh, Mazidah
Hamdan, Abdul Razak
Ahmad Nazri, Mohd Zakree
author_facet Samsudin, Norlela
Puteh, Mazidah
Hamdan, Abdul Razak
Ahmad Nazri, Mohd Zakree
author_sort Samsudin, Norlela
title Normalization of noisy texts in Malaysian online reviews
title_short Normalization of noisy texts in Malaysian online reviews
title_full Normalization of noisy texts in Malaysian online reviews
title_fullStr Normalization of noisy texts in Malaysian online reviews
title_full_unstemmed Normalization of noisy texts in Malaysian online reviews
title_sort normalization of noisy texts in malaysian online reviews
publisher Universiti Utara Malaysia Press
publishDate 2012
url http://repo.uum.edu.my/24089/1/JICT%2012%202013%20147%E2%80%93159.pdf
http://repo.uum.edu.my/24089/
http://jict.uum.edu.my/index.php/previous-issues/141-journal-of-information-and-communication-technology-jict-vol-12-2013
_version_ 1644283961413730304
score 13.15806