Normalization of noisy texts in Malaysian online reviews
The process of gathering useful information from online messages has increased as more and more people use the Internet and other online applications such as Facebook and Twitter to communicate with each other.One of the problems in processing online messages is the high number of noisy texts that...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universiti Utara Malaysia Press
2012
|
Subjects: | |
Online Access: | http://repo.uum.edu.my/24089/1/JICT%2012%202013%20147%E2%80%93159.pdf http://repo.uum.edu.my/24089/ http://jict.uum.edu.my/index.php/previous-issues/141-journal-of-information-and-communication-technology-jict-vol-12-2013 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.uum.repo.24089 |
---|---|
record_format |
eprints |
spelling |
my.uum.repo.240892018-05-06T23:42:48Z http://repo.uum.edu.my/24089/ Normalization of noisy texts in Malaysian online reviews Samsudin, Norlela Puteh, Mazidah Hamdan, Abdul Razak Ahmad Nazri, Mohd Zakree QA75 Electronic computers. Computer science The process of gathering useful information from online messages has increased as more and more people use the Internet and other online applications such as Facebook and Twitter to communicate with each other.One of the problems in processing online messages is the high number of noisy texts that exist in these messages.Few studies have shown that the noisy texts decreased the result of text mining activities.On the other hand, very few works have investigated on the patterns of noisy texts that are created by Malaysians.In this study, a common noisy terms list and an artificial abbreviations list were created using specific rules and were utilized to select candidates of correct words for a noisy term.Later, the correct term was selected based on a bi-gram words index.The experiments used online messages that were created by the Malaysians.The result shows that normalization of noisy texts using artificial abbreviations list compliments the use of common noisy texts list. Universiti Utara Malaysia Press 2012 Article PeerReviewed application/pdf en http://repo.uum.edu.my/24089/1/JICT%2012%202013%20147%E2%80%93159.pdf Samsudin, Norlela and Puteh, Mazidah and Hamdan, Abdul Razak and Ahmad Nazri, Mohd Zakree (2012) Normalization of noisy texts in Malaysian online reviews. Journal of Information and Communication Technology, 11. pp. 147-159. ISSN 2180-3862 http://jict.uum.edu.my/index.php/previous-issues/141-journal-of-information-and-communication-technology-jict-vol-12-2013 |
institution |
Universiti Utara Malaysia |
building |
UUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Utara Malaysia |
content_source |
UUM Institutionali Repository |
url_provider |
http://repo.uum.edu.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Samsudin, Norlela Puteh, Mazidah Hamdan, Abdul Razak Ahmad Nazri, Mohd Zakree Normalization of noisy texts in Malaysian online reviews |
description |
The process of gathering useful information from online messages has increased as more and more people use the Internet and other online applications such as Facebook and Twitter to
communicate with each other.One of the problems in processing online messages is the high number of noisy texts that exist in these messages.Few studies have shown that the noisy texts decreased the result of text mining activities.On the other hand, very few works have investigated on the patterns of noisy texts that are created by Malaysians.In this study, a common noisy
terms list and an artificial abbreviations list were created using specific rules and were utilized to select candidates of correct words for a noisy term.Later, the correct term was selected
based on a bi-gram words index.The experiments used online messages that were created by the Malaysians.The result shows that normalization of noisy texts using artificial abbreviations list
compliments the use of common noisy texts list. |
format |
Article |
author |
Samsudin, Norlela Puteh, Mazidah Hamdan, Abdul Razak Ahmad Nazri, Mohd Zakree |
author_facet |
Samsudin, Norlela Puteh, Mazidah Hamdan, Abdul Razak Ahmad Nazri, Mohd Zakree |
author_sort |
Samsudin, Norlela |
title |
Normalization of noisy texts in Malaysian online reviews |
title_short |
Normalization of noisy texts in Malaysian online reviews |
title_full |
Normalization of noisy texts in Malaysian online reviews |
title_fullStr |
Normalization of noisy texts in Malaysian online reviews |
title_full_unstemmed |
Normalization of noisy texts in Malaysian online reviews |
title_sort |
normalization of noisy texts in malaysian online reviews |
publisher |
Universiti Utara Malaysia Press |
publishDate |
2012 |
url |
http://repo.uum.edu.my/24089/1/JICT%2012%202013%20147%E2%80%93159.pdf http://repo.uum.edu.my/24089/ http://jict.uum.edu.my/index.php/previous-issues/141-journal-of-information-and-communication-technology-jict-vol-12-2013 |
_version_ |
1644283961413730304 |
score |
13.15806 |