A bi-annotated Malay-English code-switching (Manglish) dataset of X posts for biological gender identification and authorship attribution
Low-resource languages, like Malay, face the threat of extinction when linguistic resources become scarce. This paper addresses the scarcity issue by contributing to the inventory of low-resource languages, specifically focusing on Malay-English, known as Manglish. Manglish speakers are primarily...
Saved in:
| Main Authors: | , , , , , , , |
|---|---|
| Format: | Article |
| Language: | en |
| Published: |
Elsevier
2024
|
| Subjects: | |
| Online Access: | http://eprints.uthm.edu.my/10920/1/J17377_a3b15f369ba6e61ca5517eaf40899173.pdf http://eprints.uthm.edu.my/10920/ https://doi.org/10.1016/j.dib.2024.110034 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!
