malaytextr: Text Mining for Bahasa Malaysia

It is designed to work with text written in Bahasa Malaysia. It provides functions and datasets that make working with Bahasa Malaysia text easier. For word stemming in particular, Malay words are first looked up in a dictionary, and any "extra suffix" is then removed, as explained in Khan, Rehman Ullah, Fitri Suraya Mohamad, Muh Inam UlHaq, Shahren Ahmad Zadi Adruce, Philip Nuli Anding, Sajjad Nawaz Khan, and Abdulrazak Yahya Saleh Al-Hababi (2017). The package includes a dictionary of Malay words that may be used to perform word stemming, a dataset of Malay stop words, a dataset of sentiment words, and a dataset of normalized words.
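
The two-step stemming idea described above (dictionary lookup first, then removal of an extra suffix) can be sketched roughly as follows. This is a minimal illustration in Python, not the package's R implementation: the names `ROOT_WORDS`, `SUFFIXES`, and `stem` are hypothetical, the tiny dictionary stands in for the much larger one shipped with the package, and the suffix list is an assumption chosen for the example rather than the rule set from Khan et al. (2017).

```python
# Hypothetical mini-dictionary mapping inflected Malay words to root words.
# It only stands in for the dictionary that the package itself provides.
ROOT_WORDS = {
    "makanan": "makan",
    "minuman": "minum",
    "berjalan": "jalan",
}

# Illustrative suffixes (particles and possessives). This list is an
# assumption made for the sketch, not the package's actual rules.
SUFFIXES = ("lah", "kah", "nya", "ku", "mu")


def stem(word):
    """Return a root form: dictionary lookup first, suffix removal as fallback."""
    token = word.lower().strip()

    # Step 1: a direct dictionary hit wins.
    if token in ROOT_WORDS:
        return ROOT_WORDS[token]

    # Step 2: strip one extra suffix, then retry the dictionary.
    for suffix in SUFFIXES:
        # Keep at least three characters so short words are left alone.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            stripped = token[: -len(suffix)]
            return ROOT_WORDS.get(stripped, stripped)

    # No dictionary entry and no removable suffix: return the word unchanged.
    return token
```

Under these assumptions, `stem("makanannya")` first strips `nya` and then resolves `makanan` to `makan` through the dictionary, while a short word such as `buku` is returned unchanged because removing `ku` would leave fewer than three characters.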

Version: 0.1.3
Depends: R (≥ 2.10)
Imports: dplyr, magrittr, rlang, stringr
Suggests: rmarkdown, knitr, testthat (≥ 3.0.0)
Published: 2023-01-17
DOI: 10.32614/CRAN.package.malaytextr
Author: Zahier Nasrudin [aut, cre]
Maintainer: Zahier Nasrudin <zahiernasrudin at>
License: MIT + file LICENSE
NeedsCompilation: no
Materials: README NEWS
CRAN checks: malaytextr results


Reference manual: malaytextr.pdf
Vignettes: malaytextr


Package source: malaytextr_0.1.3.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release (arm64): malaytextr_0.1.3.tgz, r-oldrel (arm64): malaytextr_0.1.3.tgz, r-release (x86_64): malaytextr_0.1.3.tgz, r-oldrel (x86_64): malaytextr_0.1.3.tgz
Old sources: malaytextr archive

