Analysis of Stemming Influence on Indonesian Tweet Classification
Abstract: Stemming has been
commonly used by some researchers in natural language processing area such as
text mining, text classification, and information retrieval. In information
retrieval, stemming mayhelp to raise retrieval performance. However, there is
an indication that stemming does not hand oversignificant influence toward the
accuracy in text classification. Therefore, this paper analyzes further research
about the influence of stemming on tweet classification in Bahasa Indonesia.
This work examines about the accuracy result between two conditions by
involving stemming and without involving stemming in pre-processing task for
tweet classification. The contribution of this research is to find out a better
preprocessing task in order to obtain good accuracy in text classification.
According to the experiments, it is observed that all accuracy results in tweet
classification tend to decrease. Stemming task does not raise the accuracy
either using SVM or Naive Bayes algorithm. Therefore, this work summarized that
stemming process does not affect significantly towards the accuracy
performance.
Author: Ahmad Fathan
Hidayatullah
Journal Code: jptkomputergg160251
Artikel Terkait :
Jp Teknik Komputer gg 2016
- An Improved Artificial Bee Colony Algorithm for Staged Search
- Multi-Criteria in Discriminant Analysis to Find the Dominant Features
- A Novel Multifunction Digital Chip Design Based on CMOS Technology
- An Improved Adaptive Niche Differential Evolution Algorithm
- Power Quality Analysis of Integration Photovoltaic Generator to Three Phase Grid under Variable Solar Irradiance Level
- A Combined User-order and Chunk-order Algorithm to Minimize the Average BER for Chunk Allocation in SCFDMA Systems
- An Optimized Model for MapReduce Based on Hadoop
- Hierarchical i* Modeling in Requirement Engineering
- The Optimal High Performance Computing Infrastructure for Solving High Complexity Problem
- Transformer Fault Diagnosis Method Based on Dynamic Weighted Combination Model
- Hybrid Hierarchical Collision Detection Based on Data Reuse
- Brightness and Contrast Modification in Ultrasonography Images Using Edge Detection Results
- Recognition of Fission Signals Based on Wavelet Analysis and Neural Network
- Application of Nonlinear Dynamical Methods for Arc Welding Quality Monitoring
- Chaos-Enhanced Cuckoo Search for Economic Dispatch with Valve Point Effects
- GPU CUDA Accelerated Image Inpainting using Fourth Order PDE Equation
- Comparative Analysis of Spatial Decision Tree Algorithms for Burned Area of Peatland in Rokan Hilir Riau
- Action Recognition of Human’s Lower Limbs Based on a Human Joint
- A Soft Error Study on Tri-gate Based FinFET and Junctionless-FinFET 6T SRAM Cell - A Comparison
- Design of AC Charging Interface and Status Acquisition Circuit for Electric Vehicles
- Towards Smooth and High-Quality Bitrate Adaptation for HTTP Adaptive Streaming
- Features Deletion on Multiple Objects Recognition
- Research on Batch Scheduling in Cloud Computing
- Classification of Motorcyclists not Wear Helmet on Digital Image with Backpropagation Neural Network
- Internet Protocol Based Satellite On-Board System