Comparison Study of Term Weighting optimally with SVM in Sentiment Analysis

Penulis Amril Mutoi, Sutan Faisal, Tukino, Adam Puspabhuana, Manase Sahat H Simarangkir
Publisher Prosiding International Conference on Advance & Scientific Innovation (ICASI) 2018


The rapid of internet and social media users have changed the way people interact in their daily activities. For example, banking and retail began to use various social media, especially online media such as tweeter. The problem that arises is how to get information from thousands and even million data generated through social media, to be a decision as in predicting consumer satisfaction of the service or product. Another problem is the social media users in communicating using slang or local language. In sentiment analysis to predict the sentiment is not easy because it must be able to identify the words. In sentiment analysis, to overcome these problems the method used is text mining so as to process opinions from social media. The proposed approach is to analyze optimal term weighting between TF-IDF, frequency term (TF) and Binary Term Occurrence (BTO), using SVM algorithm. Target feature extraction for selection of datasets by predicting positive and negative sentiments. The result of weighting of terms approaching sentiment is using TF-IDF with SVM.

Teks Lengkap:



Chintala, S. (2012). Sentiment Analysis using neural architectures. New York University.

Cortes, C. Vapnik, V. (1995). Support-Vector Networks Machine Learning. Retrieved November 8, 2013, from 4hm87j80g/

Dhanawat, V. (n.d.). Twitter Sentiment Analysis. Retrieved from Analysis Dataset.csv

Go, A., Huang, L,Bhayani, R. (2009). Twitter sentiment analysis (Final Projects from CS224N).

Luhn, H. P. (1957). A Statistical Approach to Mechanized Encoding and Searching of Literary Information. IBM Journal of Research and DevelopmentI(4), 315.

Mostafa, M. (2013). More than words: Social networks’ text mining for consumer brand sentiments’. Expert Systems with Applications: An International Journal40(10), 4241–4251.

Pan, S., Ni, X., Sun, J., Yang, Q., & Chen, Z. (2010). Cross-domain sentiment classification via spectral feature alignment. In International World Wide Web Conference Committee (pp. 751–760).

Ravichandran, M., & Kulanthaivel, G. (2014). Twitter Sentiment Mining (TSM) Framework Based Learners Emotional State Classification and Visualization For E-Learning System. Journal of Theoretical and Applied Information Technology69(1), 84–90.

Spärck Jones, K. (1972). A Statistical Interpretation of Term Specificity and Its Application in Retrieval. Journal of Documentation28, 11–21.