Semantic Classification Model for Twitter Dataset Using Wordnet

Abstract

Twitter is an emerged field in today social media. As twitters increasing, an increasing demand is emerged to mine these twitters and extract useful information. Traditional classification methods have a problem with tweets due to its short sentences. This paper handles the problem of classifying tweets by adapting bag of words feature with semantic tools for natural processing language. The experiments showed a stable performance of classifications in accuracy compared with traditional features of text classification.

Country : Yemen

1 Seham A. Bamatraf2 Rasha A. Bin-Thalab

  1. Department of Computer Engineering, College of Engineering & Petroleum, Hadhramout University, Mukalla, Yemen
  2. Department of Computer Engineering, College of Engineering & Petroleum, Hadhramout University, Mukalla, Yemen

IRJIET, Volume 5, Issue 2, February 2021 pp. 5-9

doi.org/10.47001/IRJIET/2021.502002

References

  1. Twitter Inc., ‘Twitter turns six’, Twitter turns six, Mar. 21, 2012. https://blog.twitter.com/official/en_us/a/2012/twitter-turns-six.html.
  2. R. Szymanski, ‘How to Collect Big Data Sets From Twitter’. DZone, Jun. 07, 2019, Accessed: Apr. 12, 2020. [Online]. Available: https://dzone.com/articles/how-to-collect-big-data-from-twitter-for-sentiment.
  3. P. J. Tighe, R. C. Goldsmith, M. Gravenstein, R. Bernard, and R. B. Fillingim, ‘The Painful Tweet: Text, Sentiment, and Community Structure Analyses of Tweets Pertaining to Pain’, vol. 17, no. 4.
  4. C. Kingston, J. R. C. Nurse, I. Agrafiotis, and A. B. Milich, ‘Using semantic clustering to support situation awareness on Twitter: the case of world views’, Hum.-Centric Comput. Inf. Sci., vol. 8, no. 1, p. 22, Jul. 2018, doi: 10.1186/s13673-018-0145-6.
  5. H. C. Wu, R. W. Pong Luk, K.-F. Wong, and K.-L. Kwok, ‘Interpreting TF-IDF term weights as making relevance decisions’, ACM Trans. Inf. Syst., vol. 26, no. 13, p. 13:1-13:37, 2008.
  6. Y. Zhang, R. Jin, and Z.-H. Zhou, ‘Understanding bag-of-words model: a statistical framework’, Int. J. Mach. Learn. Cybern., vol. 1, no. 1, pp. 43–52, Dec. 2010, doi: 10.1007/s13042-010-0001-0.
  7. P. Selvaperumal and A. Suruliandi, ‘A short message classification algorithm for tweet classification’, presented at the Inerantional Conference  Recent Trends in Information Technology (ICRTIT), 2014.
  8. A. Zubiaga, D. Spina, V. F. Fernández, and R. Martínez-Unanue, ‘Real-Time Classification of Twitter Trends’, J. Assoc. Inf. Sci. Technol., vol. 66, no. 3, pp. 462–473, 2015.
  9. Q. Li, S. Shah, M. Ghassemi, R. Fang, A. Nourbakhsh, and X. Liu, ‘Using Paraphrases to Improve Tweet Classification: Comparing WordNet and Word Embedding Approaches’, presented at the IEEE International Conference on Big Data, 2016.
  10. ‘BabelNet’, BabelNet, 2009. https://babelnet.org/about (accessed Apr. 13, 2020).
  11. ‘WordNet’, WordNet, 2005. https://wordnet.princeton.edu/ (accessed Apr. 13, 2020).
  12. ‘Twitter Sentiment Analysis Training Corpus (Dataset)’, Twitter Sentiment Analysis Training Corpus (Dataset), Sep. 22, 2012. http://thinknook.com/twitter-sentiment-analysis-training-corpus-dataset-2012-09-22/ (accessed Apr. 15, 2020).
  13. N. S. Altman, ‘An introduction to kernel and nearest-neighbor nonparametric regression’, Am. Stat., vol. 46, no. 3, pp. 175–185, 1992.
  14. C. Cortes and V. N. Vapnik, ‘Support-vector networks’, Mach. Learn., vol. 20, no. 3, pp. 273–297, 1995.
  15. L. Rokach and O. Maimon, Data mining with decision trees: theory and applications. World Scientific Pub Co Inc, 2008.
  16. T. K. Ho, ‘Random Decision Forests’, in Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, 1995, pp. 14–16.
  17. A. and J. Rehman Javed, ‘Ensemble adaboost classifier for accurate and fast detection of botnet attacks in connected vehicles’, p. e4088, 2020.
  18. C. D. Manning, P. Raghavan, and H. Schütze, ‘Naive Bayes text classification’, in Introduction to Information Retrieval, Cambridge University Press, 2008.
  19. V. Korde and C. N. Mahender, ‘TEXT CLASSIFICATION AND CLASSIFIERS: A SURVEY’, Int. J. Artif. Intell. Appl., vol. 3, no. 2, pp. 85–99, Mar. 2012.
  20. D. Ignatov and A. Ignatov, ‘Decision stream: Cultivating deep decision trees’, in 2017 ieee 29th international conference on tools with artificial intelligence (ictai), Nov. 2017, pp. 905--912.