Semantic Classification Model for Twitter Dataset Using Wordnet

Seham A. Bamatraf; Rasha A. Bin-Thalab

doi:https://doi.org/10.47001/IRJIET/2021.502002


Seham A. Bamatraf, Department of Computer Engineering, College of Engineering & Petroleum, Hadhramout University, Mukalla, Yemen
Rasha A. Bin-Thalab, Department of Computer Engineering, College of Engineering & Petroleum, Hadhramout University, Mukalla, Yemen

Vol 5 No 2 (2021): Volume 5, Issue 2, February 2021 | Pages: 5-9

International Research Journal of Innovations in Engineering and Technology

OPEN ACCESS | Research Article | Published Date: 07-02-2021


Abstract

Twitter has become a prominent platform in today's social media. As the volume of tweets grows, so does the demand to mine them and extract useful information. Traditional classification methods struggle with tweets because they are so short. This paper addresses tweet classification by adapting the bag-of-words feature with semantic tools for natural language processing. The experiments showed stable classification accuracy compared with traditional text classification features.
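To illustrate the general idea of combining bag-of-words features with a semantic resource, here is a minimal Python sketch. It is not the authors' implementation: the `TOY_SYNSETS` dictionary is a hypothetical, hand-written stand-in for a WordNet lookup, and the helper names `tokenize` and `semantic_bow` are assumptions made for this example only. It shows how two short tweets with no words in common can still share features once synonyms are counted under one concept.

```python
from collections import Counter

# Hypothetical stand-in for a WordNet lookup: maps a word to a shared
# concept label. A real system would query WordNet synsets instead.
TOY_SYNSETS = {
    "happy": "joy", "glad": "joy", "joyful": "joy",
    "sad": "sorrow", "unhappy": "sorrow",
    "movie": "film", "film": "film",
}

def tokenize(tweet):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (tok.strip(".,!?#@:;\"'()") for tok in tweet.lower().split())
    return [tok for tok in tokens if tok]

def semantic_bow(tweet, synsets=TOY_SYNSETS):
    """Bag of words in which synonyms are counted under one shared concept."""
    return Counter(synsets.get(tok, tok) for tok in tokenize(tweet))

tweet_a = "So happy about this movie!"
tweet_b = "Glad I saw the film"

# Plain bag of words: the two tweets have no tokens in common.
plain_shared = set(tokenize(tweet_a)) & set(tokenize(tweet_b))

# Semantic bag of words: synonyms collapse onto the same features.
a = semantic_bow(tweet_a)
b = semantic_bow(tweet_b)
shared = set(a) & set(b)

print(sorted(plain_shared))  # []
print(sorted(shared))        # ['film', 'joy']
```

Because tweets are short, the plain token overlap between related tweets is often empty; mapping words to shared concepts is one way to give a classifier more overlapping features to work with.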

Keywords

Text mining, Classification, Big data, Twitter

Citation of this Article

Seham A. Bamatraf, Rasha A. Bin-Thalab, “Semantic Classification Model for Twitter Dataset Using Wordnet” Published in International Research Journal of Innovations in Engineering and Technology - IRJIET, Volume 5, Issue 2, pp 5-9, February 2021. Article DOI https://doi.org/10.47001/IRJIET/2021.502002

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

