Crowd Control, Planning, and Prediction Using Sentiment Analysis: An Alert System for City Authorities

Bibliographic Details
Title: Crowd Control, Planning, and Prediction Using Sentiment Analysis: An Alert System for City Authorities
Authors: Tariq Malik, Najma Hanif, Ahsen Tahir, Safeer Abbas, Muhammad Shoaib Hanif, Faiza Tariq, Shuja Ansari, Qammer Hussain Abbasi, Muhammad Ali Imran
Source: Applied Sciences, Vol 13, Iss 3, p 1592 (2023)
Publisher Information: MDPI AG, 2023.
Publication Year: 2023
Collection: LCC:Technology
LCC:Engineering (General). Civil engineering (General)
LCC:Biology (General)
LCC:Physics
LCC:Chemistry
Subject Terms: Twitter, alert system, sentiment analysis, machine learning algorithms, vectorization techniques, manual annotation, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
More Details: Modern means of communication, economic crises, and political decisions play imperative roles in reshaping political and administrative systems throughout the world. Twitter, a micro-blogging website, has gained paramount importance in terms of public opinion-sharing. Manual intelligence of law enforcement agencies (i.e., in changing situations) cannot cope in real time. Thus, to address this problem, we built an alert system for government authorities in the province of Punjab, Pakistan. The alert system gathers real-time data from Twitter in English and Roman Urdu about forthcoming gatherings (protests, demonstrations, assemblies, rallies, sit-ins, marches, etc.). To determine public sentiment regarding upcoming anti-government gatherings (protests, demonstrations, assemblies, rallies, sit-ins, marches, etc.), the alert system determines the polarity of tweets. Using keywords, the system provides information for future gatherings by extracting the entities like date, time, and location from Twitter data obtained in real time. Our system was trained and tested with different machine learning (ML) algorithms, such as random forest (RF), decision tree (DT), support vector machine (SVM), multinomial naïve Bayes (MNB), and Gaussian naïve Bayes (GNB), along with two vectorization techniques, i.e., term frequency–inverse document frequency (TFIDF) and count vectorization. Moreover, this paper compares the accuracy results of sentiment analysis (SA) of Twitter data by applying supervised machine learning (ML) algorithms. In our research experiment, we used two data sets, i.e., a small data set of 1000 tweets and a large data set of 4000 tweets. Results showed that RF along with count vectorization performed best for the small data set with an accuracy of 82%; with the large data set, MNB along with count vectorization outperformed all other classifiers with an accuracy of 75%. Additionally, language models, e.g., bigram and trigram, were used to generate the word clouds of positive and negative words to visualize the most frequently used words.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2076-3417
Relation: https://www.mdpi.com/2076-3417/13/3/1592; https://doaj.org/toc/2076-3417
DOI: 10.3390/app13031592
Access URL: https://doaj.org/article/0b68fc38cdd5455fb97f7f6bfec3c5d8
Accession Number: edsdoj.0b68fc38cdd5455fb97f7f6bfec3c5d8
Database: Directory of Open Access Journals
Full text is not displayed to guests.
More Details
ISSN:20763417
DOI:10.3390/app13031592
Published in:Applied Sciences
Language:English