We know the secret of your success
According to Iacus, S.M. (2020), COVID-19 is a communicable disease that can be spread through physical touch and or non-physical contact such as through sneezing, coughing, or talking. COVID-19 is the acronym for Corona Virus Disease of 2019. The World Health Organization (WHO) labelled Convid’19 as a global pandemic on March 11th 2020 (Chakraborty, K. et al. 2020). The impact of Convid’19 is social and economical and was reported to be an epidemic on January 30th, 2020 (Guo, Y.R 2019). Convid’19 spread rapidly and led to ill health and even deaths. According to Rustam, F. et al. (2021), the number of death from Convid’19 was over 600 thousand as of May 31st 2020 and this was predicted to increase. Chintalapudi, N. et al. (2021) argued that the world population's mental and physical health has been proven to be related to the Convid’19 pandemic. As of August 27th 2020; more than twenty-four million people had tested positive for Convid’19 worldwide (Chakraborty, K. et al. 2020).
In order to reduce the spread of Convid’19, every country was obligated to develop the best possible way of tracing physical contacts and managing public gathering. One of the most widely adopted way of reducing the spread of the virus was through social distancing and lockdown. Social media has become an important part of human life because It connects people to the outside world. People now heavily depend on posts and tweets published on social media platforms to get news and information. According to Dubey A. D. (2020), social networking traffic saw a dramatic increase during the lockdown.
According to Liu, I.L., Cheung, C.M. and Lee, M.K., (2010), Twitter is the major social media platform where Convid’19 news are timely tweeted and shared. News and thoughts regarding Convid-19 are widely shared on social media (Rustam, F. et al. 2021) and just like a virus, they spread very fast. Koohikamali, M. and Sidorova, A., (2017) argued that due to personal sentiment and bias, most of these news are subjective and the have resulted to misleading information, fake news, and negativity. Just like Convid’19 news gotten from social media are responsible for anxiety due to some misinformation and widespread of fake news and the rapid spread of false information on social has negatively affected mental health (Chintalapudi, N. et al. 2021). To curtail this, it is vital that news spread through mainstream media be checked and regulated. Rustam, F. et al. (2021) thus pointed out that the ability to identify Covid-19 sentiments in tweets would make more educated judgments to be made on how to best handle the spread of misinformation and the Convid’19 pandemic. Simultaneously, the attention of scholars has been drawn to these issues and one way of understanding and providing solution to the viral misinformation in social media is to use data for Convid’19 sentiment analysis of tweets.
News that are false and highly biased are rampantly tweeted in Twitter during a global pandemic like the Convid’19 pandemic and they often have extremely negative sentiment. The word "misinfodemic" refers to disinformation that contributes to the transmission of disease (Gyenes, N. and Mina, A.X., 2018). Banerjee, D., (2020) argued that “misinfodemic” was quite common during the Covid’19 outbreak and that scientists largely ignored “misinfodemic” by only focusing on developing a Corona virus vaccine. Extremely biased misinformation is detrimental to the mental health of people (Chintalapudi, N. et al. 2021). “Misinfodemic” have a negative impact on an individual's wellbeing and can lead to societal dysfunction, hysteria (Banerjee, D., 2020) and even suicide.
The aim of this project is to explore, predict and evaluate the sentiments of tweets related to Convid’19. The objectives of this research work are as follows:
The research questions for this project are as follows:
The deliverables of this project are exploratory data analysis, data mining, data munging, modeling, evaluation, model deployment and a project report. The report should thoroughly explain the source of the data used, exploratory data analysis, code implementation, feature selection, model selection as well as model performance.
This project is mainly focused on building and deploying a machine learning model that would transform tweets related to Convid’19 and predict the sentiment of the tweets.
This project would consider a secondary research, modeling, and deployment. They are further buttressed below:
Secondary research
The secondary research in this project will utilize a systematic approach (Johnson et al., 2016) to review the works of literature. The steps involved in the systematic review of the literature are provided below:
Modeling
This section involves sourcing for and making sense out of data. The steps involved are as follows:
Deployment
This section is geared towards packaging the model in a web app and deploying the model. The steps in achieving these are:
The risk assessment conducted for this project is provided in the table below:
Table 1: Risk assessment
Risk
Impact
Mitigation Plan
Inability to meet the deadline
Low
Get an extension from the supervisor in due time
Inability to get sufficient data
medium
Refer to communities like Kaggle, and Twitter for assistance.
Inability to build the model
low
Refer to supervisor and communities like Stackoverflow.
Inability to deploy model
Table 2: Project Plan
Task Name
Start Date
End Date
Duration (Days)
Initial Research
23/09/2021
07/10/2021
14
Proposal
28/10/2021
21
Secondary Research
07/12/2021
40
Introduction Chapter
12/12/2021
5
Literature Review Chapter
05/01/2022
24
Methodology Chapter
17/01/2022
12
Data Collection and modelling
15/03/2022
60
Presentation 1
23/03/2022
8
06/04/2022
Evaluation of Results Gotten
13/04/2022
7
Discussion Chapter
23/04/2022
10
Evaluation Chapter
28/04/2022
Conclusion Chapter
30/04/2022
2
Project Management Chapter
01/05/2022
Abstract and Report compilation
03/05/2022
Report Proofreading
13/05/2022
Presentation 2
23/05/2022
References
Banerjee, D., 2020. The COVID-19 outbreak: Crucial role the psychiatrists can play. Asian journal of psychiatry, 50, p.102014.
Chakraborty, K., Bhatia, S., Bhattacharyya, S., Platos, J., Bag, R. and Hassanien, A.E., 2020. Sentiment Analysis of COVID-19 tweets by Deep Learning Classifiers—A study to show how popularity is affecting accuracy in social media. Applied Soft Computing, 97, p.106754.
Chintalapudi, N., Battineni, G. and Amenta, F., 2021. Sentimental Analysis of COVID-19 Tweets Using Deep Learning Models. Infectious Disease Reports, 13(2), pp.329-339.
Dubey AD. 2020.Twitter Sentiment Analysis during COVID19 Outbreak. Available at SSRN 3572023.
Guo, Y.R., Cao, Q.D., Hong, Z.S., Tan, Y.Y., Chen, S.D., Jin, H.J., Tan, K.S., Wang, D.Y. and Yan, Y., 2020. The origin, transmission and clinical therapies on coronavirus disease 2019 (COVID-19) outbreak–an update on the status. Military Medical Research, 7(1), pp.1-10.
Gyenes, N. and Mina, A.X., 2018. How misinfodemics spread disease. The Atlantic.
Iacus, S.M., Natale, F., Santamaria, C., Spyratos, S. and Vespe, M., 2020. Estimating and projecting air passenger traffic during the COVID-19 coronavirus outbreak and its socio-economic impact. Safety Science, 129, p.104791.
Johnson, D., Deterding, S., Kuhn, K.A., Staneva, A., Stoyanov, S. and Hides, L., 2016. Gamification for health and wellbeing: A systematic review of the literature. Internet interventions, 6, pp.89-106.
Koohikamali, M. and Sidorova, A., 2017. Information Re-Sharing on Social Network Sites in the Age of Fake News. Informing Science, 20.
Liu, I.L., Cheung, C.M. and Lee, M.K., 2010. Understanding Twitter Usage: What Drive People Continue to Tweet. Pacis, 92, pp.928-939.
Rustam, F., Khalid, M., Aslam, W., Rupapara, V., Mehmood, A. and Choi, G.S., 2021. A performance comparison of supervised machine learning models for Covid-19 tweets sentiment analysis. Plos one, 16(2), p.e0245909.
Last updated: Oct 04, 2021 05:01 PM
Your one-stop website for academic resources, tutoring, writing, editing, study abroad application, cv writing & proofreading needs.