Cranfield Online Research Data (CORD)
Repository for (35.06 MB)

Data supporting: 'Analysing the Sentiment of Air-Traveller: A Comparative Analysis'

Download (35.06 MB)
Version 3 2022-08-31, 11:55
Version 2 2022-06-24, 13:14
Version 1 2022-03-17, 14:35
posted on 2022-08-31, 11:55 authored by Mohammed Salih A HomaidMohammed Salih A Homaid, Desmond Bala BisanduDesmond Bala Bisandu, Irene MoulitsasIrene Moulitsas, Karl JenkinsKarl Jenkins

Airport service quality is considered to be an indicator of passenger satisfaction. However, assessing this by conventional methods requires continuous observation and monitoring. Therefore, during the past few years, the use of machine learning techniques for this purpose has attracted considerable attention for analysing the sentiment of the air traveller. A sentiment analysis system for textual data analytics leverages the natural language processing and machine learning techniques in order to determine whether a piece of writing is positive, negative or neutral. Numerous methods exist for estimating sentiments which include lexical-based methodologies and directed artificial intelligence strategies. Despite the wide use and ubiquity of certain strategies, it remains unclear which is the best strategy for recognising the intensity of the sentiments of a message. It is necessary to compare these techniques in order to understand their advantages, disadvantages and limitations. In this paper, we compared the Valence Aware Dictionary and sentiment Reasoner, a sentiment analysis technique specifically attuned and well known for performing good on social media data, with the conventional machine learning techniques of handling the textual data by converting it into numerical form. We used the review data obtained from the SKYTRAX website for each airport. The machine learning algorithms evaluated in this paper are VADER sentiment and logistic regression. The term frequency-inverse document frequency is used in order to convert the textual review data into the resulting numerical columns. This was formulated as a classification problem, whereby the prediction of the algorithm was compared with the actual recommendation of the passenger in the dataset. The results were analysed according to the accuracy, precision, recall and F1-score. From the analysis of the results, we observed that logistic regression outperformed the VADER sentiment analysis.


Authoriser (e.g. PI/supervisor)

Usage metrics



    Ref. manager