logo

Text Mining Approach Assignment

9 Pages6902 Words137 Views
   

Added on  2021-01-02

Text Mining Approach Assignment

   Added on 2021-01-02

ShareRelated Documents
Profiling Travellers' Mode Choice towards airport access (HKIA) – Introducing the Text Mining ApproachIntroduction (Background, Motivation, Problem Identification, Expected Outcome, Significance)Situated in the Pearl River Delta, Hong Kong as a regional logistics hub, Asia’s top travel destination and aninternational centre, has drawn more than 58 millions of visitors 2018, generating nearly 300 billions per Capita Spending with an average stay of 3.2 nights (Hong Kong Tourism Board, 2018). Amongst all, 30 millions travelled to Hong Kong through air transport and landed at the Hong Kong International Airport (Civil Aviation Department (CAD), 2018), while 4.6 millions of passengers has used HKIA’s cross-boundary land and sea transport (Airport Authority Hong Kong, 2018). Opened on 6th July 1998, the Hong Kong International Airport (HKIA) connects to over 200 destinations worldwide by more than a hundred airlines. The Airport Authority aims to enhance its capacity as a leading aviation hub to cater for the growing demand and serve as the key engine in facilitating economic growth With outstanding operational performance, HKIA currently ranked 5th as the world’s top 10 airports by Skyrax (Airport Authority Hong Kong, n.d.). To accommodate the massive arrivals, the egress links to city-centre, Mainland China and Macau are well-developed. Visitors are able to choose from a variety of transportation ranging from bus service, airport express, to taxi and on-demand transportation such as Uber. Airport bridges people from around the world with the aviation system and the other modes of transport in-city. Passengers’ mode choice is paramount to evaluate the efficiency of the airport transport system, as well as providing valuable insights for policy making and system planning. Yet, there are limited studies on visitors preference in their airport access mode choice to HKIA. Further to previous research done regarding airport access mode choice on different airport such as Turkey (Gokasar & Gunay, 2017), Korea (Choo, You, & Lee, 2013), HKIA (Tam, Tam, & Lam, 2005), data are collected through conducting survey.The process of data collection from surveys is time consuming, from setting up questionnaires to engaging with respondents. Yet, it is difficult to acquire respondents over a longer period of time (over a week), to determine seasonal trends. Furthermore, the sample size is small, leading to cautious interpretation of the modelling results. The new era of data collection through the uprising technique of text mining extracts a much larger pool than of surveys, across a longer period, with a shorter time. It also provides a different typeof insights compared to the traditional survey and modelling approach.Text mining, as a knowledge discovery technique, acquires increasing importance in this digitalized era. Information are readily available on medium such as forums, Facebook, Twitter, etc. The technique, as an extension of extracting logical patterns from structured database, comprises of multiple fields to generate decision analytics from large data set through information retrieval, text analysis, natural language processing, and information classification (Irfan, et al., 2015). It covers disciplines in statistics, linguistics and machine learning; generally includes categorization of information, clustering of text, eaccess (HKIA) – Introducing the Text Mining Approach. Purpose of text mining approach is to process or transform unstructured data or information ( textual) into meaningful numeric indices from text given so that to frame information that is available in accessible text that is available in various mining forms. In general terms text mining is responsible for turning text into numbers or meaningful indices which can further be used in in other forms of analysis or examination such as predictive data mining projects , unsupervised learning methods etc. There are various approaches of textmining which have further been stated as under: Using well tested methods: In this process or approach, once a data matrix has been derived from input documents it is important that well developed and well known analytical tools and techniques are used for further processing their data . This method can further infuse methods such as clustering, factoring or predictive data mining.Black box approach: There are a number of text mining applications that involve black box method so that a deep meaning from documents can be extracted with involving a certain amount of human effort. In this
Text Mining Approach  Assignment_1
method text minnig mainly depends on proprietary algorithms that can be used for gaining concepts from text. This technology is expected to yet in its infant stage in current scenario. Text mining as document search: This is another important application that is often known as text mining. This approach occurs in a domain form. For example, popular internet search engines that are used by individuals for providing efficient access to web pages that have important content. It is a quite important type of application software which is very beneficial for business entities that have to search data in quite larger directory form. With the help of this, maximum benefits could be gained by business entities in pulling out right amount of information in specific time frame.Have done the intro and literature review; most need help on methodology and result part, ofc the text mining part too. Dont mind any changes on the scope/objective of the project, but mostly will be using data from TripAdvisor. Traction of concepts, and formulation of general taxonomies. Text mining help extract useful information from bulk data efficiently in a short period of time, as well as assisting the prediction of future aspect based on the provided observations and statistics generated from the concluded trends from data sets (Hashimi, Hafez, & Mathkour, 2015). Social media mining has been employed by many businessesto perform competitive analysis through transforming data into insights. In contrast with traditional data analytics, social media tools show the interactivity between users, which has become a crucial role in changing people’s communication. Traditional media engages people in a one way connection. Referrals and promotions from the social word-of-mouth also cultivate the understanding of their customer base, which brings about business value for companies’ to develop their marketing and business strategies (Shen, Chen, & Wang, 2018).This project puts together the text mining techniques with social media to unveil travellers’ preference in their mode choice of airport access. The motivations are twofold: first, to apply data since methodology to collect and analyse social media data; second, to present past and current trends of transportation preferencesand their implications, hence, provide interesting insights. SignificanceThe objectives of this study are as follow: 1.To analyse the concept of text-mining as a new approach to look into mode choices and transportation 2.To identify the explanatory variables for mode choice3.To find out travellers’ experience with the transportation system to and from Hong Kong International Airport4.To analyse the preferred mode choice5.To determine the change of preferences over time and seasonal preference Expected OutcomeTravellers’ preference and experience with the access mode of Hong Kong International Airport are expected to be found through parsing and analysing online data. Insights and trends are expected to bring recommendations for enhancing the current system, policy and planning of airport access mode. Most importantly, give an outline of the approach of text-mining for finding airport access mode choice and set grounds for a wider scope of study in the future.Literature ReviewAirport access mode choiceTo facilitate the advancement of airport management, gaining understandings of air passengers concerning airport access modes is of crucial importance. Alhussein (2011) has done the very first research on ground access modes choice to King Khaled International Airport (KKIA) in Riyadh, Saudi Arabia, aiming to analyse access mode behaviour to KKIA. Tam, Tam and Lam (2005) examine the access mode choices of departing passengers are studied to provide source information for transport operators to improve their services planning and increase their shares at the airport ground access market. Choo, You and Lee (2013) explored passengers’ airport access mode choice and hence developed mode choice models after conducting
Text Mining Approach  Assignment_2
Chi-square and ANOVA tests to identify key explanatory variables of the airports. All of these researches done have one thing in common: data are collected through conducting survey or face-to-face interview at the terminals targeting departing passengers at random.In the research done by Tam, Tam and Lam (2005), not only did the structural relations between passengers’personal characteristics, trip characteristics were included, but also Expectation and Perception, the two latent variables previous researches have not taken into account. Personal and trip characteristics including gender, age, education level, flight length and travel cost all negatively impact the use of public transport modes for airport ground access, also suggested by Alhussein (2011). Public transport dominate the top preference of mode choice in Hong Kong, opposite to western countries. Visitors on business trip or visit theHKIA less frequently have a tendency to select private cars/taxi as their ground access mode choice. Results has indicated that respondents’ perceived levels of satisfaction are lower than their expectations on the five selected service attributes (franchised buss, AEL, private car, taxi, others). Passengers found travel time reliability as the most satisfactory service attribute; while waiting time of franchised buses, walking distanceto and from the Airport Express stations, travel cost for taxi and private car, as well as waiting time for airport shuttle buses offered by hotels and travel agencies all have a high priority for improvement. Alhussein (2011), Tam, Tam and Lam (2005) suggested that future studies could collect data to determine the effects of travel seasons on airport ground access mode choice, with an inclusion of more service attributes, and the latent variables. Text mining and social mediaSocial media such as online forums have gain increasing popularity in exchanging ideas and advice. Discovering from the online communities could be rewarding. Park, Conway and Chen (2017) employed thetext mining ,qualitative analysis and visualization approach to compare online discussion content from three online mental health communities. Corpus was downloaded using Python Reddit API Wrapper (PRAW). Python Natural Language Toolkit and Scikit-learn was then used to pre-process the dataset – removing stop words, punctuations, both high- and low-frequency terms, and tokenization. K-mean clustering followed after to identify main discussion themes in a large collection of documents. The frequency of term appearance was then visualized as a bubble chart, proportional to the cluster size, by D3 and a network visualization by Gephi. Venn diagram was used to visualize the thematic overlaps among the three online communities. Qualitative comparison was carried out as a result. Louvain modularity algorithm (in Gephi) and heatmap visualization of Jaccard similarity scores were used as an illustration of how clusters are topically similar and dissimilar from one another. The research findings facilitates more nuanced discussionsand encourage future researches to include multiple methods in fully understanding of differences among conditions with shared symptomatology. Yet, the approach serves as a valuable take away for analysing and visualizing textual comparisons. Social media is a modern day approach by which companies can enhance their popularity among maximum number people at very high speed. It can be said that this approach is mosteffective one as it contributes in communicating high number of people. It has been analysed that now a days people belonging to every generation is having their account on social media. That provides ensures thecompany that if they shares any information on this platform it will be transferred to everyone that means from youth-old age people. As a result, this shared information will provide them business to company and contribute in attaining desired targets. It will also maximise profitability of the company in effective manner within less consumption of time. The site is also helpful in taking suggestions from customers as the users can share their personal experience. On the basis of their experience, they also provide advices on the official account of company as it will help them out in improving weaknesses. By, improving these issues, company can work on the mentioned areas and enhance its quality that is being provided to customers in effective manner. In context to text mining, it can be said that raw data can be used at this place in order to convert it into meaningful data. Thus, it can be said that social media can be used here for sharing this collected information to maximum people.MethodologyText-mining
Text Mining Approach  Assignment_3

End of preview

Want to access all the pages? Upload your documents or become a member.

Related Documents
SWOT Analysis of Hong Kong International Airport
|17
|5069
|91

Managerial Economics - Aerotropolis
|13
|3583
|24