Weizun Zhao’s research while affiliated with The University of Hong Kong and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (5)


Fig. 1. Hong Kong's Mass Transit Railway (MTR) system map as of 2016 (Wikimedia Commons contributors, 2020).
Fig. 8. Treatment area and control area for Ocean Park & Wong Chuk Hang.
Fig. 9. DID analysis time settings.
Fig. 10. Tweet activity change over time for three MTR station-influenced areas.
Fig. 12. Footprint comparison of sustaining users in Whampoa & Ho Man Tin.

+12

How do new transit stations affect people's sentiment and activity? A case study based on social media data in Hong Kong
  • Article
  • Full-text available

March 2022

·

443 Reads

·

13 Citations

Transport Policy

·

·

·

[...]

·

Urban rail development can increase land value, reduce commute time, and increase accessibility, as reported in the literature. However, little is known about the impact of opening urban rail transit stations on people's sentiment, particularly in the context of large metropolises where population density is significantly high. This paper investigates such impact by studying six new transit stations opened in Hong Kong. People's sentiment and activity in station nearby areas are estimated by tweet sentiment and tweet activity. We use the difference-in-difference model to study the impact of opening new transit stations. Tweet sentiment, tweet activity, tweet content, and footprints of people who visit the station-influenced area ‘before and after’ the opening of transit stations are analyzed. The results suggest that, in general, the introduction of transit stations causes a positive change in tweet activity, and the change is statistically significant after six months. Regarding tweet sentiment, new transit stations tend to pose a mixed effect in a short-term, a positive influence on areas with high-density residential places, yet a negative influence on areas with a large proportion of nature reserve areas. These short-term effects, positive or negative, become not significant in the long term (after twelve months). Our analysis also confirmed that the introduction of new transit stations increased accessibility from (to) other parts of the city to(from) the station's nearby area, which was shown by the expanded locations sustaining users visited. These findings indicate that the urban rail transit system in Hong Kong promotes more active neighborhoods yet does not always promotes positive influence on people's sentiment. Further studies are needed to make future urban rail transit systems promoting active and happy neighborhoods. The study is relevant to the Belt and Road Initiative (BRI) in methodologies, data, and findings. The social media analysis method used in this study, including text mining and sentiment analysis, can be easily extended to multiple language analysis for Singapore, Malaysia, as well as other regions in the belt and road plan. The developed tools could contribute to analyzing the influence of cross-country projects on local neighborhoods in the belt and road plan.

Download

An incremental clustering method for anomaly detection in flight data

November 2021

·

516 Reads

·

38 Citations

Transportation Research Part C Emerging Technologies

Safety is a top priority for civil aviation. Data mining in digital Flight Data Recorder (FDR) or Quick Access Recorder (QAR) data, commonly referred to as black box data on aircraft, has gained interest for proactive safety management. New anomaly detection methods, primarily clustering methods, have been developed to monitor pilot operations and detect any risks from such flight data. However, all existing anomaly detection methods are offline learning — the models are trained once using historical data and used for all future predictions. In practice, new flight data are accumulated continuously and analyzed every month at airlines. Clustering such dynamically growing data is challenging for an offline method because it is memory and time intensive to re-train the model every time new data come in. If the model is not re-trained, false alarms or missed detections may increase since the model cannot reflect changes in data patterns. To address this problem, we propose a novel incremental anomaly detection method based on Gaussian Mixture Model (GMM) to identify common patterns and detect outliers in flight operations from digital flight data. It is a probabilistic clustering model of flight operations that can incrementally update its clusters based on new data rather than to re-cluster all data from scratch. It trains an initial GMM model based on historical offline data. Then, it continuously adapts to new incoming data points via an expectation–maximization (EM) algorithm. To track changes in flight operation patterns, only model parameters need to be saved, not the raw flight data. The proposed method was tested on three sets of simulation data and two sets of real-world flight data. Compared with the traditional offline GMM method, the proposed method can generate similar clustering results with significantly reduced processing time (57 %–99 % time reduction in testing sets) and memory usage (91 %–95 % memory usage reduction in testing sets). Preliminary results indicate that the incremental learning scheme is effective in dealing with dynamically growing data in flight data analytics.


An Incremental Clustering Method for Anomaly Detection in Flight Data

May 2020

·

209 Reads

Safety is a top priority for civil aviation. Data mining in digital Flight Data Recorder (FDR) or Quick Access Recorder (QAR) data, commonly referred as black box data on aircraft, has gained interest from researchers, airlines, and aviation regulation agencies for safety management. New anomaly detection methods based on supervised or unsupervised learning have been developed to monitor pilot operations and detect any risks from onboard digital flight data recorder data. However, all existing anomaly detection methods are offline learning - the models are trained once using historical data and used for all future predictions. In practice, new QAR data are generated by every flight and collected by airlines whenever a datalink is available. Offline methods cannot respond to new data in time. Though these offline models can be updated by being re-trained after adding new data to the original training set, it is time-consuming and computational costly to train a new model every time new data come in. To address this problem, we propose a novel incremental anomaly detection method to identify common patterns and detect outliers in flight operations from FDR data. The proposed method is based on Gaussian Mixture Model (GMM). An initial GMM cluster model is trained on historical offline data. Then, it continuously adapts to new incoming data points via an expectation-maximization (EM) algorithm. To track changes in flight operation patterns, only model parameters need to be saved, not the raw flight data. The proposed method was tested on two sets of simulation data. Comparable results were found from the proposed online method and a classic offline model. A real-world application of the proposed method is demonstrated using FDR data from daily operations of an airline. Results are presented and future challenges of using online learning scheme for flight data analytics are discussed.



Citations (4)


... Internationally, Arku et al. [14] analyzed Twitter sentiment on four African smart city projects, highlighting positive sentiment despite limited physical progress. Chang et al. [15] examined sentiment shifts around new transit stations in Hong Kong using geolocated tweets. Qi et al. [16] applied a sentiment framework to Miami's transit system, revealing concerns and trends that complemented surveys. ...

Reference:

Fundamental Research on X (formerly Twitter) Attitude Survey on Utsunomiya LRT Project
How do new transit stations affect people's sentiment and activity? A case study based on social media data in Hong Kong

Transport Policy

... Predictive maintenance Mathew et al. (2017) [1], Jiangyan et al. (2024), Baptista et al. (2021), Kefalas et al. (2021) [2], Boujamza and Elhaq (2022) [3], Vollert and Theissler (2021), Wang et al. (2023) [4], Zhang et al. (2019) [5], Li et al. (2018) RUL LSTM, RFE Janakiraman and Nielsen (2016) [6], Das et al. (2010) [7], Liu et al. (2023) [8], Zhao et al. (2021a) [9], Lee et al. (2020) [10], Zhong et al. (2021) [11], Jalawkhan and Mustafa (2021) [12], Corrado et al. (2021) Bejarano et al. (2022) [36], Topal et al. (2023) [37], Giovanni et al. (2021) [38] Human-AI Teaming CNN, LSTM, ANN Ma and Tian (2020) [39], Rohani et al. (2023) [40], Zeng et al. (2020) [41], Shi et al. (2020) [42], Choi et al. (2021) [43], Schimpf et al. (2023) [44], Shi et al. (2018) [45], Jia et al. (2022) [46] ...

An incremental clustering method for anomaly detection in flight data

Transportation Research Part C Emerging Technologies

... These clustering algorithms focus on anomaly detection for historical flight data, which causes its inability to track the cluster changes in flight. Zhao et al. [21] developed an online clustering algorithm to achieve cluster adjustment as onboard flight data update. Because these models do not consider multi-parameter coupling and timevarying characteristics from the perspective of flight safety, it is difficult to accurately describe the relationship between flight state and LOC risk. ...

An Adaptive Online Learning Model for Flight Data Cluster Analysis
  • Citing Conference Paper
  • September 2018

... The study showed how environmental uncertainties and flight conditions affect the performance of the statistical model. Similar challenges were faced in a different approach, which used fuel flow data from Quick Access Recorder (QAR) to estimate take-off mass [10]. Although statistical methods mostly proved to be more accurate in TOW prediction compared to open-loop studies discussed above, their main disadvantage is the use of not publicly available data. ...

Aircraft Mass Estimation using Quick Access Recorder Data
  • Citing Conference Paper
  • September 2018