TY - JOUR
T1 - Missing traffic data imputation for artificial intelligence in intelligent transportation systems
T2 - review of methods, limitations, and challenges
AU - Chan, Robin Kuok Cheong
AU - Lim, Joanne Mun Yee
AU - Parthiban, Rajendran
N1 - Funding Information:
This work was supported by the Malaysian Ministry of Higher Education (MOHE), Fundamental Research Grant Scheme (FRGS), through the purview of Monash University Malaysia, under Grant FRGS/1/2019/TK08/MUSM/03/1.
Publisher Copyright:
© 2013 IEEE.
PY - 2023/4/3
Y1 - 2023/4/3
N2 - Missing data in Intelligent Transportation Systems (ITS) could lead to possible errors in the analyses of traffic data. Applying Artificial Intelligence (AI) in these circumstances can mitigate such problems. Past works focused only on specific data imputation methods, such as tensor factorization or a specific neural network model. While there are review papers covering singular topics regarding missing data, there are none in the field of traffic, to the best of our knowledge, that introduces the process of missing data collection and the viability of the traffic data collected while also broadly covering the popularly used models of recent years. This has led to non-uniformity of the terms used in missing data imputation, limited research in areas where datasets are not available, and a narrowed view of the methods used for data imputation. Hence, this paper aims to standardize the terms used in missing data classifications, look into the limitations of using available public or private datasets for urban traffic research, and discuss popular statistical and data-driven methods used by recent AI and ITS papers. It was found that tensor decomposition-based methods are the most popular for missing data imputation, followed by Generative Adversarial Networks and Graph Neural Networks, all of which rely on a large training dataset. Meanwhile, Probability Principle Component Analysis (PPCA) methods provide valuable insights via traffic analysis and are used for real-time traffic imputation. This paper also highlights the need for more efficient and reliable methods for traffic data collection, such as online APIs.
AB - Missing data in Intelligent Transportation Systems (ITS) could lead to possible errors in the analyses of traffic data. Applying Artificial Intelligence (AI) in these circumstances can mitigate such problems. Past works focused only on specific data imputation methods, such as tensor factorization or a specific neural network model. While there are review papers covering singular topics regarding missing data, there are none in the field of traffic, to the best of our knowledge, that introduces the process of missing data collection and the viability of the traffic data collected while also broadly covering the popularly used models of recent years. This has led to non-uniformity of the terms used in missing data imputation, limited research in areas where datasets are not available, and a narrowed view of the methods used for data imputation. Hence, this paper aims to standardize the terms used in missing data classifications, look into the limitations of using available public or private datasets for urban traffic research, and discuss popular statistical and data-driven methods used by recent AI and ITS papers. It was found that tensor decomposition-based methods are the most popular for missing data imputation, followed by Generative Adversarial Networks and Graph Neural Networks, all of which rely on a large training dataset. Meanwhile, Probability Principle Component Analysis (PPCA) methods provide valuable insights via traffic analysis and are used for real-time traffic imputation. This paper also highlights the need for more efficient and reliable methods for traffic data collection, such as online APIs.
KW - artificial intelligence
KW - communication system operations and management
KW - Intelligent transportation systems
KW - reviews
UR - http://www.scopus.com/inward/record.url?scp=85153344255&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2023.3264216
DO - 10.1109/ACCESS.2023.3264216
M3 - Review Article
AN - SCOPUS:85153344255
SN - 2169-3536
VL - 11
SP - 34080
EP - 34093
JO - IEEE Access
JF - IEEE Access
ER -