Reveal the hidden layer via entity embedding in traffic prediction

Bo Wang, Khaled Shaaban, Inhi Kim

Research output: Chapter in Book/Report/Conference proceedingConference PaperOther

5 Citations (Scopus)

Abstract

The neural network-based models have been widely used in traffic prediction. They have improved accuracy and efficiency in traffic flow, speed, passenger flow, and delay. Many variables are considered to predict traffic indicators and good techniques for choosing the most influenced variables to results have been developed. Since the neural network models treat independent variables as continuous variables, there are few studies on the use of categorical variables. In addition, the neural network has been criticized as the internal relationships of hidden layers are generally unknown. This paper investigates neural networks to predict the use of bike-sharing systems in Suzhou, China considering a large amount of categorical data. Two methods here, Entity embedding and one-hot encoding are applied. The comparison experiments verify that the entity embedding method is more efficient than one-hot encoding. Furthermore, the hidden layers are visually analyzed by t-SNE, and the relationships with time, weather, surroundings and other variables for the traffic volume at shared bike sites are discussed. The research results show that: 1. Entity embedding can effectively increase the continuity of categorical variables and therefore, improve the prediction efficiency for the neural network models. 2. The relationship between variables can be identified through visual analysis, and the trained embedding vectors can also be used to supervise clustering.

Original languageEnglish
Title of host publicationThe 10th International Conference on Ambient Systems, Networks and Technologies (ANT 2019) / The 2nd International Conference on Emerging Data and Industry 4.0 (EDI40 2019) / Affiliated Workshops
EditorsElhadi Shakshuki
Pages163-170
Number of pages8
Volume151
DOIs
Publication statusPublished - 1 Jan 2019
EventInternational Conference on Ambient Systems, Networks and Technologies (ANT) 2019 - Leuven, Belgium
Duration: 29 Apr 20192 May 2019
Conference number: 10th
https://www.sciencedirect.com/journal/procedia-computer-science/vol/151 (Proceedings)

Publication series

NameProcedia Computer Science
PublisherElsevier
ISSN (Print)1877-0509
ISSN (Electronic)151

Conference

ConferenceInternational Conference on Ambient Systems, Networks and Technologies (ANT) 2019
Abbreviated titleANT 2019
Country/TerritoryBelgium
CityLeuven
Period29/04/192/05/19
Internet address

Keywords

  • Entity embedding
  • Neural networks
  • One-hot encoding
  • Traffic prediction
  • Visualization

Cite this