Skip to main navigation Skip to search Skip to main content

An input residual connection for simplifying gated Recurrent Neural Networks

  • Nicholas I.H. Kuo
  • , Mehrtash Harandi
  • , Nicolas Fourrier
  • , Christian Walder
  • , Gabriela Ferraro
  • , Hanna Suominen

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Gated Recurrent Neural Networks (GRNNs) are important models that continue to push the state-of-the-art solutions across different machine learning problems. However, they are composed of intricate components that are generally not well understood. We increase GRNN interpretability by linking the canonical Gated Recurrent Unit (GRU) design to the well-studied Hopfield network. This connection allowed us to identify network redundancies, which we simplified with an Input Residual Connection (IRC). We tested GRNNs against their IRC counterparts on language modelling. In addition, we proposed an Input Highway Connection (IHC) as an advance application of the IRC and then evaluated the most widely applied GRNN of the Long Short-Term Memory (LSTM) and IHC-LSTM on tasks of i) image generation and ii) learning to learn to update another learner-network. Despite parameter reductions, all IRC-GRNNs showed either comparative or superior generalisation than their baseline models. Furthermore, compared to LSTM, the IHC-LSTM removed 85.4% parameters on image generation. In conclusion, the IRC is applicable, but not limited, to the GRNN designs of GRUs and LSTMs but also to FastGRNNs, Simple Recurrent Units (SRUs), and Strongly-Typed Recurrent Neural Networks (T-RNNs).

Original languageEnglish
Title of host publication2020 International Joint Conference on Neural Networks (IJCNN), 2020 Conference Proceedings2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings
EditorsAsim Roy
Place of PublicationPiscataway NJ USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages994-1001
Number of pages8
ISBN (Electronic)9781728169262
ISBN (Print)9781728169279
DOIs
Publication statusPublished - 2020
EventIEEE International Joint Conference on Neural Networks 2020 - Virtual, Glasgow, United Kingdom
Duration: 19 Jul 202024 Jul 2020
https://ieeexplore.ieee.org/xpl/conhome/9200848/proceeding (Proceedings)
https://wcci2020.org/ijcnn-sessions/ (Website)

Publication series

NameProceedings of the International Joint Conference on Neural Networks
PublisherIEEE, Institute of Electrical and Electronics Engineers
ISSN (Print)2161-4393
ISSN (Electronic)2161-4407

Conference

ConferenceIEEE International Joint Conference on Neural Networks 2020
Abbreviated titleIJCNN 2020
Country/TerritoryUnited Kingdom
CityVirtual, Glasgow
Period19/07/2024/07/20
Internet address

Keywords

  • GRU
  • Hopfield network
  • interpretability
  • LSTM

Cite this