A gradient boosting approach to the Kaggle load forecasting competition

Research output: Contribution to journalArticleResearchpeer-review

104 Citations (Scopus)

Abstract

We describe and analyse the approach used by Team TinTin (Souhaib Ben Taieb and Rob J Hyndman) in the Load Forecasting track of the Kaggle Global Energy Forecasting Competition 2012. The competition involved a hierarchical load forecasting problem for a US utility with 20 geographical zones. The data available consisted of the hourly loads for the 20 zones and hourly temperatures from 11 weather stations, for four and a half years. For each zone, the hourly electricity loads for nine different weeks needed to be predicted without having the locations of either the zones or stations. We used separate models for each hourly period, with component-wise gradient boosting for estimating each model using univariate penalised regression splines as base learners. The models allow for the electricity demand changing with the time-of-year, day-of-week, time-of-day, and on public holidays, with the main predictors being current and past temperatures, and past demand. Team TinTin ranked fifth out of 105 participating teams.
Original languageEnglish
Pages (from-to)382 - 394
Number of pages13
JournalInternational Journal of Forecasting
Volume30
Issue number2
DOIs
Publication statusPublished - 2014

Cite this