Optimal scheduling over time-varying channels with traffic admission control: Structural results and online learning algorithms

Khoa T. Phan, Tho Le-Ngoc, Mihaela Van Der Schaar, Fangwen Fu

Research output: Contribution to journalArticleResearchpeer-review

25 Citations (Scopus)

Abstract

This work studies the joint scheduling-admission control (SAC) problem for a single user over a fading channel. Specifically, the SAC problem is formulated as a constrained Markov decision process (MDP) to maximize a utility defined as a function of the throughput and queue size. The optimal throughput-queue size trade-off is investigated. Optimal policies and their structural properties (i.e., monotonicity and convexity) are derived for two models: simultaneous and sequential scheduling and admission control actions. Furthermore, we propose online learning algorithms for the optimal policies for the two models when the statistical knowledge of the time-varying traffic arrival and channel processes is unknown. The analysis and algorithm development are relied on the reformulation of the Bellman's optimality equations using suitably defined state-value functions which can be learned online, at transmission time, using time-averaging. The learning algorithms require less complexity and converge faster than the conventional Q-learning algorithms. This work also builds a connection between the MDP based formulation and the Lyapunov optimization based formulation for the SAC problem. Illustrative results demonstrate the performance of the proposed algorithms in various settings.

Original languageEnglish
Article number6585729
Pages (from-to)4434-4444
Number of pages11
JournalIEEE Transactions on Wireless Communications
Volume12
Issue number9
DOIs
Publication statusPublished - 2013
Externally publishedYes

Keywords

  • learning
  • Markov decision process (MDP)
  • Scheduling
  • structural results
  • traffic admission control

Cite this