TY - JOUR
T1 - Efficient hardware accelerators for the computation of Tchebichef moments
AU - Chang, Kah Hyong
AU - Paramesran, Raveendran
AU - Asli, Barmak Honarvar Shakibaei
AU - Lim, Chern Loon
PY - 2012/3
Y1 - 2012/3
N2 - Moments extraction from high resolution images in real time may require a large amount of hardware resources. Using a direct method may involve a critically high operating frequency. This paper presents two improved digital-filter based moment accelerators, as exemplified by a Tchebichef moments computation engine, to introduce features that contribute to an area-efficient and timing-efficient accelerator design. The design of the accelerators invariably consists of two on-chip units: the digital filter and the matrix multiplication units. Among the features introduced are: a data-shifting means, a filter load distribution method, a reduced set of column filters, sectioned left shifters, a double-line buffer, time-multiplexed and pipelined matrix multiplication sections, and multichip amenable features. A total of 98 frames of test data from high definition videos, real and synthetic images are used in the functional tests. The single-chip field-programmable gate array implementation results show the successful realizations of accelerators capable of moment computations of (31, 31) orders, at 50 frames of 1920×1080 8-bit pixels per second, and (63, 63) orders, at 30 frames of 512×512 pixels per second. These performances have exceeded that of existing multichip and multiplatform designs.
AB - Moments extraction from high resolution images in real time may require a large amount of hardware resources. Using a direct method may involve a critically high operating frequency. This paper presents two improved digital-filter based moment accelerators, as exemplified by a Tchebichef moments computation engine, to introduce features that contribute to an area-efficient and timing-efficient accelerator design. The design of the accelerators invariably consists of two on-chip units: the digital filter and the matrix multiplication units. Among the features introduced are: a data-shifting means, a filter load distribution method, a reduced set of column filters, sectioned left shifters, a double-line buffer, time-multiplexed and pipelined matrix multiplication sections, and multichip amenable features. A total of 98 frames of test data from high definition videos, real and synthetic images are used in the functional tests. The single-chip field-programmable gate array implementation results show the successful realizations of accelerators capable of moment computations of (31, 31) orders, at 50 frames of 1920×1080 8-bit pixels per second, and (63, 63) orders, at 30 frames of 512×512 pixels per second. These performances have exceeded that of existing multichip and multiplatform designs.
KW - Accelerator architectures
KW - digital filters
KW - large dynamic range
KW - moment methods
UR - http://www.scopus.com/inward/record.url?scp=84858012271&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2011.2163980
DO - 10.1109/TCSVT.2011.2163980
M3 - Article
AN - SCOPUS:84858012271
SN - 1051-8215
VL - 22
SP - 414
EP - 425
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 3
ER -