TY - GEN
T1 - On sufficient statistics of least-squares superposition of vector sets
AU - Konagurthu, Arun S.
AU - Kasarapu, Parthan
AU - Allison, Lloyd
AU - Collier, James H.
AU - Lesk, Arthur M.
N1 - Conference code: 18th
PY - 2014
Y1 - 2014
N2 - Superposition by orthogonal transformation of vector sets by minimizing the least-squares error is a fundamental task in many areas of science, notably in structural molecular biology. Its widespread use for structural analyses is facilitated by exact solutions of this problem, computable in linear time. However, in several of these analyses it is common to invoke this superposition routine a very large number of times, often operating (through addition or deletion) on previously superposed vector sets. This paper derives a set of sufficient statistics for the least-squares orthogonal transformation problem. These sufficient statistics are additive. This property allows for the superposition parameters (rotation, translation, and root mean square deviation) to be computable as constant time updates from the statistics of partial solutions. We demonstrate that this results in a massive speed up in the computational effort, when compared to the method that recomputes superpositions ab initio. Among others, protein structural alignment algorithms stand to benefit from our results.
AB - Superposition by orthogonal transformation of vector sets by minimizing the least-squares error is a fundamental task in many areas of science, notably in structural molecular biology. Its widespread use for structural analyses is facilitated by exact solutions of this problem, computable in linear time. However, in several of these analyses it is common to invoke this superposition routine a very large number of times, often operating (through addition or deletion) on previously superposed vector sets. This paper derives a set of sufficient statistics for the least-squares orthogonal transformation problem. These sufficient statistics are additive. This property allows for the superposition parameters (rotation, translation, and root mean square deviation) to be computable as constant time updates from the statistics of partial solutions. We demonstrate that this results in a massive speed up in the computational effort, when compared to the method that recomputes superpositions ab initio. Among others, protein structural alignment algorithms stand to benefit from our results.
UR - http://www.scopus.com/inward/record.url?scp=84958549033&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-05269-4_11
DO - 10.1007/978-3-319-05269-4_11
M3 - Conference Paper
SN - 9783319052687
SP - 144
EP - 159
BT - Research in Computational Molecular Biology
A2 - Sharan, Roded
PB - Springer
CY - Cham [Switzerland]
T2 - International Conference on Computational Molecular Biology 2014
Y2 - 2 April 2014 through 5 April 2014
ER -