TY - JOUR
T1 - Information content of ToF-SIMS data
T2 - Effect of spectral binning
AU - Madiona, Robert M.T.
AU - Alexander, David L.J.
AU - Winkler, David A.
AU - Muir, Benjamin W.
AU - Pigram, Paul J.
PY - 2019/11/1
Y1 - 2019/11/1
N2 - Surface analysis methods such as Time of Flight Secondary Ion Mass Spectrometry (ToF-SIMS) have become essential for probing surfaces and interfaces that are critical determinants of the properties of a diverse range of materials. These methods generate copious amounts of information but this is rarely analyzed by modern artificial intelligence and machine learning methods. Here we calculate the information content of ToF-SIMS spectra, and how this changes with variation in the size of the bins into which the spectra can be assigned. We find that Shannon entropy of the spectra of 10 diverse polymers correlates well with molecular information content of the monomers from which the polymers derive. Surprisingly, we find that most of the information in ToF-SIMS spectra resides in resolutions (bin sizes) of 0.02–1 m/z. At very small bin sizes the information content of the spectra is close to that expected for mass spectral information uniformly distributed across bins. Conversely, for large bin sizes we find that the information content of the spectra is close to that expected for mass spectral information randomly distributed across bins.
AB - Surface analysis methods such as Time of Flight Secondary Ion Mass Spectrometry (ToF-SIMS) have become essential for probing surfaces and interfaces that are critical determinants of the properties of a diverse range of materials. These methods generate copious amounts of information but this is rarely analyzed by modern artificial intelligence and machine learning methods. Here we calculate the information content of ToF-SIMS spectra, and how this changes with variation in the size of the bins into which the spectra can be assigned. We find that Shannon entropy of the spectra of 10 diverse polymers correlates well with molecular information content of the monomers from which the polymers derive. Surprisingly, we find that most of the information in ToF-SIMS spectra resides in resolutions (bin sizes) of 0.02–1 m/z. At very small bin sizes the information content of the spectra is close to that expected for mass spectral information uniformly distributed across bins. Conversely, for large bin sizes we find that the information content of the spectra is close to that expected for mass spectral information randomly distributed across bins.
KW - Materials informatics
KW - Shannon entropy
KW - Spectral binning
KW - Time-of-flight secondary ion mass spectrometry (ToF-SIMS)
UR - http://www.scopus.com/inward/record.url?scp=85069738847&partnerID=8YFLogxK
U2 - 10.1016/j.apsusc.2019.07.044
DO - 10.1016/j.apsusc.2019.07.044
M3 - Article
AN - SCOPUS:85069738847
SN - 0169-4332
VL - 493
SP - 1067
EP - 1074
JO - Applied Surface Science
JF - Applied Surface Science
ER -