In:
Atmospheric Measurement Techniques, Copernicus GmbH, Vol. 13, No. 6 ( 2020-06-09), p. 2995-3022
Abstract:
Abstract. Online analysis with mass spectrometers produces complex
data sets, consisting of mass spectra with a large number of chemical
compounds (ions). Statistical dimension reduction techniques (SDRTs) are
able to condense complex data sets into a more compact form while preserving
the information included in the original observations. The general principle
of these techniques is to investigate the underlying dependencies of the
measured variables by combining variables with similar characteristics into
distinct groups, called factors or components. Currently, positive matrix
factorization (PMF) is the most commonly exploited SDRT across a range of
atmospheric studies, in particular for source apportionment. In this study,
we used five different SDRTs in analysing mass spectral data from complex gas-
and particle-phase measurements during a laboratory experiment investigating
the interactions of gasoline car exhaust and α-pinene. Specifically,
we used four factor analysis techniques, namely principal component analysis (PCA),
PMF, exploratory factor analysis (EFA) and
non-negative matrix factorization (NMF), as well as one clustering
technique, partitioning around medoids (PAM). All SDRTs were able to resolve four to five factors from the gas-phase measurements,
including an α-pinene precursor factor, two to three oxidation product
factors, and a background or car exhaust precursor factor. NMF and PMF provided
an additional oxidation product factor, which was not found by other SDRTs.
The results from EFA and PCA were similar after applying oblique rotations.
For the particle-phase measurements, four factors were discovered with NMF:
one primary factor, a mixed-LVOOA factor and two α-pinene secondary-organic-aerosol-derived (SOA-derived)
factors. PMF was able to separate two factors: semi-volatile oxygenated organic aerosol (SVOOA) and low-volatility oxygenated organic aerosol (LVOOA). PAM
was not able to resolve interpretable clusters due to general limitations of
clustering methods, as the high degree of fragmentation taking place in the
aerosol mass spectrometer (AMS) causes different compounds formed at different stages in the experiment
to be detected at the same variable. However, when preliminary analysis is
needed, or isomers and mixed sources are not expected, cluster analysis may
be a useful tool, as the results are simpler and thus easier to interpret. In
the factor analysis techniques, any single ion generally contributes to
multiple factors, although EFA and PCA try to minimize this spread. Our analysis shows that different SDRTs put emphasis on different parts of
the data, and with only one technique, some interesting data properties may
still stay undiscovered. Thus, validation of the acquired results, either by
comparing between different SDRTs or applying one technique multiple times
(e.g. by resampling the data or giving different starting values for
iterative algorithms), is important, as it may protect the user from
dismissing unexpected results as “unphysical”.
Type of Medium:
Online Resource
ISSN:
1867-8548
DOI:
10.5194/amt-13-2995-2020
DOI:
10.5194/amt-13-2995-2020-supplement
Language:
English
Publisher:
Copernicus GmbH
Publication Date:
2020
detail.hit.zdb_id:
2505596-3