Improving Low-Cost Optical PM Sensor Accuracy in Humid Conditions via Aerosol Liquid Water Estimation Using U.S. EPA CSN Data
Yuhang Guo, Alexandra Catena, Margaret J. Schwab, Amanda Teora, Oliver V. Rattigan, Violet Harder, Yasaman Hassanzadeh, James J. Schwab, Jie Zhang

TL;DR
This paper improves the accuracy of low-cost PM2.5 sensors in humid conditions by estimating aerosol liquid water using EPA data.
Contribution
A novel framework for correcting PM2.5 sensor measurements in high humidity using aerosol liquid water estimation and optical calibration.
Findings
Corrected PM2.5 data align well with EPA reference measurements.
Aerosol liquid water quantification aids aqueous-phase aerosol chemistry studies.
Practical guidance is provided for regions without EPA CSN data.
Abstract
High ambient relative humidity (RH) poses a substantial challenge to the accuracy of low-cost optical sensors used for measuring the fine particulate matter (PM2.5) concentration. In this study, we developed a novel, practical, and feasible framework for mechanistically correcting low-cost PM2.5 sensor measurements under high-humidity conditions by quantitatively separating aerosol liquid water mass (ALW) using the widely available EPA Chemical Speciation Network (CSN) data set, after accounting for the necessary optical calibration procedures that affect sensor performance at elevated RH. We introduced two key correction processes for a low-cost optical PM2.5 measurement system comprising a nephelometer and an optical particle counter: (1) optical calibration grounded in Mie theory to account for variations in sensor performance driven by aerosol size distribution, refractive index,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
1
2
3
4
5
6| city | city/rural style, region | particle source | season |
| averaged Chemical fractions (%) (Org/SO4/NH3/NO3/Cl/Na/Ca/K/Mg) |
|
|---|---|---|---|---|---|---|
| Albany, NY | Midsized city, Northeastern U.S. | Wildfire | Late summer | 0.82 | 83.9/10.2/1.1/2.8/0.7/0.4/0.3/0.0 | 0.12 |
| Local urban | Fall | 0.82 | 72.7/16.4/1.2/6.2/0.9/0.6/0.8/0.5/0.6 | 0.12 |
- —U.S. Environmental Protection Agency10.13039/100000139
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAtmospheric chemistry and aerosols · Air Quality Monitoring and Forecasting · Atmospheric aerosols and clouds
Introduction
1
Traditional monitoring of ambient particulate matter with aerodynamic diameters smaller than 2.5 μm (PM_2.5_) primarily depends on certified reference methods established by national and local regulatory agencies, such as the U.S. Environmental Protection Agency (USEPA). These methods provide high measurement accuracy but are often limited in spatial coverage due to high operational costs and logistical deployment challenges. ?−? ? To address these limitations, low-cost optical sensors have been increasingly adopted to complement regulatory monitoring networks. These sensors are cost-effective, compact, and easy to deploy, offering rapid response times and enhanced spatial and temporal resolution. As a result, they are now widely used to assess community-level exposure risks, track wildfire smoke, monitor traffic emissions, evaluate air quality in schools, and assess indoor environments. ?−? ? ? ? ?
Currently available low-cost optical sensors can be broadly categorized into two types based on their underlying optical measurement principles: nephelometer-based (Thermo Scientific PDR-1500) or nephelometer-linked sensors (e.g., Plantower PMS5003), which are highly sensitive to PM_1_ (particulate matter with aerodynamic diameters less than 1.0 μm), ?,? and optical particle counters (OPCs), such as the Alphasense OPC-N2, which demonstrate greater sensitivity to PM_1_-2.5 (particles between 1.0 and 2.5 μm in aerodynamic diameter). ?,? It should be noted that the widely used Plantower PMS5003 was regarded as a nephelometer-like monitor here, given recent findings that its PM_1_ concentration is strongly correlated with the submicron aerosol scattering coefficient, like a nephelometer, while it counted single particle scattering events acting like an imperfect optical particle counter. ?,? Nephelometric sensors estimate PM mass concentrations by detecting light scattering intensity at specific angles, typically calibrated using factory-set conversion factors. In contrast, the OPCs detect and size individual particles as they traverse a laser beam, generating particle size distributions that are subsequently converted into mass concentrations. The performance of both sensor types is fundamentally governed by the principles of Mie scattering and is strongly influenced by several particle-specific parameters, including particle size distribution, complex refractive index, the wavelength of the laser source, and the sensor’s scattering angle. ?,? These dependencies introduce significant uncertainty, particularly under varying atmospheric conditions, and can considerably limit the accuracy and reliability of the reported mass concentrations. ?−? ? ? ?
Ambient meteorological conditions, particularly elevated relative humidity (RH), introduce additional uncertainties in the performance of the optical sensors. Unlike regulatory methods, which dry particles to a consistent and low RH, optical-based methods sample and measure particles at ambient RH, which means that the aerosol contains mass from the aerosol liquid water (ALW). High RH promotes the hygroscopic growth of particles, altering their size and refractive index, which in turn affects light scattering behavior governed by Mie theory and compromises sensor accuracy. ?−? ? ? ? To mitigate RH-induced bias, empirical correction equations have traditionally been employed, ?,? alongside more recent approaches utilizing machine learning (ML) algorithms. ?−? ? ?
Notably, a recent study based on Mie scattering theory has demonstrated that the RH-related bias in nephelometric measurements is quantitatively linked to particle hygroscopic growth? and then ALW content. ?,? ALW is strongly dependent on aerosol chemical composition, ambient RH, and temperature, and its mass concentration can exceed two to three times the dry aerosol mass under humid conditions. ?,? This dependence highlights the limitations of empirical and ML-based correction models when applied across different regions or seasons, as the ALW mass concentration can vary substantially. To address these challenges, the accurate quantification and removal of ALW from sensor measurements are essential. Separating ALW will not only improve the reliability of PM_2.5_ estimates but also offer critical insights into aqueous-phase aerosol chemistry and secondary aerosol formation processes, including cloud condensation nuclei (CCN) activity.
To separate aerosol liquid water (ALW) from nephelometer-based/like sensors (e.g., Plantower PMS5003, Thermo Scientific PDR-1500), high-time-resolution (minute-to-hourly scale) aerosol chemical composition data are essential. Such data can be obtained from advanced instruments such as the high-resolution time-of-flight aerosol mass spectrometer (HR-ToF-AMS), ?,? the aerosol chemical speciation monitor (ACSM),? or the particle-into-liquid sampler (PILS).? However, the high purchase cost of these instruments, combined with the substantial maintenance and personnel requirements for long-term field deployments, makes their use impractical for widespread application within low-cost optical sensor networks.
As an alternative, the U.S. EPA’s Chemical Speciation Network (CSN) provides a widely distributed and long-term record of aerosol chemical composition, albeit at a much lower temporal resolution (typically every 3 or 6 days). In this study, we developed a novel, practical, and feasible framework for mechanistically correcting low-cost PM_2.5_ sensor measurements under high-humidity conditions by quantitatively separating aerosol liquid water mass using the widely available EPA CSN data set, after accounting for the necessary optical calibration procedures that affect sensor performance at elevated RH. The corrected results were validated against the reference-grade EPA monitor to assess the performance of the correction algorithm. In addition, practical recommendations are proposed for extending this framework to regions lacking nearby CSN sites, thereby facilitating broader deployment of low-cost sensors with improved accuracy under humid conditions.
Methods
2
Albany Air Quality School Network (AAQSN)
2.1
The Albany Air Quality School Network (AAQSN) was established to address the spatial gap left by the limited EPA routine regulatory monitoring stationscurrently only one for PM_2.5_ mass concentration and one for ozone concentration, in the New York Capital District region, with the goal to enhance the detection of local air pollutants at the community level. Five K-12 schools were selected as monitoring sites, with each school equipped with two QuantAQ MODULAIR packagesone placed indoors and the other outdoorsto measure indoor and outdoor air pollutants. Two schools are located in the Albany Downtown region, one in the Albany Uptown region, one at Green Island near Troy, and one in Schenectady. AAQSN is designed to address three key objectives: (1) improving the accuracy of low-cost sensors for community-level air pollution measurements, (2) analyzing the spatial distribution of air pollutants at the community level, and (3) investigating the penetration of outdoor air pollutants into indoor environments. This study focuses on the performance of the PM_2.5_ sensors in Objective #1, using outdoor measurements from one Albany Downtown site, which is the closest to the U.S. EPA/New York State Department of Environmental Conservation (NYS DEC) Albany County Health Department PM_2.5_ site (ACHD site) being approximately 140 m (Figure S1), which provided continuous PM_2.5_ measurements using a Teledyne T640 as the reference instrument for calibration in this study, as well as the CSN data set.
The QuantAQ MODULAIR is an integrated, cost-effective sensor package (∼$6000) that incorporates multiple low-cost sensors to measure various air pollutants,? including PM_1_, PM_2.5_, PM_10_, ozone (O_3_), nitric oxide (NO), nitrogen dioxide (NO_2_), and carbon monoxide (CO). For PM_2.5_ mass concentrations, the MODULAIR system integrates readings from a nephelometer (Plantower PMS5003) for PM_1_ and an optical particle counter (OPC, Alphasense OPC-N2) for PM_1−2.5_. To better understand and characterize particle hygroscopic growth under high RH and its impact on MODULAIR PM_2.5_ measurements, two processes were conducted: (1) the optical calibration to correct the performance of PM_2.5_ sensors caused by the factors such as aerosol size, refractive index, density, and particle hygroscopic growth; and (2) separation of aerosol liquid water (ALW) from the optically calibrated data set to derive the dry-equivalent aerosol mass concentration under high RH conditions, as detailed described below.
Besides the PM_1_ and PM_1−2.5_ reading from MODULAIR, the air flow RH and temperature measured by embedded RH-Temperature detectors were also used for the ALW mass concentration calculation, as described in Section. The time resolution was set to be 1 h, to match the time resolution of EPA/DEC ACHD site measurements.
To avoid analytical bias arising from incomplete observationsparticularly the wintertime data gaps in the outdoor QuantAQ record due to solar-panel snow coverage, and ∼6-month availability lag in CSN data, we restricted the analysis to 08/15/2024 to 10/30/2024. This period ensures the overlapping observations between the uninterrupted QuantAQ measurements and 13 CSN sampling days with the key CSN measurement parameters summarized in Table S1. The results presented in this study primarily reflect aerosol characteristics during the late summer and fall seasons. A full two-year analysis will be completed by late 2026, in which we will extend the method developed here to provide a more comprehensive assessment that resolves seasonal variability and incorporates a wider range of source influences.
The Teledyne T640 at ACHD site is routinely operated and quality-assured by the NYS DEC in accordance with U.S. EPA Federal Equivalent Method (FEM) protocols and was used in this study as the reference instrument after constrained by the 24 h Federal Reference Methods (FRMs) data set at the same site (Figure S2). The T640 and the 24 h FRM PM_2.5_ data could be downloaded from (1) “U.S. EPA Air Data: Air Quality Data Collected at Outdoor Monitors Across the US” (https://www.epa.gov/outdoor-air-quality-data) or (2) “Federal Land Manager Environmental Database” (https://views.cira.colostate.edu/fed/QueryWizard/). Based on the fitted relationships, the study period (08/15/2024 to 10/30/2024) was further separated into two distinct regimes: a “wildfire-influenced period” (08/15/2024 to 09/04/2024) and a “local-urban-influenced period” (09/04/2024−10/30/2024). Significant correction was applied for the “wildfire-influenced period” as the T640 substantially overestimated PM_2.5_ mass by nearly 38% during this period, while it matched very well with FRM results during the “local-urban-influenced period”.
CSN Data Analysis and Mass Growth Factor Calculation
2.2
The PM_2.5_ component mass concentrations (NO_3_ ^−^, SO_4_ ^2−^, NH_4_ ^+^, and organic carbon (OC)) from the Chemical Speciation Network (CSN, https://www.epa.gov/amtic/chemical-speciation-network-csn) ACHD site were used to determine their dry mass fractions, which were then used to calculate the ALW under high RH conditions, as well as the mass growth factor (MGF). The MGF is defined as the ratio of (1) the total particle mass, including aerosol liquid water, after particle hygroscopic growth to (2) the original measured dry particle mass concentration (also the “dry-equivalent particle mass concentration” in this study). The CSN data could be downloaded from the same website as T640 as well as the 24 h FRM PM_2.5_ data. For this study, an OM/OC ratio of 1.6 was applied to convert CSN OC to organic matter (OM) mass concentration, the same as the value used in the CSN data set for the reconstructed PM_2.5_ mass concentration of the ACHD site. However, the OM/OC ratio can vary substantially depending on emission sources, monitoring location, season, and meteorological conditions, with reported values ranging from approximately 1.4 to above 2. Therefore, using a constant ratio of 1.6 may introduce some uncertainty into our estimates for the MGF, which is further discussed below.
It should be noted that the CSN measurements at the EPA/DEC ACHD site were collected on a 1-in-6-day schedule, providing much lower temporal resolution than the continuous PM_2.5_ measurements from both the MODULAIR unit and the EPA/DEC ACHD monitor (∼1 h resolution). To address the resulting temporal gaps, we assumed that the dry-equivalent particles between two adjacent PM_2.5_ mass−concentration valley edges (marked by the black points in Figure S2) were subject to similar sources and chemical processing and therefore exhibited comparable mass fractions. Accordingly, the mass fractions derived from CSN measurements (Table S1) were extended to the full period between two valley edges to generate an hourly data set, as illustrated in Figure S2. It should be acknowledged that notable variations in aerosol chemical component mass concentrations may occur during the valley-to-valley periods as a result of intrainterval and diurnal variability. ?,? In the present study, however, the mass growth factor (MGF) was used for correction, which depended on mass fractions rather than absolute mass concentrations. As mass fractions have been exhibited comparatively smaller temporal variability than mass concentration,? as well as shown in Figure S2, this treatment is expected to partially mitigate the uncertainties arising from short-term compositional fluctuations, with more discussed in Section.
Mass Growth Factor Calculation
2.3
Based on the determined component mass fraction and assuming a normalized PM_2.5_ mass concentration of 1 μg m^−3^ as the basis, the value of the MGF can be equal to the value of the sum of (1) the ALW associated with inorganic components (ALW_inorg_), (2) the ALW associated with organic components (ALW_org_), and (3) the dry particle mass (normalized to 1). Meanwhile, we simply assumed the mass fraction would be the same for PM_1_ and PM_1−2.5_ in this study, which resulted in the same MGF for these two size ranges. For the aerosol under low RH conditions (RH < 60%), MGF would be 1.
ALW_inorg_ was estimated using the thermodynamic equilibrium model ISORROPIA II.? The model utilized (1) the inorganic component concentrations (NO_3_ ^−^, SO_4_ ^2−^, NH_4_ ^+^, Na+, Cl^−^, Ca^2+^, K^+^, Mg^2+^), calculated from the newly determined component mass fraction and the normalized PM_2.5_ mass concentration (1 μg m^−3^), and (2) the relative humidity (RH) and temperature measured inside MODULAIR. Due to the unknown air mass history, the “metastable” mode was adopted, as to better represent the persistence of a fully aqueous phase under ambient conditions and prevent premature crystallization from being artificially imposed. Meanwhile, M ALW,org was calculated based on the empirical equation?
where k org is the organic particle hygroscopicity parameter, set to 0.12 as the initial value,? ρ water is the water density (1.0 g cm^−3^), M org is the organic component concentrations calculated form the organic mass fraction from CSN and the reference PM_2.5_ mass concentration (1 μg m^−3^), ρ org is the organic aerosol density (1.4 g cm^−3^),? and RH is the relative humidity reported by MODULAIR. It should be noted that, in this study, a RH threshold of approximately 60% was applied to distinguish between “low RH” and “high RH” conditions. This choice assumes negligible hygroscopic growth below 60%, which is higher than the commonly reported threshold of ∼40% by Li et al. (2021),? who found that the enhancement in aerosol scattering (i.e., f(RH) > 1) typically begins when RH exceeds ∼40% in urban environments. The 60% threshold was selected as a balance between (1) including more low RH data to improve the robustness of optical calibration (as described in Step 3 of Section) and (2) limiting the uncertainty introduced by neglecting ALW under low RH conditions. At RH being 60%,the calculated ALW mass fraction is approximately 20%, which remains within an acceptable uncertainty range for this study. In the following analysis, aerosol under RH < 60% is classified as “dry aerosol” with ALW being 0 and MGF being 1, while aerosol under RH ≥ 60% is referred to as “wet aerosol”.
With knowledge of ALW (= ALW_inorg_ + ALW_org_), the overall ambient particle density after particle hygroscopic growth can be obtained as
where M NH4NO3, M (NH4)2SO4, and M org are the mass concentrations of particle NH_4_NO_3_, (NH_4_)2_SO_4, and organic mass concentration through the mass fraction from CSN and the reference PM_2.5_ mass concentration (1 μg m^−3^), MGF is the calculated following the method as described above based on the normalized PM_2.5_ mass concentration (1 μg m^−3^), the density used for NH_4_NO_3_ is 1.73 g cm^−3^, (NH_4_)2_SO_4 is 1.77 g cm^−3^, organic is 1.4 g cm^−3^, and water is 1.0 g cm^−3^. Meanwhile, when RH < 60%, ALW will be 0, and ρ ambient will be the dry aerosol density under low RH conditions. In this study, ρ ambient is used to represent the aerosol density under both low and high RH conditions, with only the ALW set being 0 at low RH conditions (RH < 60%).
Results
3
Overview of Measurements and Correction Processing
3.1
Figure illustrates the time series of PM_2.5_ concentrations recorded by the MODULAIR sensor package, alongside hourly PM_2.5_ mass concentrations measured by a T640 instrument constrained by CSN data set at the EPA/DEC ACHD site. A notable discrepancy was observed under high relative humidity (RH) conditions, particularly when RH exceeded 80%. Under high RH conditions (RH ≥ 60%), the PM_2.5_ concentrations measured by the EPA/DEC ACHD site T640 were, on average, only 60% of those reported by MODULAIRsignificantly lower than the mean ratio of 82% observed when RH was below 60%. These observations highlight the influence of humidity, especially the hygroscopic growth of particles, on the accuracy of the MODULAIR optical PM monitor.
Overview of the measurements as well as their correlation. (1) Time series of PM2.5 concentrations reported by MODULAIR, color-coded by RH, alongside PM2.5 mass concentrations from the EPA/DEC ACHD T640. (2) Scatter plots illustrating the correlation between MODULAIR-reported PM2.5 and EPA/DEC ACHD T640 PM2.5 mass concentrations with data points color-coded by RH.
To address the discrepancies under high RH conditions, two key correction procedures were implemented, as summarized in Figure. First, optical calibration was applied to MODULAIR data to account for particle optical-related properties such as size distribution, refractive index, density, and hygroscopic growth (Steps 2 and 3 in Figure). Second, the aerosol liquid water (ALW) component was then separated from the optically calibrated values to obtain the dry-equivalent aerosol mass concentration (Step 4). The dry-equivalent PM_2.5_ concentration under high RH conditions refers to the particle mass measured after the aerosol stream has been driedcomparable to the method used in EPA/DEC standard PM_2.5_ measurements. These dry-equivalent concentrations were then compared to the EPA/DEC ACHD measurements to assess the effectiveness of the correction procedure. It is important to note that, in this study, optically measured particle mass under high RH includes ALW as part of the total mass, whereas the dry-equivalent concentration represents only the dehydrated (initial dry) fraction.
Correction process for the MODULAIR PM2.5 data sets (MGF was 1 for aerosol under low RH conditions (RH < 60%)).
Optical Calibration of OPC (“Step 2”)
3.2
As optical sensors, both nephelometers and OPCs estimate aerosol mass concentrations based on the scattering of light by ambient particles. These sensors are typically calibrated using specific test aerosolssuch as monodispersed spheres, under low RH conditions, with defined particle sizes, refractive indices, and densities. As a result, their measurement accuracy under dry ambient conditions depends heavily on the aerosol’s light-scattering properties, which are primarily influenced by particle size and refractive index.
However, under high RH conditions, hygroscopic growth transforms particles from their dry state to a hydrated state by incorporating water. This process increases particle size while reducing their refractive index and density, thereby altering their light-scattering characteristics compared to those under dry conditions. To account for these variations, we define an optical calibration factor , where M ref is the reference mass concentration from the EPA/DEC measurements in this study and M opt is the value reported by the optical sensor. This factor reflects the performance of the sensor under specific environmental conditions.
Due to fundamental differences in the operating principles of the two sensors, we applied separate calibration strategies: (1) Nephelometer-based sensors infer PM_1_ mass concentrations from the intensity of scattered light, and (2) OPCs count the number of particles in predefined size bins and estimate PM_1−2_.5 mass concentrations by summing across those bins, as described in the QuantAQ’s Technical Notes: “Introduction to the MODULAIR PM” (https://quant-aq.com/technical-notes).
Therefore, we first examined the calibration factor of the OPC, f OPC, under both low and high RH conditions. The value of f OPC, determined under dry conditions (RH < 60%, with justification of this threshold provided in Section), was then used as a basis to derive the calibration factor for the nephelometer, f nep.
Hagan et al. (2020)? characterized the relationship between a particle’s scattering cross-section and its sizereferred to hereafter as the “scattering-size relationship”which is essential for OPC calibration. In the case of the crystalline OPCs, this relationship enables the assignment of particles to appropriate size bins based on their measured scattering cross sections.
Under both low and high RH conditions, the scattering cross-section of an ambient particle is governed by two factors: (1) its physical size and (2) a scattering-size relationship dependent on the particle’s refractive index. Each measured scattering cross-section is matched against the factory-calibrated curve (originally developed using standard monodispersed polystyrene latex spheres) to determine the particle’s size bin and ultimately calculate its mass contribution.
Accordingly, the optical calibration factor, f OPC, is obtained using eq,
where ρ ambient is the density of ambient aerosol under low/high RH conditions and was calculated as described in Section, D ambient is the ambient particle diameter, ρ OPC is factory used constant particle density for MODULAIR raw data set (1.65 g m^−3^), and D OPC is the determined particle diameter through the factory scattering-size relationship and the detected scattering cross-section by ambient aerosol.
To determine the ratio between the ambient aerosol diameter and the diameter inferred by the OPC calibration (D ambient/D OPC), it is necessary to evaluate the “scattering-size relationship” of ambient aerosols relative to the factory calibration curve. This comparison heavily depends on the refractive index of the aerosol, which affects its light-scattering properties. The average ambient aerosol refractive index (RI) was estimated using a volume-weighted mixing rule: RI = ∑i * ^n^RI* _ i _ × VF_ i , ?,? where RI was the average refractive index, RI _ i _ was the refractive index of components (1.52 for (NH_4)2_SO_4, 1.56 for NH_4_NO_3_, 1.33 for water, 1.55 for organic),? and VF _ i _ was the volume fraction, which were determined based on CSN data set. The estimated ambient aerosol refractive index for low RH conditions is around 1.55, which is very near to the factory calibration used for monodispersed polystyrene latex spheres, being around 1.59. This suggests that the scattering behavior of dry ambient aerosols is reasonably well represented by the standard calibration particles.
However, under high RH conditions, hygroscopic growth causes the inclusion of water in the particles, reducing the effective refractive index. In extreme cases (RH > 95%), the RI can decrease to as low as 1.33, equivalent to that of pure water.
Figure illustrates the scattering-size relationships under three representative conditions: (1) Factory calibration particle (RI = 1.59), (2) Dry ambient particle (RI = 1.55), and (3) Extremely wet particles with near-complete water content (RI = 1.33). The differences in the scattering-size relationships among these scenarios, particularly for particles in the 1−2.5 μm diameter range, were not substantial. Although this analysis involves some simplifications, especially given the complex processing algorithms used by the OPCs to infer mass concentration, the introduced uncertainty is likely minimal. This is primarily because the PM_1−2.5_ fraction contributes less than 20% to the total PM_2.5_ mass under typical conditions.?
Relationship of the estimated scattering cross-section with the particle diameter (“scattering-size relationship”) for different particles.
Therefore, it is reasonable to assume that the inferred particle diameter from the OPC measurements (D _OPC _) closely approximates the actual ambient aerosol diameter (D ambient). Under this assumption, the optical calibration factor for OPC, f OPC, can be simplified as the ratio of the ambient aerosol density to the assumed constant particle density used in MODULAIR’s calculation method (1.65 g cm^−3^), as shown in eq
where M NH4NO3, M (NH4)2SO4, and M org are the mass concentrations of particle NH_4_NO_3_, (NH_4_)2_SO_4, and organic mass concentration through the mass fraction from CSN data and scaled using a reference PM_2.5_ mass concentration (1 μg·m^−3^), respectively, MGF is the mass growth factor under high RH conditions and was the sum of the calculated ALW mass concentration based on the reference PM_2.5_ mass concentration and 1 (as described in Section), and the densities are 1.73 g cm^−3^ for NH_4_NO_3_, 1.77 g cm^−3^ for (NH_4_)2_SO_4, 1.4 g cm^−3^ for organic, and 1.0 g cm^−3^ for water. Note that when RH < 60%, ALW will be 0, and ρ ambient will be the dry aerosol density under low RH conditions.
Under most circumstances, the calculated ambient aerosol density was lower than the constant particle density used in the factory calibration of the OPC sensor’s factory calibration. The estimated f OPC was generally below 0.9 (Figure), with a noticeable drop at high RH conditions, reaching values as low as approximately 0.7. Using these estimated f OPC _ values, the raw mass concentration reported by the OPC sensor for particles in the 1−2.5 μm diameter range (M OPC) was corrected to obtain an optically calibrated concentration, M OPC-cal. This calibrated mass concentration was then combined with the EPA/DEC ACHD PM_2.5 data set to infer the reference PM_1_ mass concentration (PM_1-ref_), by subtracting M OPC-cal from the reference PM_2.5_ mass, and the PM_1-ref_ will be used for the subsequent calibration of the nephelometer measurements, as shown in Figure.
Time series of reported mass concentration from OPC (M OPC, black line) and the calibrated concentration (M OPC-cal, red line), along with the estimated f OPC (blue points with its corresponding axis on the right).
Optical Calibration of Nephelometer (“Step 3”)
3.3
Our previous study showed that the total relative scattering intensity detected by the nephelometer for dry particles is proportional to the ratio of values reported by the reference monitor to those from the nephelometer (f nep,RH<60%).? Assuming a similar aerosol size distribution and refractive index for aerosols originating from similar sources, the f _nep,RH<60**%** _ can be obtained through the linear fitting of the measurements from the reference monitor and the nephelometer with similar particle source conditions. Meanwhile, the relative scattering intensity detected by a nephelometer was further verified to be unaffected by aerosol hydration states and exhibited minimal sensitivity to RH fluctuations between 45% and 95%, with an uncertainty less than 5%.?
Therefore, the calibration factor derived under low RH conditions can be applied across the entire RH range, as expressed in eq
where PM_1-ref_ can be obtained by subtracting the M OPC-cal from the EPA/DEC ACHD site measured PM_2.5_ data set, as mentioned in Section, f nep is the calibration factor for the nephelometer, b denotes the regression intercept, reflecting the intrinsic instrument bias and baseline scattering signal, and accounted for systematic offsets between the nephelometer measurement and the reference monitor, including optical noise and instrument zero drift.
Figure shows the correlations between the PM_1-ref_ values and the nephelometer measurements under low RH conditions (RH < 60%), while the “wildfire-influenced” and “local-urban-influenced” periods exhibit no distinct differences. Therefore, a single f nep value of 0.82 and an intercept of 0.71 were obtained by fitting the data with RH < 60% from both periods, and these parameters were applied to all data points within each period to derive the optically calibrated mass, M nep-cal. Under high RH conditions (RH ≥ 60%), M nep-cal represents the total mass concentration of the dry-equivalent particle mass and ALW.
Correlations between the PM1-ref values and the nephelometer measurements under low RH conditions (RH < 60%)
ALW Separation and Method Validation (“Step 4”)
3.4
With the estimated M OPC-cal from Section (Step 2) and the M nep-cal from Section (Step 3), the derived MGF was then applied to partition the calibrated total mass concentrations from MODULAIR into dry-equivalent particle mass and ALW content. Specifically, (1) the optical calibrated OPC measurement, M opc-cal, was divided by MGF to yield the dry-equivalent PM_1−2.5_ mass concentration PM OPC-cal,dry-equ, and (2) the optical calibrated nephelometer measurement, M nep-cal, was divided by MGF to yield the dry-equivalent PM_1_ mass concentration: PM nep-cal,dry-equ. The sum of these two dry-equivalent mass concentrations was then defined as PM 2.5,dry-equ, and closely aligned with EPA/DEC T640 PM_2.5_ measurements with a regression slope of 1.02 (Figure), validating the method presented in this study. Meanwhile, the difference between (M opc-cal + M _nep-cal ) and PM 2.5,dry-equ was ALW concentration. Notably, the contribution of ALW to the total PM_2.5 was approximately 30% at RH between 70 and 80%, increased to about 40% at 80−90% RH, and approached or exceeded 50% when RH was above 90% (Figure S4), generally matching the results from previous studies. ?,? During the second wildfire plume episode (August 24−27), ALW concentrations exceeded 10 μg m^−3^, underscoring the substantial contribution of ALW under humid and smoke-influenced conditions. These results highlight both the magnitude of ALW occurrence and the significance of separating ALW in this study, providing critical insight for future investigations into aqueous-phase aerosol chemistry,? secondary aerosol formation, and CCN activity.
(a) Time series of the separated ALW, the final corrected MODULAIR PM2.5 data, as well as the EPA/DEC ACHD site PM2.5 measurements, and (b) the correlations between PM2.5,dry-equ and EPA/DEC PM2.5 measurements being colored by RH.
Uncertainties and Limitations
3.5
Among the parameters influencing the MGF calculation, the large variations in (1) OM/OC ratio, (2) the organic hygroscopicity parameter (k org), and/or (3) the organic aerosol density (ρ org) represented the major sources of uncertainty. Besides the default value of OM/OC ratio of 1.6, k org of 0.12, ρ org of 1.4 g cm^−3^, sensitivity tests were performed for (1) the OM/OC with a lower and upper bounds of [1.4 1.8] with default value of k org and ρ org, (2) the k org with a lower and upper bounds of [0 0.3] with default value of OM/OC and ρ org, and (3) the ρ org with a lower and upper bounds of [1.0 1.8] with default value of OM/OC and k org. Sensitivity tests were conducted for each parameter individually. Specifically, [1.4 1.8] was used within the lower and upper bounds of the OM/OC ratio while keeping k org and ρ org fixed at their default values; k org was varied between 0 and 0.3 with default OM/OC and ρ org; and ρ org was varied between 1.0 and 1.8 g cm^−3^ with default OM/OC and k org. The results are summarized in Table S2, which shows the largest variation of regression slopes between the EPA/DEC T640 PM_2.5_ measurements and the calibrated MODULAIR PM_2.5_ concentrations being approximately 10% when k org is 0. This relatively small variation suggests low sensitivity of the calibration outcomes to the tested parameters, supporting the applicability of the default parameter set across different seasons and regions.
In addition, potential volatility artifacts in the CSN measurementssuch as NH_4_NO_3_ loss and semivolatile OM evaporation or condensationcould also influence the calculated MGF and consequently the estimated ALW content, leading to additional uncertainty in the estimated k org. Furthermore, contributions from dust and sea salt were neglected, as these components are expected to be minor for inland PM_2.5_ at this site, though they may be more relevant in other regions.? Finally, the composition mass fraction between weekly CSN samples was interpolated across valley-to-valley periods to represent comparable sources and processing; however, intrainterval and diurnal variability may still occur, and subsegmentation using higher-temporal-resolution proxies is recommended where available. Meanwhile, the simplified assumption of similar mass fractions for PM_1_ and PM_1−2.5_, as well as ignoring the compounds’ mixing state, may also introduce additional uncertainty. However, uncertainty caused by the similar mass fractions for PM_1_ and PM_1−2.5_ was expected to be small, as the relatively dominant mass fraction of PM_1_ to PM_2.5_ (near 80%) for the NY region.? Given that the primary objective of this study was to develop a practical and feasible framework for correcting low-cost PM_2.5_ sensor measurements under high-humidity conditions and obtain a first-order estimation of ALW, the uncertainties associated with the aforementioned assumptions were not explicitly quantified. However, based on the previous sensitivity analysis results, these uncertainties are also expected to remain within an acceptable range. Further investigation using higher-accuracy measurements and advanced instrumentation is warranted to better constrain these effects.
Discussion
4
This study developed a novel, practical, and feasible framework for mechanistically correcting low-cost PM_2.5_ sensor measurements under high-humidity conditions by quantitatively separating aerosol liquid water mass (ALW) using the widely available EPA Chemical Speciation Network (CSN) data set, after accounting for the necessary optical calibration procedures that affect sensor performance at elevated RH. It explored two key correction processes for the QuantAQ MODULAIR PM_2.5_ system, which integrates a nephelometer for PM_1_ and an OPC for PM_1−2.5_, including: (1) optical calibration to account for sensor performance variations due to aerosol size, refractive index, density, and hygroscopic growth, and (2) ALW determination from the optically calibrated data set, to derive dry-equivalent aerosol mass concentration under high RH conditions. Following the description of the processes, one of the most critical parameters for the optical sensor correction would be the particle chemical component mass fraction, and it provides essential information for estimating ambient particle density for the OPC optical calibration and deriving the mass growth factor associated with particle hygroscopic growth under high RH conditions, which was then used for ALW separation. Meanwhile, the optical calibration factor for the nephelometer, which is the ratio of reference instrument measurements to the nephelometer, under low RH conditions (“f nep,RH<60%”), is critical to calibrate the nephelometer for its optical performance under high RH conditions.
Nevertheless, a significant challenge arises from the limited spatial coverage of high-quality reference PM_2.5_ measurements and detailed chemical composition data sets, which constrain the full correction potential of the optical sensors, especially under high RH conditions. Despite this, the key parameters identifiedparticle size, refractive index, chemical component mass fraction, and k _org_tend to exhibit source-specific consistency. That is, aerosols originating from similar sources generally share stable physical and chemical properties. This source-dependence suggests that large-scale, long-term statistical analyses of aerosol source characteristics can be leveraged to partially overcome geographic data limitations.
Given the widespread use of low-cost sensors such as the Plantower PMS5003 nephelometer and the Alphasense OPC-N2 for particle size distribution measurements, we strongly advocate further research that leverages the sensors which are near regulatory PM_2.5_ monitoring stations and aerosol chemical speciation networks. Long-term colocated data sets offer a valuable opportunity to develop a comprehensive look-up table or reference databasesuch as the illustrative Table in this study, containing optical calibration factors and correction parameters tailored to specific aerosol source types and seasonal conditions. Such databases would also allow researchers to account for seasonal shifts in aerosol properties due to photochemical aging, as well as being applied to regions without chemical speciation data while with similar source type and geographical characteristics as those represented in the look-up table.
1: Summary of the Particle Source and Characterizations for This Study
Importantly, our analysis further showed that using the averaged aerosol chemical composition for similar particle source regimes (as summarized in Table) introduces only a modest bias (approximately 5%, Figure S5), compared with the ∼3% bias obtained using the valley-to-valley period-averaged method described above. This finding indicates that regime-based average composition is both practically useful and scientifically defensible when local chemical speciation data is unavailable.
In regions without CSN or other speciation measurements, chemical composition can be assigned using a source- and region-informed surrogate CSN site for nonwildfire periods. Specifically, when a target location shares similar urban characteristics, emission profiles, and geographic settings with an existing CSN site, the composition from the most representative site within the same region can be used. For example, Syracuse, NY (located approximately 200 km northwest of Albany) shares comparable urban size and emission characteristics with Albany, making Albany’s CSN composition an appropriate surrogate for nonwildfire conditions. During wildfire events, however, composition was taken from the nearest CSN site, influenced by the same smoke plume, reflecting the regional nature of smoke transport.
Generally speaking, this regime-based approach enables this method to be transferable across regions while preserving the physical consistency of the correction algorithm. Collectively, these advancements are expected to significantly improve the accuracy and reliability of low-cost optical PM sensors under high RH conditions. Moreover, the separation of ALW from particle mass measurements provides additional scientific value by offering insight into aqueous-phase aerosol chemistry including secondary aerosol formation mechanisms and CCN activity.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1EPA . Monitoring PM 2.5 in Ambient Air Using Designated Reference or Class I Equivalent Methods, Quality Assurance Guidance Document 2.12. 2016 https://www.epa.gov/amtic/amtic-pm 25-monitoring-network.
- 2Lewis, A. ; Peltier, W. R. ; von Schneidemesser, E. Low-cost sensors for the measurement of atmospheric composition: overview of topic and future applications; Research Report. World Meteorological Organization (WMO), 2018 https://eprints.whiterose.ac.uk/135994/.
- 3Ródenas García M.SpinazzéA.Branco P. T.Review of low-cost sensors for indoor air quality: Features and applications Appl. Spectrosc. Rev.2022579−1074777910.1080/05704928.2022.2085734 · doi ↗
- 4Yang L. H.Hagan D. H.Rivera-Rios J. C.Investigating the Sources of Urban Air Pollution Using Low-Cost Air Quality Sensors at an Urban Atlanta Site Environ. Sci. Technol.202256117063707310.1021/acs.est.1c 0700535357805 · doi ↗ · pubmed ↗
- 5Pradhan B.Jayaratne R.Thompson H.Morawska L.An application of low-cost sensors to monitor children’s exposure to air pollution at five schools in Queensland, Australia Atmos. Environ.202432512042410.1016/j.atmosenv.2024.120424 · doi ↗
- 6Bi J.Wallace L. A.Sarnat J. A.Liu Y.Characterizing outdoor infiltration and indoor contribution of PM 2.5 with citizen based low-cost monitoring data Environ. Pollut.202127611676310.1016/j.envpol.2021.11676333631689 · doi ↗ · pubmed ↗
- 7Crawford B.Hagan D. H.Grossman I.Mapping pollution exposure and chemistry during an extreme air quality event (the 2018 Kilauea eruption) using a low-cost sensor network Proc. Natl. Acad. Sci. U.S.A.202111827202554011810.1073/pnas.2025540118 PMC 827169734155096 · doi ↗ · pubmed ↗
- 8Kramer A. L.Liu J.Li L.Environmental justice analysis of wildfire-related PM 2.5 exposure using low-cost sensors in California Sci. Total Environ.202385615921810.1016/j.scitotenv.2022.15921836206902 · doi ↗ · pubmed ↗
