OMI Satellite and Ground-Based Pandora Observations and Their Application to Surface NO2 Estimations at Terrestrial and Marine Sites
Abstract
The Pandora spectrometer that uses direct-Sun measurements to derive total column amounts of gases provides an approach for (1) validation of satellite instruments and (2) monitoring of total column (TC) ozone (O3) and nitrogen dioxide (NO2). We use for the first time Pandora and Ozone Monitoring Instrument (OMI) observations to estimate surface NO2 over marine and terrestrial sites downwind of urban pollution and compared with in situ measurements during campaigns in contrasting regions: (1) the South African Highveld (at Welgegund, 26°34′10″S, 26°56′21″E, 1,480 m asl, ~120 km southwest of the Johannesburg-Pretoria megacity) and (2) shipboard U.S. mid-Atlantic coast during the 2014 Deposition of Atmospheric Nitrogen to Coastal Ecosystems (DANCE) cruise. In both cases, there were no local NOx sources but intermittent regional pollution influences. For TC NO2, OMI and Pandora difference is ~20%, with Pandora higher most times. Surface NO2 values estimated from OMI and Pandora columns are compared to in situ NO2 for both locations. For Welgegund, the planetary boundary layer (PBL) height, used in converting column to surface NO2 value, has been estimated by three methods: co-located Atmospheric Infrared Sounder (AIRS) observations; a model simulation; and radiosonde data from Irene, 150 km northeast of the site. AIRS PBL heights agree within 10% of radiosonde-derived values. Absolute differences between Pandora- and OMI-estimated surface NO2 and the in situ data are better at the terrestrial site (~0.5 ppbv and ~1 ppbv or greater, respectively) than under clean marine air conditions, with differences usually >3 ppbv. Cloud cover and PBL variability influence these estimations.
Key Points
- This analysis estimates surface NO2 from total column Pandora and OMI observations over South Africa and over North Atlantic Ocean
- Ground and satellite surface NO2 estimations over land are more similar as compared to over ocean with cleaner air
- Planetary boundary layer height and weather conditions impact the performance of this method in both terrestrial and marine environments
1 Introduction
Satellite remote sensing measurements of trace gases in the atmosphere are a valuable tool for atmospheric chemistry and pollution research by providing synoptic atmospheric observations over a large domain including over ocean. They can supplement regional air pollution monitoring networks (Brauer et al., 2012; He et al., 2013; van Donkelaar et al., 2010). Ground remote sensing measurements support validation of new measurements that focus on North American (Tropospheric Emissions: Monitoring of Pollution (TEMPO; Chance et al., 2013; Fishman et al., 2012; Zoogman et al., 2017)), East Asian (Geo-stationary Environmental Monitoring Spectrometer (GEMS; Kim, 2012)), European and North African (Sentinel-4 (Ingmann et al., 2012)), and global (Tropospheric Monitoring Instrument on Sentinel 5 Precursor (TROPOMI; Loyola et al., 2017) missions. Satellite observations of tropospheric NO2 column abundance have been conducted since 1995 (Burrows et al., 1999) and are well suited for inferring NOx emissions (Chai et al., 2009; Lamsal et al., 2011; Lu & Streets, 2012; Streets et al., 2013) and monitoring air pollution across large spatiotemporal scales (Beirle et al., 2011; Boersma et al., 2008; Duncan et al., 2010, 2013, 2014, 2016; Fishman et al., 2008; Hilboll et al., 2013; Kim et al., 2006; Lamsal et al., 2015; Martin, 2008; Russell et al., 2012). However, high-resolution ground-measured NO2 column abundances often show strong temporal and spatial variability beyond the capability of current satellite observations. These data are essential for understanding the influence of various NOx sources on local and regional pollution episodes in urban and coastal ecosystems (Herman et al., 2009; Reed et al., 2015; Tzortziou et al., 2015).
The Ozone Monitoring Instrument (OMI; Boersma et al., 2002; Bucsela et al., 2006) on board the Aura satellite and the ground-based Pandora instrument (Herman et al., 2009) derive the total amount of NO2, among other chemical species, in a vertical column of air through a retrieval algorithm. However, it is a challenge to relate total column (TC) observations to ambient surface volume mixing ratios. For example, one difficulty comes from accounting for variability in the planetary boundary layer (PBL). Because tropospheric NOx has a relatively short lifetime of several hours up to a few days, the largest concentrations of NO2 are measured in the boundary layer close to emission sources.
Multiple methods can relate the column retrieved abundances to surface concentrations. Boersma et al. (2009) used surface NO2 data to derive a boundary-layer column to show how OMI can capture intraday variability caused by diurnal variation of NO2. Lamsal et al. (2008, 2010) showed that ground-level NO2 concentrations obtained from OMI simulated with a chemical transport model agree well with corrected in situ measurements in the U.S. and Canada. Similar studies (e.g., Halla et al., 2011; Lee et al., 2011; Leigh et al., 2007; Luo et al., 2012; Petritoli et al., 2004; Piters et al., 2012), comparing satellite or ground-retrieved columns to surface measurements, have utilized complex chemical transport models and/or large averaging times (weekly, seasonal, or annual) that effectively average out the influence of PBL variability. These methods are often inadequate for interpreting health-related NO2 fluctuations that occur on the order of hours. Knepp et al. (2015) first estimated surface mixing ratios of daily NO2 with TC NO2 measurements from a ground-based Pandora spectrometer at a rural and an urban site in the U.S., scaling column concentrations by the boundary layer height. Similarly, Flynn et al. (2014) expanded this study to include several ground sites from the DISCOVER-AQ MD 2011 campaign ranging from suburban (Beltsville) to rural (Aldino) and found the degree of correlation ranging from R2 = 0.29 to 0.01, respectively, between in situ and column NO2 from Pandora instruments. Here we adapt the Knepp et al. (2015) method (referred to as Knepp–Kollonige method below) to estimate surface NO2 concentration with ground- and satellite-based TC observations taken over marine and terrestrial sites downwind of urban pollution when local measurements of the boundary layer height are not available. Rather than using a modeled PBL height, the Knepp–Kollonige method employs for the first time satellite observations of the PBL height when estimating surface NO2 concentrations.
The objective of this study is to evaluate the first application of the Knepp–Kollonige method on TC NO2 measurements. Both a marine (along the U.S. East Coast in the Atlantic Ocean) and a terrestrial (Welgegund measurement site that is ~120 km from the Johannesburg–Pretoria (J-P) megacity, South Africa) environment were selected for this evaluation. Surface concentrations of NO2 (expressed in volume mixing ratios) measured by ground-level trace gas analyzers at the locations are compared to estimated volume mixing ratios obtained from tropospheric vertical column densities retrieved by (a) coincidental, co-located vertical ground-level Pandora measurements and (b) close overpasses of the OMI instrument aboard the NASA Aura satellite. The height of the PBL, a key variable in this method of estimation for scaling the TC NO2 measurements, is derived from three different approaches, depending on the site and available data: observations from radiosonde launches at the Irene weather station (150 km northeast of Welgegund); co-located Atmospheric Infrared Sounder (AIRS) observations; and a model simulation. Case studies are presented to demonstrate the strengths and weaknesses of this method over each locale and are meant as a prototype for TROPOMI and upcoming geostationary satellite data sets.
2 Data and Methods of Analysis
2.1 Site Locations and In Situ Measurements
Two locations with separate measurement campaigns were selected for the evaluation of the Knepp–Kollonige method for estimating surface NO2 concentrations from TC NO2 abundances. The results are compared to available in situ gas analyzer measurements described in the following sections and Table 1.
Campaign/site name | Location (lat, lon) | Instrument | Data | Time resolution for analysis |
---|---|---|---|---|
Welgegund, South Africa | 26°34′S, 26°56′E | Teledyne AU 200 gas analyzer | NO detected by chemiluminescence w/ O3, NO2 = NOx − NO | 15 min and 1 h |
Environnement s.a.41M | O3 concentration | 15 min | ||
Rotronic MP 101A | Temperature, relative humidity | 15 min and 1 h | ||
Vector A101ML, Vector W200P | Wind direction and speed | 15 min and 1 h | ||
Vaisala QMR102 | Precipitation | 1 h | ||
Pandora (#8) spectrometerc | Total column NO2 | 15 min and 1 h | ||
DANCE, North Atlantic Ocean | 35–39°N, 72–75°W | Aerodyne CAPS NO2 | NO2 concentration | 15 min |
— | Temperature, relative humidity | 15 min | ||
Pandora (#24) spectrometerc | Total column NO2 | 15 min and 1 h |
- a A full list of gas, aerosol, and ecosystem observations at Welgegund is available at www.welgegund.org.
- b Measurements described in further detail in Petäjä et al. (2013) and Tiitta et al. (2014). Martins et al. (2016) describes in situ ship measurements from DANCE.
- c Instrument from Sci-Globe.
2.1.1 Welgegund, South Africa Campaign
A prominent NO2 “hotspot” observed in remote sensing maps derived from satellite observations is present over the South African Mpumalanga Highveld (Duncan et al., 2016; Lourens et al., 2011, 2012) with a similar intensity to other major regional pollution “hotspots,” such as eastern China. This anomaly is attributed to the presence of 15 coal-fired power stations and a number of metallurgical smelters, as well as a large petrochemical industry. In addition to the well-known NO2 “hotspot,” there are also widespread regional sources, including traffic, biomass burning, and informal settlements with extensive coal and biofuel use, as well as large industrial point sources from metallurgical, mining, and petrochemical operations (Freiman & Piketh, 2003; Laakso et al., 2008, 2012; Venter et al., 2012, 2016). The plumes originating in the Highveld typically mix down to ground level only under very unstable conditions. For example, Balashov et al. (2014) reported relatively low NOx surface concentrations over the Highveld with no significant NO2 trend. Duncan et al. (2016), using 10 years of satellite observations, discovered regions of both increasing and decreasing trends in NO2 over the South African Highveld depending on proximity to sources. With the troposphere usually well stratified (Newell et al., 1999), elevated layers of accumulated pollutants are common over southern Africa (Cosijn & Tyson, 1996; Freiman & Tyson, 2000; Garstang et al., 1996; Giannakaki et al., 2015).
The Welgegund measurement site (26°34′10″S, 26°56′21″E, 1,480 m above sea level) is located approximately 25 km northwest of Potchefstroom (Figure 1, red star, and Figure 2, magenta triangle) and 100 km southwest of the J-P megacity with no significant local pollution sources (Beukes et al., 2013; Booyens et al., 2015; Jaars et al., 2014, 2016; Tiitta et al., 2014; Vakkari et al., 2014). Welgegund is frequently impacted by air masses from a number of source regions including J-P (Jaars et al., 2014; Tiitta et al., 2014), the industrialized Mpumalanga Highveld, the Vaal Triangle to the southeast, known for its metallurgical and petrochemical processing facilities, and the Bushveld Igneous Complex (Venter et al., 2012) with additional mining and metallurgical activities to the northeast (Figure 1). Air masses, passing over the regional background from the west of Welgegund, are relatively clean during the wet season (South African summer, but extends November–April) with no significant point sources (Jaars et al., 2014; Tiitta et al., 2014). During the South African winter months and dry and biomass burning seasons (May–October), limited fire pollution originates from this sector due to its aridity and low biofuel load (Vakkari et al., 2014).


Surface measurements at Welgegund are described in previous studies starting with a mobile atmospheric monitoring trailer in Petäjä et al. (2013) and the permanent placement of instruments at the site in May 2010 in Tiitta et al. (2014) and Beukes et al. (2015). Table 1 summarizes the primary trace gases (O3 and NOx) and meteorological parameters from 25 January to 15 March 2011 that were used in this study. Temperature, relative humidity, and precipitation are recorded every minute. The trace gas instruments share a Teflon (polytetrafluoroethylene) sampling line and record measurements at 1 min intervals.
A Teledyne 200 AU gas analyzer detects NO by chemiluminescence after reaction with O3 and records NO and NOx measurements. Nitrogen dioxide is determined by conversion to NO through a reduction reaction with molybdenum and subtracting NO from the sum. However, this gas analyzer also converts other reactive, oxidized nitrogen species, NOz (HNO3, PAN, and other organic nitrates), and can cause overestimations of NOx (and thus NO2) ranging between ~17% to greater than 50% (Dunlea et al., 2007; Grosjean & Harrison, 1985; Steinbacher et al., 2007). For the time period studied, the in situ NO2 gives the upper limit of the NO2 at surface, but due to the abovementioned issue, the actual NO2 concentration can be up to 50% less. We adopted an uncertainty of ~50% for this type of gas analyzer NO2 measurements.
The Welgegund instruments are maintained weekly and calibrated every 3 months. Any possible drift in instrument response between calibration checks is corrected for in the quality-controlled data according to Laakso et al. (2008). All surface data were quality controlled and then averaged over 15 min intervals for comparison to the remotely sensed observations. During this time period, cloud cover and precipitation (strongly influenced by the SA Highveld wet season prevalent in late spring, summer, and early autumn) were highly influential on local air photochemistry. As a result of poor (cloudy/rainy) weather conditions at the station, only 22 days of observations were used between 25 January and 15 March 2011. This time period of observation was a campaign of opportunity and specific to the Fulbright Award afforded to A. M. Thompson (January–April 2011), which only included the wet season for the Welgegund site. This analysis is a demonstration of the application of measurements taken at Welgegund through the limited time period of the Fulbright Award.
2.1.2 North Atlantic Ocean, DANCE Campaign
The Deposition of Atmospheric Nitrogen to Coastal Ecosystems (DANCE) campaign encompassed the area of 34–39°N, 71–76°W (Figure 3) in the North Atlantic Ocean off the mid-Atlantic coast on 29 July to 15 August 2014 where trace gas and meteorological measurements were collected on board the Research Vessel Hugh R. Sharp. Leaving from Lewes, Delaware (38.79°N, 75.15°W; magenta triangle in Figures 3a and 3b), the ship targeted semipermanent anticyclonic eddies at ~38.35°N, 72.56°W and ~35.64°N, 72.16°W (black solid line traces ship track in Figures 3a and 3b). Due to the scarcity of atmospheric measurements offshore, this data set is important for understanding (1) atmospheric deposition, which influences marine ecosystems (Reay et al., 2008), and (2) any biases between TC trace gas abundances from remote sensors and in situ instruments over open ocean, which impact retrievals of other constituents such as ocean color (Fishman et al., 2012). Figure 3 is an example of the daily OMI NO2 tropospheric column abundances observed over the Atlantic Ocean during the main pollution event observed over open water on 5–6 August 2014.

Martins et al. (2016) detail the DANCE onboard ship measurements including in situ NO2 used in this analysis (see also Table 1). Nitrogen dioxide data collected with an Aerodyne CAPS NO2 instrument was calibrated with an uncertainty of 5%. The in situ measurements were filtered to remove influences from the ship's exhaust because of the placement of the inlet port side. TC NO2 measurements from a Pandora instrument on board the ship are described in Martins et al. (2016) and the following section.
2.2 Total Column Observations of Nitrogen Dioxide
Co-located total column observations of NO2 retrieved from a Pandora spectrometer at each site (Table 1) and OMI on board the Aura satellite were selected for this analysis (Table 2). All TC NO2 observations and their respective surface estimations derived from the Knepp–Kollonige method were compared to each other here. Additionally, Martins et al. (2016) showed direct comparisons between TC NO2 and in situ surface measurements of NO2 during the DANCE campaign.
Instrument | Data | Time resolution for analysis |
---|---|---|
Aura OMI | Total, stratospheric, and tropospheric column NO2, and cloud fraction | 1–2 obs per day |
Aqua AIRS | Planetary boundary layer top | 1–2 obs per day |
2.2.1 Pandora
At the Welgegund site during 25 January to 15 March 2011, the Sun-only Pandora (280–500 nm) detector, mounted on a computer-controlled Sun tracker, was installed temporarily to retrieve total column O3 and NO2 densities approximately every 2 min using direct Sun irradiances. Herman et al. (2009) provides a detailed technical characterization of Pandora, and it is briefly described here. The instrument has a 1.6° field of view and spectral resolution of 0.6 nm. Uncertainties in ambient temperature, absorption cross sections, and cloud cover introduce uncertainties into the Pandora TC NO2 retrievals and are accounted for in the instrument's retrieval algorithm described in Herman et al. (2009) and more recently in Tzortziou et al. (2012, 2015).
During DANCE, a ship-based direct-Sun Pandora spectrometer was mounted on top of the bridge in front of the ship's exhaust to avoid contamination. TC NO2 and O3 were similarly retrieved with data collected on 16 out of 17 days of operation. Further data and instrumentation details are outlined in Martins et al. (2016), including analysis of the influence of the ship's plume on these TC measurements.
Validation studies with other ground-based instruments revealed that Pandora total vertical column NO2 densities compared well with their retrievals. For example, a Fourier transform ultraviolet spectrometer agreed within 4% of Pandora retrieved TC NO2 (Herman et al., 2009). Wang et al. (2010) showed that Pandora slant column abundances agree with those obtained from a multifunction differential optical absorption spectroscopy (DOAS) to within 5%. Validation results from the CINDI (Cabauw Intercomparison Campaign for Nitrogen Dioxide Measuring Instruments) campaign have also indicated that MAX-DOAS slant column densities of NO2 from multiple instruments, including Pandora, have agreement of approximately 5–10% (Piters et al., 2012) under clear sky conditions. This measurement site of the CINDI campaign, Cabauw, the Netherlands, is similar to Welgegund in that it does not have major local sources of trace gases like NO2 but rather is impacted from transported pollution being downwind of urban areas (i.e., Rotterdam, Netherlands).
Because of temporal and spatial sampling, Pandora TC NO2 retrievals capture tropospheric NO2 variability better than once daily satellite-retrieved TC NO2 products (e.g., OMI; Herman et al., 2009; Reed et al., 2015; Tzortziou et al., 2015). For this analysis, we averaged Pandora Level 4 quality-controlled vertical TC NO2 data over 15 min intervals to match the suggested data frequency for the gas analyzers (Table 1). Absolute error in Pandora retrievals is ±0.1 Dobson units (0.1 DU ~ 2.69 × 1015 molecules/cm2), with a precision of about ±0.01 DU in clear skies according to data documentation. Note this error grows with noise created by clouds in a given retrieval, such as the conditions observed during our 2011 Welgegund observing period. The Pandora data were filtered to only include data with uncertainty in TC NO2 < 0.05 DU (~1.3 × 1015 molecules/cm2) for retrieval error and nonfunctioning periods and normalized root mean square of weighted spectral fitting residuals <0.05 for cloud cover threshold exceedances as suggested by Herman et al. (2009) and Tzortziou et al. (2012). The filtered Pandora TC NO2 retrievals have uncertainties of ~7% and ~17% during the Welgegund and DANCE campaigns, respectively (including both polluted and clean atmospheric conditions).
2.2.2 Ozone Monitoring Instrument
OMI, a near-UV/visible charged-coupled device nadir-viewing spectrometer, was launched on the NASA EOS Aura satellite in July 2004 and crosses the equator between 13:40 and 13:50 local time every day as a part of the A-Train satellite constellation (Boersma et al., 2007; L'Ecuyer & Jiang, 2010; Levelt et al., 2006; Schoeberl et al., 2006). OMI measurements cover a spectral range of 264–504 nm and provide retrievals with a spatial resolution of 13–26 km along and 24–128 km across track, achieving daily global coverage (every 2 days) of several trace gases, which include O3, NO2, SO2, and HCHO. The OMI Level 2 version 3 (OMNO2-L2) retrieval algorithm uses a DOAS fit to calculate the slant column amount of NO2 (Bucsela et al., 2006, 2013). Vertical TC NO2 amounts are determined from the slant column abundances using air mass factors (AMFs), which are affected by both cloud cover and aerosols. Reed et al. (2015) demonstrated the effect of aerosols on OMI NO2 column observations in urban and rural environments of the Baltimore/Washington, DC region, recognizing aerosols as a large source of error in AMFs. A monthly mean climatology of NO2 profile shapes from the Global Modeling Initiative Chemical Transport Model (Strahan et al., 2007) accounts vertical variability in NO2 in the retrieval algorithm (Boersma et al., 2002; Bucsela et al., 2006, 2013; Lamsal et al., 2014).
We use total, tropospheric, and stratospheric column NO2 from the standard OMNO2-L2 data product available at the NASA GES DISC (Goddard Earth Sciences Data and Information Services Center; http://disc.sci.gsfc.nasa.gov/Aura/data-holdings/OMI/omno2_v003.shtml). Uncertainties in the tropospheric NO2 columns (TropNO2) arise from the uncertainties in the retrieval of the slant column abundances, AMF calculations, and the separation of stratospheric and troposphere columns. The total expected error, dominated by AMF uncertainties over continental polluted regions, is <20% for clear skies and 30–80% under cloudy conditions (Dobber et al., 2008). OMNO2-L2 data files include variables such as Cloudfraction and XTrackQualityFlag that are used for filtering and quality control. We selected retrievals in all cross track positions with cloud fractions <20% and within 100 km of the Welgegund station or the ship track in the Atlantic. We also accounted for the “Row Anomaly” with XTrackQualityFlag = 0; this is a marker for Level 1B OMI radiances that are compromised by high signal-to-noise ratios due to limitations of the detector (Dobber et al., 2008). Figure 2 is an example of OMI TropNO2 abundances over South Africa on 10 March 2011 (day with minimal cloud coverage), indicating hotspot values >10 × 1015 molec/cm2 within the J-P megacity. A map of OMI TC NO2 during DANCE is shown in Figure 3.
As a result of weather conditions (i.e., rain and cloud cover) at the Welgegund station between 25 January and 15 March 2011, only 22 days of observations (satellite and ground) could be used for our comparisons based on the quality controls outlined. For the DANCE case study, the Knepp–Kollonige method could be applied to only 8 days of observations of the total 17 day campaign. Each day an average TC abundance of NO2 was used for the area surrounding the site. Because a Pandora was located at either “ground” location, Welgegund site, or on board the ship during DANCE, OMI retrievals within 100 km or ~1° were collected. A mean TC was then calculated from those retrievals for a direct comparison to the Pandora TC.
2.3 Planetary Boundary Layer Height Variability
The estimation of surface mixing ratios from tropospheric column abundances depends on the daily variability of the PBL, which changes throughout the day, responding to local surface heating and synoptic meteorology. Based on Korhonen et al. (2014), where PBL tops showed good correlation with satellite and modeled observations at a measurement site (Elandsfontein) near J-P in the Mpumalanga industrial Highveld of South Africa, we selected both modeled and observed PBL heights for the analysis at Welgegund (Figure 4). Because PBL heights were not directly measured at the Welgegund site, we used three methods for calculating PBL heights: a model simulation, spatially and temporally co-located AIRS satellite observations, and observations from radiosonde launches at a nearby weather station. Each technique was considered for the column conversions; their respective observations are presented in Figure 4.

The first method used to determine PBL structure and evolution was a model simulation using Monin–Obukhov similarity theory. Inputs into the fifth generation Mesoscale Model (MM5) included hourly-averaged temperature, humidity, pressure, wind speed and direction, potential temperature gradient, global upwelling radiation, global incoming radiation and net radiation from the Welgegund measurement site, and NCEP reanalysis. Using the Hong and Pan (1996) boundary layer parameterization scheme, simulations provided hourly vertical temperature profile estimates. The hourly boundary layer height was derived similarly to Cimorelli et al. (2004) as used in the Aermod air dispersion regulatory model. For comparison to nearby observed PBL heights (Figure 4a), 1 h mean modeled PBL heights above sea level (ASL; green squares) within 1 h of the Aura overpass for days with Pandora and OMI measurements are presented for Welgegund.
Observations from the AIRS instrument, on board the NASA EOS Aqua satellite since 2002 and part of the A-Train satellite constellation, provide daily global observations within 15 min of the Aura satellite (Aumann et al., 2003) and were collected for a second method of PBL height estimations. AIRS L2 V6 support files (described in http://disc.sci.gsfc.nasa.gov/AIRS/documentation/v6_docs/v6releasedocs-1/V6_L2_Product_User_Guide.pdf) contain a new product, PBL top, derived from temperature and humidity profiles with 100 levels via the method in Martins et al. (2010). The boundary layer top height is the pressure of the level with the largest gradients of potential temperature and relative humidity (relative to liquid phase of water) calculated on the support pressure 100 layer grid. Within the PBL, the data set has ~25 hPa vertical resolution and 45 km horizontal resolution (Susskind et al., 2006). AIRS quality control flags for PBL top are heavily dependent on the relative humidity and temperatures retrieved at the surface and throughout the lower troposphere (Martins et al., 2010; Susskind et al., 2010). Filtered AIRS retrievals were co-located temporally and spatially with the selected OMI retrievals for the surface NO2 estimation method (Figure 4a, red circles) and compared to modeled PBL heights. The mean difference between 1,000 bootstrap samples of these two data sets is ~ 30% and statistically significant with 95% confidence (shown in Figure 4b).
In addition to model-derived and satellite-observed PBL heights, we also used radiosonde observations from a South African Weather Service station at Irene, 150 km northeast of Welgegund and at a similar elevation above sea level ~1,500 m, to determine an approximate height of the PBL for a similar environment. The radiosonde data were smoothed to 100 m resolution using a moving average filter. The World Meteorological Organization definition of the PBL top was calculated for simplicity (i.e., temperature gradient greater than 5 K/km). Radiosonde launches that corresponded in time with the Pandora and OMI observations (daytime overpass) numbered 16 out of the total 22 days. Figure 4a shows the calculated PBL heights from the radiosondes (blue stars). The median sonde-PBL height for the area near Welgegund is ~2,700 m ASL compared to ~3,000 m ASL and 4,000 m ASL for AIRS and the model, respectively (Figure 4b). This result suggests an approximate mixing layer depth of ~ 1.2 km measured from the sondes versus ~1.5 km as observed by AIRS. The mean difference between 1,000 bootstrap samples of the sonde-PBL and the AIRS-PBL is ~11%. These PBL heights are consistent with recent ceilometer measurements at Welgegund (unpublished) ranging from ~1 to 2 km for ~11:30 UTC, the overpass time for Aura and Aqua.
PBL heights from the AIRS observations (within 100 km) were selected for the column conversions over Welgegund based on the better comparison with the sonde-PBL and for having at least once daily observations of the PBL. For DANCE, Martins et al. (2016) presented model-derived PBL heights ranging from ~160 to 950 m during the campaign time period. AIRS-observed PBL heights within the DANCE region (34–39°N; 72–76°W) and time period are shown in Figure 5, and median values range between ~1,000 and 2,000 m ASL. This large difference between observations and simulations over ocean highlights the need for in situ PBL height measurements for validation. An infrared sensor, like AIRS, requires large thermal contrast between surface and atmosphere for strong signals. Over ocean, low signal strength contributes to higher retrieval errors and large variability in AIRS-observed PBL heights versus over land where signal strengths are higher (i.e., the Welgegund site). Due to no in situ ship PBL height measurements, the AIRS-observed values were selected for the DANCE surface estimation calculations to be consistent in using observations over model simulations and again provided similar data coverage with once daily daytime PBL heights.

2.4 Surface NO2 Estimation, Knepp–Kollonige Method
NO2 mixing ratios often display considerable vertical gradients within the PBL (Piters et al., 2012). The method for estimating surface NO2 from a column amount developed by Knepp et al. (2015) assumes that surface emissions are well-mixed within the PBL as a first-order approximation (Leigh et al., 2007). To calculate Pandora TropNO2 densities, daily OMI stratospheric column NO2 densities (using the OMNO2 version 3 L2 data) were resampled to the same temporal frequency as the quality-controlled Pandora measurements and subtracted from the Pandora TC NO2 observations.
This method assumes that the NO2 stratospheric column remains constant on a daily basis to simplify the calculation used to estimate surface mixing ratios. Also, this method assumes that most NO2 in the troposphere is within the PBL, and the free troposphere contribution can be considered negligible under certain conditions. According to Lee et al. (2011), where surface NO2 mixing ratios were inferred from OMI TCs over rural and suburban areas downwind of Detroit, MI, the free tropospheric contribution on average was 0.8 × 1015 molecules/cm2 and smaller than most tropospheric columns even in the rural areas similar to our two locations. For our analysis, these conditions are most likely to occur if pollution transport is within the PBL or instabilities in the lower troposphere allow vertical transport of pollution layers toward the surface. The stratospheric contribution is significantly less than tropospheric column amount under polluted conditions, allowing for this assumption to work. This may break down in clean air conditions, such as a marine environment, so analysis was not completed at each site if stratospheric columns (~2 × 1015 molecules/cm2) were close to or larger than the tropospheric column amounts to reduce the uncertainty this assumption adds to the estimated mixing ratios.


3 Results and Discussion
3.1 Pandora and OMI TC NO2
When either the Pandora data or in situ data was selected to be co-located with OMI, the measurements were a ±30 min mean value. The overpass time of OMI did vary (centered around 11:30 UTC), and the data were selected accordingly with the possibility of having more than one overpass. For Pandora comparisons at Welgegund, only three out of 22 days of observations had two overpasses for comparison (see Figure 6). For in situ versus OMI comparisons, this was also the case. For DANCE, data were selected similarly, and only one day of the 8 days of observations had more than one overpass.

To determine the temporal and spatial limitations of OMI for detecting NO2 within the boundary layer over Welgegund, we first compared Pandora TC NO2 measurements averaged over 1 h of the OMI overpass on 22 days, January 25 to 15 March 2011. OMI retrievals within 100 km (or ~1°) were collected for each day, and a mean TC was calculated from those retrievals for a direct comparison to the corresponding Pandora TC amount. Figure 6 (and insert) shows a relatively good correlation with OMI TC NO2, that is, a correlation coefficient of R = 0.44 with a corresponding slope = 0.78, considering the rain and cloud cover during the campaign. The noise in the data, or uncertainties, in Figure 6 for the Pandora and OMI TC NO2 measurements under the cloudy conditions we observed at Welgegund was ~7% and ~35%, respectively. The difference between the two instruments was ~ 20%, which is comparable to what Reed et al. (2015) determined for similar cloudy conditions.
In general, OMI observed lower TC NO2 abundances than Pandora during the experiment, and the largest discrepancies occurred on the more polluted days with higher cloud cover over the area (see Figure 7 ground measurements of O3, NOx, and NO). This result is consistent with areas impacted by strong (local or transported) pollution (Reed et al., 2015; Tzortziou et al., 2015). In particular, Pandora retrieved higher TC NO2 over Welgegund on ~days 27, 47, and 74 (Figure 6) when fresh pollution (high NOx and low O3) was measured by ground analyzers as shown in Figure 7. The coarse spatial resolution of the OMI retrievals contributes to reduced NO2 values here. When relatively clean rural air is adjacent to relatively more polluted air within the same satellite-retrieved field of view, this can cause averaging of higher NO2 concentrations with lower NO2 concentrations. This effect may lead to systematic OMI underestimations. Multiple days also correspond to cloudy and rainy conditions, and this was most likely the cause for the largest differences between Pandora and OMI TC NO2. Cloud cover affects TC retrievals through misinterpretation of the depth of the column observed by the remote sensing instrument (i.e., difficult for OMI to see below clouds and Pandora to see above). OMI could not detect enhanced NO2 beneath the clouds; both effects (coarse resolution and cloud cover) lead to OMI underestimations. This limitation will result in low surface estimations of NO2 from our OMI observations.

The diurnal cycle of NO2 and O3 measured at Welgegund for this time period is shown in Figure 8. Restricted by the once daily observations, OMI TCs cannot capture this variability like the Pandora instrument (Knepp et al., 2015). The highest surface NO2 concentrations are observed earlier in the day (morning) before the Aura overpass of Welgegund at 11:30UTC. This denotes the importance of high temporal frequency measurements when discussing TC NO2 variability, whether from a Pandora instrument or from upcoming geostationary missions.

Martins et al. (2016) presented comparisons between coincident and co-located OMI and Pandora-ship TC NO2 on 8 days of DANCE observations. A total least squares fit between the data sets resulted in a slope = 0.46 and correlation coefficient of R = 0.57 with OMI overestimating in all but one data pair (data not shown here), which is common at nonurban sites (Reed et al., 2015; Tzortziou et al., 2015). Better correlation of the TC NO2 abundances over ocean suggests the retrieval of similar columns unobstructed by cloud cover versus over Welgegund, where OMI retrieved a tropospheric column obstructed by clouds with the majority of NO2 below the cloud cover on multiple days based on the surface and Pandora observations. However, the Welgegund campaign as a whole had a better 1:1 ratio of Pandora to OMI TC NO2 abundances. During DANCE, Pandora TC NO2 values (not shown) ranged from 0.1 to 0.3 DU (2.67 × 1015–8.01 × 1015 molecules/cm2), whereas at Welgegund, Pandora measured TC NO2 values from 3 × 1015 molecules/cm2 to greater than 1 × 1016 molecules/cm2. This result can be attributed to more frequent NO2 enhancements from nearby Highveld sources (and higher signal strengths) captured by both OMI and Pandora at the Welgegund site versus over clean marine air (only impacted once by downwind pollution transport from the mid-Atlantic U.S.).
3.2 Pandora-Estimations Compared to Surface NO2
3.2.1 Welgegund Campaign
Pandora estimates of surface NO2 at the Welgegund site were calculated using the Knepp–Kollonige method (equation 1 in section 2.4), where co-located and coincident AIRS-observed PBL heights, OMI stratospheric NO2 columns, and number density (N) were resampled to the 15 min averaged Pandora time steps. Figure 9 presents a comparison between Pandora-estimated surface NO2 mixing ratios and the in situ surface NO2 mixing ratios at 15 min intervals. Pandora (green squares) generally agrees with the surface (blue stars) NO2 concentrations (Figure 9a); estimates were mostly within the uncertainty limit of the gas analyzer ~50%. The mean NO2 surface mixing ratio for this study as measured with the gas analyzer was 1.55 ppbv, while the Pandora-estimated surface mean was ~1.2 ppbv (a mean bias of ~20%). A reduced-major axis (RMA) fit between the two data sets (blue line in Figure 9b) resulted in a slope of 0.661. With PBL height as a major source of uncertainty, the resampling of daily AIRS observations to the Pandora time intervals will affect the Pandora-estimated surface mixing ratios.

A low correlation coefficient of R = 0.10 and the diurnal differences between the estimates and the in situ measurements (Figure 9a) may also be attributed to different sources of NO2 at the ground site. If a pollution plume was advected over the Welgegund site with NO2 aloft, Pandora may detect an enhancement in TC NO2 before high concentrations of NO2 were measured at the surface. Before any downward vertical mixing, Pandora estimates would peak before the surface in situ measurements (day 42, Figure 9a). On days 42–44, wind direction at the site suggests NO2 most likely came from surrounding industrial sources to the northeast (Western Bushveld Igneous Complex) and southeast (Vaal Triangle). Conversely, if the source of NO2 was local, any enhancements in NO2 measured at the surface may be observed prior to Pandora detection unless convection was present. This effect could account for any temporal differences in the diurnal variability observed from Pandora and the surface monitor.
In Figure 9c, the histogram is centered on a 0–0.5 ppbv difference between the Pandora-estimated and in situ surface NO2. The largest discrepancies between Pandora and the gas analyzer, where surface NO2 was significantly underestimated by Pandora, fell on days 56–59 (25–28 February 2011) and days 71–74 (12–15 March 2011). During both time periods, NO2 concentrations were enhanced at the surface (>5 ppbv; Figure 9a), but Pandora-estimated surface NO2 mixing ratios were ~2 ppbv or less. Low clouds and intermittent rain in the region on days 56–59 likely impacted the Pandora TC NO2 used in the calculation. For days 71–74, hard rain and clouds also influenced Pandora-estimated surface NO2. In previous campaigns on the South African Highveld (i.e., SAFARI-1992), intense rainfall released NO from soil. In an agricultural environment like Welgegund, the soil can add NO emissions (thus NO2) to the air that can be captured by the in situ instrument more easily under these weather conditions. These large differences between the ground measurements and Pandora could also be attributed to interference species measured by the gas analyzer using a molybdenum converter. In Piters et al. (2012), a comparison between this type of instrument and a photolytic converter found differences as large as 5 ppbv during the day time. The solar UV needed to generate these interfering species was still present on many of these days where rainfall was not constant the entire day. The group of outliers in Figure 9b where surface concentrations are much higher than those estimated from Pandora TC NO2 could be directly related to the fact that this gas analyzer measures more than just NO2 as compared to Pandora.
The only notable time period during which Pandora overestimated surface NO2 in Figures 9a and 9b occurred on days 62–63 (3–4 March) when high clouds over the region affected the TC NO2 retrievals. Pandora TropNO2, derived from Pandora TC NO2 and daily OMI stratospheric columns, could be overestimated if the OMI retrieval had difficulty in separating the stratospheric and tropospheric column abundances in the presence of high clouds. This result suggests that OMIstrat (~2.5–3 × 1015 molecules/cm2) could influence our estimations under those particular observed conditions, but this is only of importance under clean atmospheric conditions. The best agreement between the Pandora-estimated and ground site surface NO2 measurements (Figure 9b denoted by red stars) followed after this time period (days 63–65; 4–6 March 2011), where their relationship was close to the 1:1 line in Figure 9b (black dashed line).
3.2.2 DANCE Campaign
Using the Knepp–Kollonige method, a similar analysis was completed over open water during the DANCE campaign with the onboard Pandora TC and in situ NO2 ship observations. Co-located and coincident AIRS-observed PBL heights, OMI stratospheric NO2 columns, and number density (N, calculated with in situ ship measurements) were interpolated to the 15 min averaged Pandora time steps. Eight days (out of 17 total days) of Pandora estimations and ship observations were compared and are presented in Figure 10.

The surface estimate of NO2 using the Pandora data was largely underestimated (Figure 10a), assuming the in situ NO2 measurements from the ship are true, with differences of ~3 ppbv or greater (higher than those observed at Welgegund). A RMA fit resulted in a slope of 0.0165 and a statistically significant low correlation coefficient of R = 0.0818 (Figure 10b) based on the bootstrap method using 10,000 samples. Pandora did observe a change in TC NO2 initially, but missed most of the polluted days during DANCE (06–10 August) due to cloudy conditions, affecting the Knepp–Kollonige method; whereas this method demonstrated good performance at Welgegund, South Africa, with a mean bias less than 1 ppbv. The main source for error with the DANCE data set is attributed to the large variability and uncertainty associated with the AIRS-observed PBL heights (Figure 5) over water discussed in section 2.3. Overestimating PBL height leads to underestimating Pandora NO2 mixing ratios due to their inverse relationship (equation 1). This dependence appears to have a larger effect over a marine environment with shallow boundary layer heights and limited measurements versus a land site like Welgegund with typically higher PBL heights. Martins et al. (2016) showed that model PBL heights were not necessarily an improvement over the AIRS-observed PBL heights over water. Based on these results, we can conclude that without in situ PBL height measurements, estimates of surface NO2 from Pandora TC NO2 measurements over ocean are likely to be in error.
3.3 OMI Estimations Compared to Surface NO2 Measurements
The remotely sensed (Pandora and OMI) TC data are sensitive and responsive to clear changes in boundary layer concentrations at Welgegund. For example, higher TC NO2 abundances during late February and early March are consistent with the ground measurements of Figure 7, where more pollution (higher O3 and NOx concentrations) was measured after 11 February (day 42). This result was also captured by the Pandora estimates at Welgegund. However, the following results demonstrate the limited use of the Knepp–Kollonige method with OMI-like observations over land (i.e., Welgegund) and marine (i.e., DANCE) environments.
For the 22 days of coincident and co-located observations at Welgegund, OMI-estimated surface NO2 levels using AIRS-observed PBL heights (from equation 2) are compared to in situ surface measurements (Figure 11). Because OMI has only one to two overpasses per day (at ~1:30 pm local time or 11:30 UTC), a direct comparison with the 15 min means of Pandora and the gas analyzer (in Figure 9) is not representative of the performance of this method applied to OMI observations. Diurnal cycles of surface NO2 (top panel, blue) and O3 (bottom panel, red) measured at the Welgegund site are presented in Figure 8. The time of day of the OMI overpass over Welgegund actually corresponds to a minimum in surface NO2 concentrations, and the range of NO2 concentrations is between 0 and 5 ppbv (not including outliers, open circles). By 1:30 pm (11:30 UTC), local photochemistry is dominant, and most of the NO2 at the site has been lost to O3 production (i.e., O3 at a maximum concentration). For a better representation of the observed conditions at Welgegund, only mean surface NO2 mixing ratios measured by the gas analyzer within 1 h of the OMI overpass are considered for a direct comparison to OMI-estimated surface NO2.

A RMA fit of the OMI-estimated versus in situ surface NO2 in Figure 11a indicates a slope of 0.225 and a statistically significant correlation coefficient of R = 0.145. Differences between the two data sets range between 0 and 4 ppbv (Figure 11b) with the most frequent as ~1 ppbv, which is larger than the bias of ~0.5 ppbv from Pandora estimates at Welgegund. The smallest differences evolved from favorable weather conditions. During March (days 66–75), the largest differences are a factor of 2 or more between the surface measurements and the OMI estimations (not shown). Rain and cloud cover during this time period affected the OMI retrievals and caused underestimates of the surface NO2 concentration. Another reason for underestimations by OMI is the assumption that NO2 is always evenly mixed in the PBL, a requirement for the conversion from TC to TropNO2 with this method. This assumption is not always the case because fresh plumes might take a few hours to get to Welgegund from the nearest large point sources without mixing to the full height of the PBL.
The days where our OMI-estimated surface NO2 compared well with the in situ measurements suggest that OMI is sensitive to boundary layer NO2 over Welgegund. However, larger satellite signals (more urban than rural locations) of NO2 under clear sky conditions would be best for this method when applied to satellite observations. This highlights the influence of cloudy versus clear sky conditions and related effects on the PBL. The effect of coarse spatial and temporal resolution is still problematic for OMI estimations, and new higher spatial resolution OMI products are being tested in future analyses with the Knepp–Kollonige method to determine any improvements.
For the DANCE campaign, the analysis was completed similarly to the Pandora estimates with the OMI tropospheric column NO2 values instead. Limited coincidence and co-location between OMI and the in situ ship measurements (~10 data pairs) led to sparse OMI estimations. The differences between OMI-estimated surface NO2 and the in situ ship measurements were mostly 3 ppbv or greater with OMI missing any NO2 transport downwind during the pollution event. Typical OMI-estimated surface NO2 was ~1 ppbv or less versus in situ values of ~3 ppbv or greater at overpass times. The results are not shown because differences were larger than the Pandora comparisons and can be attributed to similar effects that impacted the Pandora estimations discussed in section 3.2.2. In situ PBL heights would improve this method's performance over ocean versus using highly variable AIRS-observed PBL heights. The clean, source -free marine NO2 amounts also approach the detection limit of the OMI instrument, which demonstrates the limitations of this method over water with the current generation of satellite instruments.
4 Summary and Conclusions
The first comparisons of total column NO2 from a direct-Sun Pandora spectrometer located in South Africa (Welgegund site on the Highveld, southwest of the J-P conurbation) and over open ocean in the North Atlantic (DANCE campaign off the U.S. East Coast) are presented. Surface NO2 mixing ratios estimated from Pandora and from OMI are compared to data from a co-located surface NO2 analyzer at the land site, 25 January to 15 March 2011, and on the research vessel, 29 July to 15 August 2014, respectively. An adaptation of Knepp et al. (2015), Knepp–Kollonige method, has been used for deriving surface NO2 values from OMI and Pandora for the comparison to surface NO2 measured at both the terrestrial and marine environments.
With PBL height as a key variable in converting the column NO2 amount to a surface value, the best PBL observations used in the estimations may vary. Of the three available approaches for calculating PBL height at Welgegund, two are in good agreement with an overall bias of ~10%: a new product, derived from AIRS observations; and PBL measured by radiosondes at Irene, 150 km northeast of Welgegund. This encouraging result appears to be one of the first evaluations of the AIRS-derived PBL product over land. During DANCE, however, the AIRS PBL product was the only available PBL observation, and no other comparisons could be made to determine performance over ocean.
The two main sources of error for the conversion of column to surface NO2 are (1) poor weather conditions (e.g., cloud cover and precipitation) and (2) PBL height estimates, both affecting the NO2 column–surface relationship and instrument sensitivities to boundary layer NO2. Due to a higher-than-normal amount of cloud cover associated with a La Niña in early 2011, which results in higher than average rainfall in South Africa, only 22 days of observations are suitable for the Welgegund analysis. Nitrogen dioxide concentrations near J-P were sensitive to the weather conditions of a La Niña, as suggested by Balashov et al. (2014). Comparisons of coincidental, quality-controlled OMI and Pandora total column NO2 show relatively good correlation (i.e., R = 0.44) with agreement of ~20%, comparable to previous studies under similar conditions. Absolute differences between Pandora-estimated surface NO2 and the ground measurements are ~0.5 ppbv; however, OMI typically underestimated TC NO2, causing estimates to be biased low by 1 ppbv or more. Pandora estimates compare moderately well with low percent differences, but a low correlation with the surface measurements (R = 0.10) demonstrates the influence of mixing within the PBL and the importance of the selected PBL height observations on the NO2 column–surface relationship. Flynn et al. (2014) found similar results at sites with minimal local sources.
For the DANCE campaign, the Knepp–Kollonige method is less successful. The large variability in the co-located AIRS-derived PBL contributes to large differences between the Pandora and OMI estimates as compared to the in situ ship measurements. Additional PBL observations would have been required to determine fully the influence of this variable in the estimates. In addition, due to both Pandora and OMI encountering mostly clean marine air with NO2 concentrations at or below background thresholds detectable by both instruments, their NO2 estimates differ from in situ ship measurements by more than 3 ppbv. This difference suggests that estimates using this method are more reliable in environments with elevated NO2 concentrations due to frequent pollution transport and/or local sources. To evaluate this method and hypothesis further, sites and/more campaigns with longer time series and multiple PBL observations are necessary; we are examining other data for this purpose (i.e., DISCOVER-AQ campaign).
Estimating surface NO2 from satellite observations, as we have demonstrated, can be a challenging problem. The temporal and spatial resolutions of observations from current instruments, such as OMI, add difficulty to the analysis and validation process when limited by few co-located observations over a single specific site. This emphasizes the need for instruments like TROPOMI, launched October 2017, with finer spatial observations and TEMPO (and other geostationary instruments) with higher temporal and spatial resolution for this type of air quality analysis. This work also indicates the need for the careful consideration of the location and coverage (i.e., networks) of ground-based instrumentation for validating satellites.
Acknowledgments
This research was funded through several NASA grants initially to Penn State University (AQAST, NNG11AQ44G; Aura Validation, NNGQ5GO62G; SHADOZ, NNXQ9AJ23G) and also the National Science Foundation Division of Ocean Sciences (OCE 1260574) for the shipboard Pandora measurements. The deployment of Pandora at Welgegund was made possible by the hospitality of the North-West University—Potchefstroom and of a J. W. Fulbright Scholar award to A.M. Thompson. Data provided by OMI (doi: 10.5067/Aura/OMI/DATA2017) and AIRS (doi: 10.5067/AQUA/AIRS/DATA207) science teams are from NASA DISC. Irene data courtesy of an ongoing scientific collaboration with Gerrie Coetzee and the Observations Department from the South African Weather Service (SAWS) in conjunction with SHADOZ (Upper Air Research Program, K. W. Jucks). A special thank you to the Pandora team at GSFC for comments and recommendations on this work.