NRLMSIS 2.0: A WholeAtmosphere Empirical Model of Temperature and Neutral Species Densities
Abstract
NRLMSIS® 2.0 is an empirical atmospheric model that extends from the ground to the exobase and describes the average observed behavior of temperature, eight species densities, and mass density via a parametric analytic formulation. The model inputs are location, day of year, time of day, solar activity, and geomagnetic activity. NRLMSIS 2.0 is a major, reformulated upgrade of the previous version, NRLMSISE00. The model now couples thermospheric species densities to the entire column, via an effective mass profile that transitions each species from the fully mixed region below ~70 km altitude to the diffusively separated region above ~200 km. Other changes include the extension of atomic oxygen down to 50 km and the use of geopotential height as the internal vertical coordinate. We assimilated extensive new lower and middle atmosphere temperature, O, and H data, along with global average thermospheric mass density derived from satellite orbits, and we validated the model against independent samples of these data. In the mesosphere and below, residual biases and standard deviations are considerably lower than NRLMSISE00. The new model is warmer in the upper troposphere and cooler in the stratosphere and mesosphere. In the thermosphere, N_{2} and O densities are lower in NRLMSIS 2.0; otherwise, the NRLMSISE00 thermosphere is largely retained. Future advances in thermospheric specification will likely require new in situ mass spectrometer measurements, new techniques for species density measurement between 100 and 200 km, and the reconciliation of systematic biases among thermospheric temperature and composition data sets, including biases attributable to longterm changes.
Key Points
 A major, reformulated upgrade to NRLMSISE00 is presented using extensive new data sets from the ground to ~100 km altitude
 Vertical structure of the atmosphere is now selfconsistently coupled; O density now extends down to 50 km
 New model is warmer in upper troposphere, cooler in stratosphere and mesosphere; thermospheric N_{2} and O densities are lower
1 Introduction
An empirical atmospheric model provides a description of the average spatiotemporal behavior of atmospheric state variable observations via a parameterized analytical formulation, often with physical constraints (e.g., Emmert, 2015b). Input arguments to empirical atmospheric models typically include geographic location, day of year, time of day, and external drivers such as solar and geomagnetic activity (e.g., Bruinsma, 2015; Drob et al., 2015; Hedin, 1987; Oberheide et al., 2011). Empirical models play several indispensable roles in atmospheric research, data analysis, specification, and prediction, particularly in upper atmospheric applications. They provide a condensed representation of the historical record of observations and thereby serve as benchmarks for testing new observations and techniques. They are extensively used as a background reference for retrieving atmospheric state variables from raw measurements such as radiances. They also often provide initial and boundary conditions for first principles models. Most directly, they are used to specify and forecast the atmosphere, particularly when contemporary or realtime data are sparse or nonexistent.
The Mass Spectrometer Incoherent Scatter radar or MSIS® series of empirical models describes atmospheric temperature, number densities of eight species, and mass density. The acronym derives from the spacebased mass spectrometer and groundbased incoherent scatter radar (ISR) measurements on which the model was originally based. First developed at NASA's Goddard Space Flight Center, the model began as a thermospheric model and was eventually extended down to the ground (Hedin, 1987, 1991; Hedin, Reber, et al., 1977; Hedin, Salah, et al., 1977). In the late 1990s, development continued at the Naval Research Laboratory and resulted in NRLMSISE00 (Picone et al., 2002). That release was focused primarily on the thermosphere and included the assimilation of new middle thermosphere O_{2} data, more recent ISR measurements, mass density data derived from satellite orbit decay, and the introduction of an anomalous oxygen component that accounts for additional mass in the upper thermosphere attributed to a hot population of atomic oxygen and atomic oxygen ions.
An inherent limitation of NRLMSISE00 was the lack of satellite data to define the composition and structure of the atmosphere below 100 km. Motivated by this deficiency, NRLMSIS 2.0 assimilates extensive new (since 2000) measurements and analyses of temperature in the mesosphere, stratosphere, and troposphere, as well as many years of new atomic oxygen (O) and atomic hydrogen (H) measurements in the mesosphere. We also tuned the model's upper thermosphere to better match new orbitderived mass density measurements. Major changes to the model formulation have been implemented, including a temperaturedependent connection between densities in the lower/middle atmosphere and the thermosphere; in NRLMSISE00 and earlier versions, thermospheric densities were treated independently from the lower layers, with a posteriori joining of the upper and lower profiles. NRLMSIS 2.0 densities are fully coupled to temperature from the ground to the exosphere via a hydrostatic/diffusive equilibrium profile. In this sense it can be regarded as a wholeatmosphere empirical model.
In conjunction with the mathematical reformulation, the source code was rewritten in Fortran 90 with modern programming practices. The code is available in the supporting information of this paper and in the repository listed in the acknowledgments. Version numbers will now be decimal based instead of year based. Besides producing a new reference model for the whole atmosphere, this study is also implicitly a scientific analysis of the extant modern atmospheric database, especially below the thermosphere. Because the model synthesizes the database and normalizes out the systematic agreement among data, the subsequent comparison to the individual data sets reveals systematic disagreements among the data sets. Such analysis is a major component of this study.
The new model formulation is presented in section 2. Section 7 describes the data sets used to tune and validate the model and the random sampling procedure. Section 10 describes the fitting procedure to estimate the model parameters, and section 18 summarizes statistical comparisons of the model to independent random samples of the data. In section 19, we examine mutual biases among the model and data sets and discuss scientific issues that the model and its development have illuminated. Conclusions and future development plans are summarized in section 25. Herein, NRLMSISE00 and NRLMSIS 2.0 are often shortened to “MSISE00” and “MSIS 2.0” for brevity; “MSIS” collectively refers to all versions of the model.
2 Model Formulation
This section summarizes the mathematical formulation of MSIS 2.0, highlighting important changes from MSISE00. We focus on the vertical temperature and density parameterizations. Additional details of the other formulation changes are provided in the appendix and supporting information. In the new formulation, we aimed for three characteristics: a connected and continuous representation of the model components (temperature and species number densities) from ground to space, a robust closedform solution to the speciesbyspecies hydrostatic integral coupling temperature to species density, and sufficient flexibility to accommodate variations evident in the data, as described below.
2.1 Geopotential Height
MSIS 2.0 internally uses geopotential height (ζ) as the vertical coordinate, which simplifies the hydrostatic integral term in the density profiles (section 4) while fully accounting for height and latitudedependent gravity. All reference heights and spline nodes are defined in terms of geopotential height. In contrast, earlier versions specify reference heights and nodes on geodetic altitude levels and compute geopotential changes within each geometric shell. This change will be transparent to most users, because the input argument to the model code is geodetic altitude by default. However, the code includes an option to input geopotential height directly, which may facilitate comparison with firstprinciples models and data sets that are given on a geopotential grid.
We calculate geopotential height as function of geodetic altitude and latitude, relative to the World Geodetic System 1984 (WGS84) reference ellipsoid (National Imagery and Mapping Agency, 2000) and associated gravitational field, which excludes longitudinal variations and zonal harmonics higher than order 2. The details of the calculation are provided in the appendix. A key parameter is the reference gravity, which scales the geopotential to a length; its value is somewhat arbitrary but must be applied consistently. We use the standard gravity value of 9.80665 m/s^{2}, which is the standard for meteorological observations (WMO, 2014, Part I, Chapter 12).
2.2 Vertical Temperature Profile
The node spacing of the Bsplines is 5 km below a height of 85 km, increasing to 10 km above 102.5 km. At the joining altitude ζ_{B} = 122.5 km, the profile is constrained to be C2 continuous (continuous in the zeroth, first, and second derivatives), so that the three Bates parameters (T_{ex}, T_{B}, σ) determine the last three Bspline coefficients (and thereby influence the temperature down to 92.5 km). The profile is defined by 24 parameters: the first 21 Bspline coefficients and the 3 Bates parameters. Figure 1 illustrates the global average temperature profile in MSIS 2.0 and the constituent Bsplines. Unless otherwise specified, “global average” herein means including only the lead term of the expansion described in section 5, that is, annual average, moderate solar activity (F_{10.7} = 150) and quiet geomagnetic activity (Ap = 4).
MSISE00 also uses cubic splines and the Bates profile to represent temperature. However, there are several important differences in their application. First, MSIS 2.0 uses Bsplines, whereas MSISE00 uses spline interpolation among specified temperatures at each node. The use of Bsplines renders MSIS 2.0 linear with respect to the model parameters in the spline region, which facilitates fast and robust least squares fits to data (see section 10). Second, MSIS 2.0 is C2 continuous (continuous through the second derivative) throughout the entire model domain, whereas MSISE00 segments the atmosphere into four regions with C1 continuity (continuous through the first derivative) across the boundaries (32.5, 72.5, and 120 km). Third, the shape of the Bates profile and the meaning of the Bates temperature gradient parameter are slightly different in MSIS 2.0, due to its use of a global geopotential height coordinate rather than the geopotential height difference referenced to 120 km (Hedin, 1987, Equation A4a). Finally, MSIS 2.0 adds additional nodes to the parameterization, which provides increased vertical resolution: There are 24 vertical temperature parameters in MSIS 2.0, compared with 17 in MSISE00. The increased resolution is supported by the new data sets described in section 7.
2.3 Vertical Density Profiles
The first term n_{0} on the righthand side of Equation 2 scales the entire profile and (in the absence of chemical and dynamical corrections) is equal to the species density at the fiducial height ζ_{0}. The second term is the hydrostatic integral, which includes an effective mass profile that is described below. The third term represents the ideal gas law; this term and the hydrostatic integral couple the temperature profile to the density profile (cf. Emmert, 2015b; section 7). The fourth term is a Chapmanlike bottomside chemical loss term (Chapman, 1931, Equation 40); the model applies this term to O, H, and N, which experience photochemical production and loss similar to that of the ionosphere. The last term is a logistic function (expressed in hyperbolic tangent form) used for chemical and/or dynamical perturbations. As described in section 10, we also use the logistic correction term to relax upper thermospheric MSIS 2.0 densities to MSISE00. Each of the two correction terms is defined by three parameters: an amplitude (C or R), a reference height (ζ_{C} or ζ_{R}), and a scale height (H_{C} or H_{R}). While the chemical loss term has an upper asymptote of zero and is unbounded on the lower end, the chemical/dynamical correction term has a lower asymptote of zero and an upper asymptote of R. For the chemical/dynamical correction, we additionally accelerate the relaxation to the lower asymptote by applying another hyperbolic tangent taper γ(ζ) to H_{R}; this prevents the correction from projecting downward into the lower atmosphere.
Figure 2 illustrates the global average effective mass profile for N_{2}. The theoretical basis for the approach is discussed in Picone et al. (2016). The choice of a piecewise linear effective mass profile allows the hydrostatic integral to be evaluated in closed form via integration by parts, as detailed in the supporting information (Text S1); in the Bates profile region, this requires the dilogarithm function, which we calculate using the algorithm described in Ginsberg and Zaborowski (1975).
The same spline nodes as the temperature profile (Equation 1) are used. At the joining height ζ_{SO} = 85 km, C1 continuity is imposed, so that the last two spline coefficients are determined from the hydrostatic profile parameters (Equation 2). The O profile is defined down to a geopotential height of 50 km.
In summary, the model defines a density profile of a particular species in terms of up to 35 (possibly fitted and/or variable) parameters: 24 temperature parameters and 1 pressure parameter (which are common to all species); 1 mixing ratio or reference density; 3 mass profile parameters; and up to 6 correction parameters (C and R terms in Equation 2). The O profile additionally uses eight unconstrained spline coefficients between 50 and 85 km. Figure 3 shows the global average density and mixing ratio profiles for seven species represented by the model.
The major difference between the MSIS 2.0 species density profile and earlier versions is the treatment of the transition region (~70–200 km) between the fully mixed lower and middle atmosphere and the diffusively separated upper thermosphere. Earlier versions compute separate mixed and diffusive profiles, with the diffusive profile reference density defined at 120 km, then combine the two profiles using a geometric average (Hedin, 1987, Equation A12a). This decouples the thermosphere from temperature variations in the lower and middle atmosphere. In contrast, the effective mass profile in MSIS 2.0 couples, via the hydrostatic term in Equation 2, several species densities (N_{2}, O_{2}, He, and Ar) to the entire temperature profile and other species densities (O, H, and N) to the temperature profile down to reference altitudes in the mesosphere and lower thermosphere. Thus, in MSIS 2.0, species densities in the thermosphere are affected by temperature variations in the lower and middle atmosphere.
MSIS 2.0 eliminates the thermal diffusion term that in earlier versions was applied to He, Ar, and H. As discussed in Picone et al. (2016), the inclusion of thermal diffusion is not physically consistent with static composition profiles. Furthermore, we found that the effect of thermal diffusion in MSISE00 is statistically unsupported by available He, Ar, and H data in lower and middle thermosphere, where vertical temperature gradients are sufficiently large for thermal diffusion to be active.
However, in MSISE00 and earlier versions, this term is used to produce specified mixing ratios near the turbopause, whereas in MSIS 2.0 this term is an upward projecting correction that represents departures from the hydrostatic/diffusive profile, such as winter bulges in the lighter species (e.g., Reber & Hays, 1973; Sutton, 2016). In MSIS 2.0, this term thus effectively takes on the role of the 120 km reference density in earlier versions, which represented such low frequency perturbations (Picone et al., 2013).
Earlier versions of MSIS use logistic function terms to represent chemical loss of O, H, and N and produce peaks in those species. MSIS 2.0 instead uses the simpler Chapmanlike exponential term in Equation 2, which, unlike the logistic term, does not relax to a lower asymptote (earlier versions impose lower limits of these species' profiles near or above the inflection point of their chemical loss logistic terms). MSIS 2.0 additionally extends the O profile down to 50 km using cubic Bsplines (Equation 7).
MSISE00 includes an anomalous oxygen component in the upper thermosphere, which nominally represents O^{+} and hot O contributions to total mass density (Picone et al., 2002). This population is represented in the model with a separate, fixed temperature of 4000 K. We transferred the anomalous O component to the MSIS 2.0 model, accounting for the new geopotential height calculation but otherwise without modification.
2.4 Expansion of Vertical Profile Parameters
Each of the vertical temperature and density profile parameters is (or can be) expanded as a function of spherical harmonics in latitude and local time or longitude, solar zenith angle, harmonics in day of year, polynomials of the F_{10.7} solar activity index (10.7 cm solar radio flux; Tapping, 2013), and a geomagnetic activity function. This expansion is largely the same as in MSISE00 and earlier versions, as detailed in Hedin (1987, appendix, Equation A22). MSIS 2.0 introduces sigmoid terms in solar zenith angle (for daynight changes in mesospheric O and H) and a solar cycle modulation of the global annual and semiannual oscillations.
One important change in MSIS 2.0 is that the trigonometric terms in the expansion are split into their sine and cosine components (rather than the phase and amplitude parameters estimated in earlier versions for some of the variations). This linearizes the expansion with respect to the model parameters, facilitates the computation of the terms, and makes the parameter estimation process somewhat more robust. Some nonlinear terms have been retained from MSISE00, including the solar cycle modulation of groups of variational terms, geomagnetic activity, UT terms, mixed UT/longitude, and mixed UT/longitude/geomagnetic activity terms.
The full expansion is detailed in the supporting information (Text S2). The expansion terms actually used for each vertical parameter and the parameter values are compiled in Data Set S1 in the supporting information.
3 Data
Table 1 summarizes the data sets and random samples we used to estimate the parameters of MSIS 2.0; acronym definitions are given at the end of the text. Most of the data are temperature measurements or reanalyses in the troposphere, stratosphere, mesosphere, and lower thermosphere. We also used mesosphere and lower thermosphere (MLT) O data from TIMED/SABER and Odin/OSIRIS, MLT H data from SABER, and orbitderived upper thermospheric global average mass density data. In addition to these data, we also constructed synthetic MSISE00 thermospheric data sets in order to relax the fits to the MSISE00 thermosphere. In some cases, MSISE00 thermospheric parameter values were directly mapped to the MSIS 2.0 formulation, as described in section 10.
Data set or station/instrument  Latitude, longitude  Altitude (km)^{a}  Local Time^{b}  Years  No. Days^{c}  No. Obs (10^{3})  Reference 

Temperature  
Reanalysis  
CFSR  90S–90N  0–30  0000–2,400  2002–2018  6,200  3,163  Saha et al. (2014) 
MERRA2  90S–90N  0–55  0000–2,400  2002–2018  6,209  3,211  Gelaro et al. (2017) 
Microwave  
Aura/MLS  82S–82N  10–85  0145, 1,345  2005–2014  3,589  4,940  Schwartz et al. (2008) 
Solar Occultation  
ACE/FTS  85S–87N  15–102.5  Sunrise/set  2004–2013  2,436  5,068  Bernath (2007) 
UARS/HALOE  77S–77N  37.5–102.5  Sunrise/set  2001–2005  880  2,769  Russell et al. (1993) 
AIM/SOFIE  83S–89N  55–102.5  Sunrise/set  2007–2018  3,460  2,696  Marshall et al. (2011) 
Na Doppler Lidar  
Andes  30S, 71 W  88–105  Night  2010–2014  51  645  Liu et al. (2016) 
Boulder  40 N, 105 W  88–105  Night  2011–2014  198  240  Smith and Chu (2015) 
Ft. Collins  41 N, 105 W  88–105  0000–2,400  1995–2010  804  244  Krueger et al. (2015) 
Logan  42 N, 112 W  88–105  0000–2,400  2010–2014  254  246  Krueger et al. (2015) 
ALOMAR  69 N, 16E  88–105  Night  2003–2008  27  582  She et al. (2002) 
Infrared  
TIMED/SABER  83S–84N  40–97.5  0000–2,400  2002–2016  5,060  5,817  Mertens et al. (2002) 
Odin/OSIRIS  90S–90N  70–102.5  0650, 1850  2007–2012  1,848  1,808  Sheese et al. (2010) 
Atomic Oxygen  
TIMED/SABER  83S–83N  50–100  0000–2,400  2002–2013  3,964  3,058  Mlynczak, Hunt, Mast, et al. (2013), Mlynczak et al. (2013) 
Odin/OSIRIS  90S–90N  75–100  0650, 1850  2007–2012  1,865  965  Sheese et al. (2011) 
Atomic Hydrogen  
TIMED/SABER  83S–83N  75–100  0000–2,400  2002–2013  3,963  3,751  Mlynczak et al. (2014) 
Mass Density  
Orbitderived  Global Ave  400–575  Diurnal Ave  1986–2005  7,305  7,305  Emmert (2015a) 
 The numbers shown in this table refer to the fitting ensembles described in section 8. For temperature, O, and H data, the number of days and observations are the aggregate of all 15 ensembles.
 ^{a} For temperature, the altitude ranges indicate the centers of the tapered probability distribution used to generate the samples. Otherwise, they indicate the discrete range of altitudes used in the fit.
 ^{b} For sunsynchronous orbits, the approximate local times of equator crossings are given.
 ^{c} Number of unique observation days in the sample.
To validate and analyze the model results, we used additional, independent random samples of the same data sets listed in Table 1. We also compared the model with middle thermospheric temperature data from Envisat/MIPAS and the Millstone Hill ISR, and with upper thermospheric mass density data from the CHAMP and GOCE accelerometers (see sections 22 and 23).
Brief descriptions of each data set are provided in section 7. The generation of random samples for fitting and validation is described in section 8. All of the data samples used to estimate the model parameters are available in the repository listed in the acknowledgments.
3.1 Data Sets
The temperature data sets listed in Table 1 are grouped into five measurement types: reanalysis, microwave, solar occultation, groundbased sodium (Na) Doppler lidar, and infrared. The O and H data sets are from infrared instruments, and the mass density data are derived from archived satellite and debris orbit data.
CFSR (Saha et al., 2010, 2014) is a reanalysis product of the National Centers for Environmental Prediction. It assimilates global meteorological data into an atmospheric model and outputs gridded atmospheric fields. CFSR version 1 (Saha et al., 2010) covers the years 1979–2011, and version 2 (Saha et al., 2014) covers the years 2011 to present. We used CFSR temperatures from 2002 to 2018: 6hourly output at universal times 0, 6, 12, and 18 hr, on a 0.5° latitudelongitude grid and a 37level pressure grid (1,000 to 1 hPa).
MERRA2 (Gelaro et al., 2017) is a reanalysis product of NASA's Global Modeling and Assimilation Office that covers the years 1980 to present. We used MERRA2 temperatures from 2002 to 2018: 3hourly output, on a 0.5° latitude × 0.625° longitude grid and a 72level hybrideta grid (surface to 0.01 hPa).
MLS on the NASA Aura satellite (Waters et al., 2006) has been providing ~3,500 profiles per day of temperature, geopotential height, and a suite of trace gases since August 2004, from a sunsynchronous orbit. We used version 4.2 data (Livesey et al., 2017; Schwartz et al., 2008), which provides retrieved temperature on 42 fixed pressure surfaces (261–0.001 hPa), with temperature information coming primarily from the 118.75 GHz oxygen line. Vertical resolution in the mesosphere and lower thermosphere ranges from 6 to 13 km, becoming coarser with increasing altitude.
ACE/FTS is a high spectral resolution Fourier transform spectrometer (Bernath, 2007) that covers wavenumbers from 750 to 4,400 cm^{−1} (2.2 to 13.3 μm). ACE/FTS operates in solar occultation mode to provide altitude profiles of temperature, pressure, atmospheric extinction, and the volume mixing ratios for several dozen molecules. We used version 3.5 temperatures, which are given on a 1 km vertical grid from 0 to 150 km with a typical vertical resolution of about 3 km. The 15–102.5 km data used here are derived from the relative intensity of CO_{2} lines.
UARS/HALOE recorded solar occultation measurements from October 1991 until its deactivation on 21 November 2005. The limb transmission measurements were used to infer profiles of temperature, as well aerosol extinction and mixing rations of seven species (Russell et al., 1993). Temperatures from ~35 to ~85 km altitude were retrieved from CO_{2} transmissions measured at 2.80 μm wavelength, using modeled CO_{2} mixing ratio profiles in the forward simulations; the effective vertical resolution is ~3 km. Above ~85 km MSIS temperatures were appended, primarily to enable the NO channel retrievals. HALOE temperatures were validated by Harries et al. (1996) and McHugh et al. (2005), indicating agreement with correlative measurements to within the uncertainties for altitudes of ~35 to 75 km. We used version 19 temperature data.
AIM/SOFIE (Gordley et al., 2009; Russell et al., 2009) has operated from 2007 to present. SOFIE measurements at 16 wavelengths are used to retrieve temperature, as well as 5 species, polar mesospheric clouds, and meteoric smoke. SOFIE retrievals are reported on a 200 m vertical grid with an effective vertical resolution of 2 km. The version 1.3 SOFIE temperature retrievals used here (~55–100 km) are based on CO_{2} transmission measurements at 4.32 μm (Marshall et al., 2011). SOFIE temperature validation reported by Stevens et al. (2012) and Hervig et al. (2016) indicate agreement with correlative measurements to within the uncertainties from ~30 to 95 km altitude.
The Na Doppler lidars listed in Table 1 share the same threefrequency Doppler lidar techniques summarized in Chu and Papen (2005) and references therein. Because of high collision rate, meteoric Na atoms in the mesosphere are believed to be in thermal equilibrium with the ambient atmosphere. By detecting the Dopplerbroadened D2 absorption spectral line of Na atoms at three fixed frequencies and taking the ratios among the threefrequency returns, temperatures and winds in the MLT are inferred simultaneously from the Doppler broadened linewidth and the Doppler frequency shift. The development of Faradayeffectbased daytime filters enabled daytime measurements by several Na Doppler lidars (Arnold & She, 2003; Chen et al., 1996; Krueger et al., 2015; Smith & Chu, 2015).
The Andes Lidar Observatory (ALO) is located in Chile at the Cerro Pachon Mountain astronomy facility, which provides yearround clear viewing conditions (around 300 clear nights per year). The construction of the ALO building was funded by the University of Illinois at UrbanaChampaign. The ALO resonant Na wind/temperature lidar system (Liu et al., 2016) provides temperature profiles from 75 to 140 km altitude, with 1 min temporal resolution and 500 m vertical resolution.
The University of Colorado STAR Na Doppler lidar obtained very highresolution data (Lu et al., 2015, 2017; Smith & Chu, 2015) at Table Mountain near Boulder, with the raw photon counts collected in resolutions of 3–9 s and 24 m. The effective temporal and vertical resolutions are 7.5 min and 0.96 km, respectively, for temperature profiles used in this paper, and the measurement uncertainties in the temperatures are ~0.3–1 K near the Na layer peak. The uncertainties in the winter months are usually smaller than those in the summer months due to the higher winter Na abundance.
The Na Doppler lidar at Ft. Collins, Colorado, operated from 1990 to 2010 and was relocated to Utah State University in Logan, Utah, in summer 2010, where it has been operating ever since. It measures the temperature and winds from ~80 to 105 km in full diurnal cycles (Krueger et al., 2015). The data used for this study have temporal and vertical resolution of 1 hr and 2 km, respectively.
The ALOMAR Na windtemperature lidar operated from 2000 to 2017 at the Andøya Space Center as a U.S./Norwegian partnership (She et al., 2002). The lidar design was largely based on the Fort Collins Na lidar. Temporal and vertical resolutions are typically 2 min/1 km in the winter nighttime from 78 to 105 km and 15 min/2 km in summer daytime from 85 to 97 km. The data used in this study consist of 2003–2008 nighttime measurements averaged at 1 h and 1 km resolution.
TIMED/SABER is a limb scanning radiometer that records vertical profiles of infrared emission in 10 different spectral channels (Russell et al., 1999); it has operated from December 2001 to present. The specific channels on SABER enable a detailed assessment of the thermal structure, composition, and energy budget of the mesosphere and lower thermosphere (Mlynczak, 1996, 1997). In particular, SABER measures emission from carbon dioxide in the vicinity of 15 μm for the purpose of deriving kinetic temperature (Mertens et al., 2002). O and H are crucial to the derivation of the energy budget in the vicinity of the mesopause (Mlynczak & Solomon, 1993). SABER derives O and H densities using photochemical relationships specific to night and day (Mlynczak et al., 2013, 2014). SABER has a channel near 2.0 μm measuring emission from highly excited hydroxyl (OH) formed by the reaction of H and O_{3} which is used in the derivation of H both day and night and in the derivation of O at night. We used version 2.0 SABER T, O, and H data; a later version of nighttime O retrieval produces smaller peak densities (Mlynczak et al., 2018). Panka et al. (2018) also developed an algorithm for nighttime O that agrees well with Mlynczak et al. (2018). The updated nighttime O will be used in future MSIS development.
The Odin/OSIRIS optical spectrograph (McLinden et al., 2012) measures vertical profiles of 280–800 nm emissions from 7 to 110 km altitude with a vertical resolution of 2 km and a spectral resolution of 1 nm; it has operated from 2001 to present. Temperatures in the MLT are derived from the O_{2} Aband emission (Sheese et al., 2010). Daytime O density is inferred iteratively in conjunction with O_{3} using a photochemical forward model of the Aband emission (Sheese et al., 2011). Nighttime O density is derived from Aband emission associated with O recombination (Sheese et al., 2011). Odin is in a sunsynchronous orbit; the equator crossings of the temperature and O data occur near 0700 and 1900 local time.
The orbitderived thermospheric mass density data consist of daily, global average mass density at altitudes from 250 to 575 km (Emmert, 2009, 2015a). The data cover the years 1967–2013 and are derived from twoline orbital element sets (TLEs) on ~5,000 objects (we also denote this data set as “TLE densities”). Following Weimer et al. (2018) and based on the ballistic coefficient estimates of Pilinski et al. (2011), we reduced the values in this data set by 7%. For MSIS 2.0, we used 1986–2005 data for fitting and the remaining years for validation.
The following data sets were not used to estimate the MSIS 2.0 model parameters but are used for independent comparison and analysis in section 19.
The Millstone Hill UHF ISR system (42.6 N, 288.5E, Apex magnetic latitude 54°) has been in operation since 1963. It provides observations of altitudinal profiles of several plasma parameters, including ion temperature, that are determined from the received signal power and spectrum; neutral temperature between 100 and 180 km altitude is derived from the ion temperature (Salah & Evans, 1973). An average ion mass of 31 amu is assumed at altitudes below 130 km, and an ion composition model is used above this altitude. Observations below ~180 km are limited to mostly daytime hours because of the low electron density at night. Availability of data below ~180 km greatly increased after 2002, when improved software radar design patterns (Grydeland et al., 2005) were implemented. We used all available lower and middle thermospheric neutral temperatures from 2002 to 2015: 311,000 observations (at 4 km altitude intervals) taken on 748 unique days.
Envisat/MIPAS measured spectrally resolved 5.3 μm nitric oxide limb emissions in the lower and middle thermosphere in its upper atmospheric observation mode during 2006–2012, from which kinetic temperatures and nitric oxide concentrations are jointly derived (BermejoPantaleón et al., 2011). We used all available version 622 temperature observations in the 105–170 km altitude range: a total of 1.86 million observations (at 5 km altitude intervals) taken on 334 days. The Envisat orbit was sunsynchronous, and the equatorward crossings of the MIPAS observations occurred near 1,015 and 2,215 local time.
CHAMP and GOCE total mass density data were derived from satellite accelerometer measurements, by making use of satellite aerodynamic and geometry models (Doornbos et al., 2010; March et al., 2019). The CHAMP data cover the time period January 2001 to September 2010; we used a random sample of 1.19 million observations on 1,926 days, excluding the anomalous solar minimum years 2005–2009. CHAMP was in a nearpolar orbit with an inclination of 87°.The GOCE data cover the time period November 2009 to October 2013; we used a random sample of version 2.0 data consisting of 0.84 million observations on 1,227 days. GOCE was in a nearsunsynchronous orbit with equator crossings near 0700 and 1900 local time.
3.2 Sampling Procedure
From the temperature data sets listed in Table 1, we assembled 30 random samples or ensembles. We used the first 15 ensembles for fitting via sequential estimation and the second 15 for validation. For the fitting ensembles (1–15) we additionally imposed tapered altitude restrictions on each data set, in order to avoid regions where a given data set is systematically biased relative to the other data sets (some examples are discussed in section 19) or is near the limits of reliability of the data set. The taper, which is intended to avoid sharp statistical gradients in the fitting procedure, was implemented via a hyperbolic tangent probability function with a scale height of 2.5 km. The altitude ranges given in Table 1 denote the centers of the lower and upper tapers; the fitted reanalysis samples extend all the way down to the ground with no lower taper. Note that the validation ensembles (16–30) include some additional data outside these height ranges.
Figure 4a shows the distribution of all the temperature data in the fitting ensembles (ensembles 1–15) as a function of altitude in 2.5 km bins; Figure 4b shows the same distributions but grouped by instrument type. The relative sizes of the samples were chosen subjectively, in order to obtain a balance among instruments and measurement types. Because there are no major discrepancies among the data sets (see section 19), the model results are not sensitive to the relative sample sizes. The distributions in Figure 4 represent the weight of each data set's or group's contribution to the model: In the fitting process, each of the sample observations is weighted equally. Figure S1 further illustrates the distribution of the upper mesospheric fitting ensembles as a function of local time and latitude.
We gave SABER the most weight in the mesosphere, in part because its full local time coverage is important for capturing tides. Where the three occultation data sets overlap (above ~60 km), we gave them approximately equal weight, except that we excluded HALOE observations poleward of 45° between 65 and 95 km, in order to avoid possible contamination by polar mesospheric clouds. The lidar sample is approximately evenly allocated among the three regions where the five instruments are located: Chile, Colorado/Utah, and Norway. In the upper mesosphere, the infrared, lidar, and occultation measurements have roughly equal weight. In the stratosphere, the Aura/MLS measurements are the largest component of the sample, and the reanalysis products provide almost all the data in the troposphere.
The random selection process did not exclude duplicates, so some observations will appear more than once in a given ensemble or across ensembles. For large data sets the statistical influence of duplicates is negligible. For small data sets the duplicates increase the influence of underrepresented measurement techniques. The total number of fitted observations for each data set is listed in Table 1, along with the number of unique days in the sample.
To generate the MSISE00 synthetic data in each fitting ensemble, we randomly selected a set of measurement dates and times from the constituent data sets and random locations on the sphere. We then evaluated MSISE00 at those times and locations on a fixed altitude grid (2 km intervals from 90 to 130 km). In this way, the MSISE00 data represent the same mix of solar activity, geomagnetic activity, and dayofyear conditions as the measurements.
We followed a similar procedure to generate 30 random ensembles of the O data sets and 30 random samples of the H data sets. For the fitting ensembles (1–15), the data were restricted to the altitude ranges listed in Table 1, without any tapering. The validation ensembles (16–30) include additional data slightly outside these ranges. Synthetic MSISE00 data were added to the fitting ensembles as described above, except that they were evenly distributed over 160–500 km altitude for O and 300–500 km for H. Figure 4c illustrates the altitude distribution of the combined O fitting ensembles. The SABER H data are approximately evenly distributed over the 75–100 km interval.
4 Model Parameter Estimation Procedure
In this section, we describe the procedure we used to set and/or estimate the MSIS 2.0 model parameters. Some parameters are set a priori, some are ported from MSISE00, and some are tuned to the fitting ensembles and/or MSISE00 synthetic data. Many of the vertical parameters are not expanded beyond their global values, and some are expanded only sparsely. The final parameter values are tabulated in Data Set S1, which can be consulted to determine which variations the model contains. There are 3,306 nonzero parameter values in MSIS 2.0, compared to ~1,300 in MSISE00.
4.1 Constant, A Priori Parameters
As discussed in section 2 and the appendix, MSIS 2.0 uses the WGS84 reference ellipsoid and a reference gravity value g_{0} = 9.80665 m/s^{2} to calculate geopotential height and in Equation 2. For the effective mass profiles (Equations 3–5) and the lower atmospheric mixing ratios (Equation 6), we used the values in Picard et al. (2008): dryair mean mass in the fully mixed region and species masses and mixing ratios of N_{2}, O_{2}, Ar, and He. We computed species masses of O and N by halving the N_{2} and O_{2} masses, and we set the mass of H to 1.0 Da.
For the three remaining effective mass profile parameters (transition height and lower and upper scale heights), we used values derived from MSISE00 global average profiles (with thermal diffusion and chemical/dynamical corrections turned off). For N_{2}, we additionally ported the MSISE00 turbopause height seasonallatitudinal variation and applied it to the transition height of the MSIS 2.0 N_{2} effective mass profile. Otherwise, the effective mass profiles do not vary with location or geophysical conditions.
The global average surface pressure P_{0} in Equation 6 was set so that the modeled lower tropospheric pressure matches the global average of the reanalysis data sets (CFSR and MERRA2), after subtracting out the water vapor partial pressure from the latter (MSIS is currently a dryair model). The surface pressure in MSIS 2.0 does not vary around P_{0} = 1002.692 hPa.
We set the chemical/dynamical correction reference heights and scale heights (ζ_{R}, H_{R}) to fixed values subjectively chosen to smoothly represent upper thermospheric departures from the terms in equation 2. Currently available data are insufficient to statistically constrain these parameters. The amplitudes of the chemical/dynamical correction terms were set or estimated in subsequent steps described in this section.
4.2 Linear Fit of Temperature up to 122.5 Km
After setting the constants a priori parameters, we estimated the 24 temperature spline coefficients (α_{i} in Equation 1) and selected expansions via a linear, ordinary least squares fit to data ensembles 1–15. The spline domain extends up to 122.5 km geopotential height, whereas the fitting data extend only up to ~105 km. The MSISE00 synthetic data anchored the fit over this data gap. The selected expansion includes latitude, day of year, local time, and longitude dependences; other variations were estimated in subsequent steps. Because the spline part of the temperature component is purely linear with respect to the model parameters (for the selected expansion), this fitting step is carried out via an iterative direct full matrix inversion (cf. Drob et al., 2015).
4.3 Merge Linear Temperature Fit With MSISE00 Thermospheric Parameters
Next, the MSISE00 Bates temperature parameters ( in Equation 1) and their expansions were mapped to the MSIS 2.0 formulation. We then combined the linearly fitted spline parameters with the Bates parameters to form the full MSIS 2.0 temperature construction. In this process, the top three spline parameters from the linear fit are essentially replaced with the Bates parameters, which determine those three spline coefficients via the continuity constraint. The MSIS 2.0 temperature component thus consists of the MSISE00 thermosphere and a new lower and middle atmosphere tuned to contemporary (mostly post2000) data. The MSIS 2.0 temperature between ~122.5 and 200 km cannot exactly match MSISE00, due to the differences in the geopotential height formulation that slightly affect the gradient and shape of the Bates profiles. The temperature differences between MSIS 2.0 and MSISE00 in this region are less than ~5 K and peak near 150 km, which is negligible compared to model uncertainty at these altitudes.
4.4 Subsequent Refinement of Temperature Parameters With Fitting Ensembles
With the linear fit, we found that the sunsynchronous data (MLS, OSIRIS, and the occultation data sets) introduced spurious semidiurnal variations below ~80 km. Therefore, we refined the semidiurnal tidal parameters with these data excluded. This tuning was conducted on the full model via unweighted, LevenbergMarquardt chisquare minimization (using ODRPACK95; Zwolak et al., 2007); although the retuned parameters are linear coefficients, this nonlinear estimation algorithm is more robust to tuning selected parameters while holding others constant. We tuned the semidiurnal parameters using each of the 15 fitting ensembles (without the sunsynchronous data) in sequence, with the parameters derived from one ensemble used as the starting estimate for the next ensemble. For the final parameter estimates, we computed the average of the parameters derived from ensembles 6–15 (i.e., omitting the results from the first five “spinup” ensembles).
Next, we extended the solar activity dependence of the temperature parameters down to ~70 km (in MSISE00, only the three Bates parameters and the temperature at 110 km vary with solar activity). We again used the 15 fitting ensembles (this time with all data sets) sequentially and averaged over the results from ensembles 6–15. The solar activity terms in the model are global; that is, they are not modulated by latitude or other variables. This is the last stage of the temperature parameter estimation process.
4.5 Tune Species Densities to MSISE00 Thermosphere
The initial species density profiles are defined by the temperature profile and the fixed mixing ratio and effective mass profile parameters described in section 10. For O, H, N, and anomalous O, we supplied initial guesses of the global average reference density and chemical loss term parameters; for O, we initially set its spline coefficients to a global, uniform value. We then tuned the chemical/dynamical correction amplitudes, and the chemical loss parameters of N and anomalous O, to approximately match MSISE00 in the upper thermosphere.
4.6 Tune O and H Data to Fitting Ensembles
Next, we tuned the O and H parameters using the 15 fitting ensembles for each species described in section 8. The fitting ensembles include synthetic MSISE00 upper thermospheric data, so that the MSISE00 thermosphere is approximately preserved while improving O and H in the mesosphere and lower thermosphere. As with the temperature tuning, we sequentially applied the ODRPACK minimization algorithm to each of the ensembles and then averaged the parameters estimated with ensembles 6–10. We found that the parameter estimates converged quickly within the first five ensembles, with little variation among ensembles 6–10.
4.7 Tune Upper Thermospheric O to OrbitDerived Mass Density Data
Finally, we further tuned the global intraannual variation (annual and semiannual) of upper thermospheric O to the 1986–2005 orbitderived mass density data set listed in Table 1 (the mass mixing ratio of O is ~60–95% between 400 and 575 km). This adjustment also includes a new modulation by solar activity, which is motivated by the results of Bowman et al. (2008) and Emmert and Picone (2010).
Except for this adjustment to O, the modeled upper thermospheric densities approximately match MSISE00; the difference is generally less than 10%. The upper thermospheric MSIS 2.0 O densities are ~10% less overall than MSISE00, as a result of the tuning to the orbit derived data. Additionally, the N_{2} chemical/dynamical correction (see section 4), which was tuned to match upper thermospheric MSISE00 N_{2} densities, is turned off by default in the MSIS 2.0 software, for reasons discussed in section 22.
5 Statistical Comparisons of Models to Data
In this section, we summarize statistical verifications that we conducted on MSIS 2.0 using the independent data ensembles (16–30) described in section 8, independent time intervals from the orbitderived thermospheric mass density data set, and CHAMP and GOCE thermospheric mass density data. Following the approaches used for MSISE00 (Picone et al., 2002), two statistical metrics of the dataminusmodel residuals are computed: the mean (which we also refer to as the bias) and the standard deviation. We computed these metrics for each data set and in selected altitude bins. For density quantities, we computed the residuals in natural log space (i.e., ln [data/model]), so that a residual of 0.1 corresponds to a dataminusmodel difference of ~10%. The bias indicates systematic differences between a data set and the corresponding model estimates, while the standard deviation indicates the agreement between the geophysical variations in the data and model (it also includes measurement noise).
One of our goals in constructing and tuning MSIS 2.0 was to produce a model that statistically performs at least as well as MSISE00 and better in most instances (i.e., smaller biases and residual standard deviations). This was accomplished by the assimilation of extensive new data sets in the lower and middle atmosphere and by largely retaining the MSISE00 thermosphere. Further development of the MSIS thermosphere will be the subject of future work. We computed the statistical metrics with respect to both MSISE00 and MSIS 2.0 and compared their values.
Table 2 shows residual standard deviations, of the independent ensembles with respect to MSISE00 and MSIS 2.0, in four broad altitude bins in the mesosphere and below. In almost all cases, the standard deviations with respect to MSIS 2.0 are smaller, indicating that MSIS 2.0 is capturing the geophysical variability in the data better than MSISE00. The MSIS 2.0 residual standard deviation values are typically 10–15 K in the upper mesosphere (75–100 km), 6–8 K in the lower mesosphere (50–75 km), and 5–6 K in the stratosphere and troposphere. The MSIS 2.0 standard deviations are typically 1–2 K smaller than MSISE00 in the mesosphere and ~0.5 K smaller in the stratosphere and troposphere. The only two instances in which the MSISE00 residual standard deviations are smaller occur at the upper altitudinal extent of the Aura/MLS and CFSR data sets, where these data are presumably less reliable (these data sets also show biases relative to the other data sets at their upper extent, as shown in the next session, and data from such regions were excluded from the fitting ensembles for that reason).
75–100 km  50–75 km  25–50 km  0–25 km  

Data set/station/instrument  MSISE00  MSIS 2.0  MSISE00  MSIS 2.0  MSISE00  MSIS 2.0  MSISE00  MSIS 2.0 
Temperature residual std. dev. (K)  
CFSR  2.6  3.0  5.4  5.3  4.7  4.1  
MERRA2  11.0  9.3  8.1  6.6  5.8  5.1  4.7  4.1 
Aura/MLS  9.1  10.1  7.3  6.6  5.9  5.3  5.0  4.5 
ACE/FTS  12.0  10.3  9.5  7.9  8.3  7.6  5.6  5.5 
UARS/HALOE  12.2  9.8  10.3  8.1  6.3  5.8  
AIM/SOFIE  12.9  11.2  9.7  8.6  9.7  9.2  
Andes Lidar  23.1  20.5  
Boulder Lidar  28.9  28.0  
Ft. Collins Lidar  15.1  14.0  
Logan Lidar  15.0  13.7  
ALOMAR Lidar  15.8  14.2  10.4  10.0  
TIMED/SABER  18.2  15.4  9.7  8.1  6.1  5.7  
Odin/OSIRIS  15.0  13.4  9.4  7.7  
ln (n_{O}) residual Std. Dev.  
TIMED/SABER  1.23  0.91  0.88  
Odin/OSIRIS  1.50  0.97  1.35  
ln (n_{H}) residual std. dev.  
TIMED/SABER  4.29  0.76 
The MSIS 2.0 residual standard deviations of O and H in the mesosphere are large (~0.9 ≅ a factor of 2) but are much smaller than with MSISE00. MSISE00 does not output O below 72.5 km, so MSIS 2.0 sets a new performance benchmark for O in this region.
More detailed results in 5 km altitude bins, including the bias statistic, are provided in Data Sets S2 (temperature), S3 (O density), and S4 (H density).
Data Set S5 contains residual statistics of the orbitderived mass density data set. The tuning of upper thermospheric O in MSIS 2.0 (section 16) resulted in lower residual standard deviations than MSISE00 not just in the 1986–2005 interval used for fitting but also in the independent 1971–1985 interval; the residual standard deviations during 2006–2013 are considerably larger with both MSISE00 and MSIS 2.0, due to the anomalous solar minimum that occurred during this period (Emmert et al., 2014). The MSIS 2.0 mass densities are ~10% lower than MSISE00, as a result of the recalibration of the MSISE00 thermosphere to the downward revised orbitderived density. Tuning to a later epoch (1986–2005) may have also contributed, due to observed longterm trends in density (e.g., Emmert, 2015a).
Data Set S6 contains residual statistics of the CHAMP and GOCE accelerometerderived mass density data. Above 400 km, the residual standard deviation is lower with MSIS 2.0 than with MSISE00, further indicating that the tuning of the global interannual variation is robust.
6 Scientific Results and Technical Issues
An important aspect of the model generation and validation process is the examination of datamodel biases as a function of altitude and other geophysical variables. As mentioned in section 18, the mean residual statistic indicates systematic differences (i.e., biases) between a data set and the corresponding model estimates. By extension, this metric also indicates biases among data sets, with the model acting to filter out common spatiotemporal geophysical variations, provided that (1) the model accurately captures the average variations or (2) the data sets sample approximately the same geophysical conditions. We also discuss some scientific and technical issues that arose during the development of the model and which are illustrative of the state of knowledge of the structure and average behavior of the atmosphere, particularly in the mesosphere and thermosphere.
6.1 Temperature and Pressure in the Mesosphere and Below
Figure 5 shows average dataminusmodel temperature residuals as a function of altitude from the ground to 105 km. The data are from the validation ensembles (16–30) described in section 8. It is evident from the plots that the mean residuals with respect to MSIS 2.0 (right column) are flatter than those with respect to MSISE00 (left column), indicating that MSIS 2.0 better captures the overall height dependence of the data. The residuals with respect to MSISE00 tend to be negative by up to ~10 K, except for the reanalysis data between 5 and 15 km altitude. This indicates that contemporary data in the stratosphere and mesosphere are colder than MSISE00, which is based on tabulated data from the 1970s and 1980s (Hedin, 1991). This shift is qualitatively consistent with studies of longterm trends in these regions (e.g., Garcia et al., 2019; Laštovička, 2017; Randel et al., 2016).
The mean residuals from the various temperature data sets are generally within ~3 K of each other, suggesting that there are no major systematic biases among the data sets. Exceptions are as follows. The Na lidar data tend to become increasingly warmer than the other data sets below ~85 km, by up to ~40 K (Figure 5d). SABER temperatures above the highlatitude summer mesopause (~90 km) are up to ~50 K warmer than the other data sets (Figure 5b). The CFSR data begin to deviate from the other data sets above ~35 km, and the MERRA2 data deviate strongly above ~70 km. In all of these cases, the outlying data were excluded from the fitting ensembles via the altitude selection described in section 8. Additionally, Aura/MLS data between 55 and 95 km are ~4 K cooler than the other data sets, as is MERRA2, which assimilates MLS data.
Figure 6 shows contours of MSIS 2.0 zonal mean temperature as a function of latitude and altitude (left column), as well as the change from MSISE00 (right column). As mentioned above, MSIS 2.0 is warmer overall than MSISE00 in the upper troposphere and cooler overall in the stratosphere and mesosphere. Regions where MSIS 2.0 is warmer include the low and middlelatitude upper mesosphere (particularly during winter) and the highlatitude lower mesosphere (particularly during winter). Presumably, the difference patterns are a consequence of the average differences between the ~2002 and 2018 data assimilated into MSIS 2.0 and the 1970s and 1980s data on which MSISE00 is based. The pattern of the annual average change from MSISE00 to MSIS 2.0 is similar to the ~1974 to 2003 change modeled by Solomon et al., 2018; Figure 1); some of the differences are likely attributable to the fact that Solomon et al. presented their results as a function of logpressure not altitude as in Figure 6.
Figure 7 highlights the highlatitude summer mesopause region in more detail; Figure S2 contains additional binaveraged plots of highlatitude mesopause region data and corresponding MSIS 2.0 results, as a function of day of year. Höffner and Lübken (2007) noted that potassium lidar measurements from Spitsbergen (78°N), taken during 2001–2003, showed a colder and higher summer mesopause minimum than MSISE00 (119 K vs. 132 K and 90 km vs. 88 km). The MSIS 2.0 summer minimum at 78°N is 126 K (Figure 7d), which is closer to the Spitzbergen lidar results, but the height of the mesopause is still lower, at ~87 km. The SABER and OSIRIS measurements show a mesopause height of ~90 km at this latitude (Figure 7b), which agrees with Höffner and Lübken (2007). At 69°, both the infrared and occultation data sets indicate a lower mesopause at 87–88 km, and here MSIS 2.0 is in better agreement with the data (Figure 7a), including daytime ALOMAR Na lidar data not used in the MSIS 2.0 fit (cf. Figure S3 with Figure 7c). This latitude dependence of the summer mesopause height was also noted by Höffner and Lübken (2007), who cited Lübken's (1999) analysis of falling sphere measurements at 69°N.
Capturing this shift in mesopause height with latitude would require additional expansion terms in MSIS; MSIS 2.0 includes zonal mean Legendre function terms up to degree 6. Additionally, the region above the summer mesopause contains the strongest vertical temperature gradients in the atmosphere, as the temperature transitions from a summer minimum in the mesosphere to a summer maximum in the thermosphere. This gradient is largely controlled by the Bates parameter (Equation 1), which in MSIS 2.0 is taken from MSISE00 and includes only lowresolution variations as a function of latitude and day of year. More flexibility in this parameter could improve the model's representation of the highlatitude summer mesopause, provided there are sufficient data near the inflection point (~120 km altitude) to constrain its value.
Figure S4 shows average dataminusmodel temperature residuals as a function of local time in selected altitude and latitude bins. As with the altitude dependence shown in Figure 5, the local time dependence of the mean residuals with respect to MSIS 2.0 are flatter than those with respect to MSISE00, indicating that MSIS 2.0 better captures the migrating tides in the data. Figure S5 compares the local time variation of the absolute measured and modeled temperatures near 95 km at middle and low latitudes, further demonstrating the improved tides in MSIS 2.0. Figure 8 illustrates the mesospheric local timelatitude structure in the two models, after subtracting out zonal means. At 95 km, MSISE00 shows a primarily semidiurnal pattern with a single peak near the equator, whereas MSIS 2.0 exhibits a mix of diurnal and semidiurnal variations with two peaks at ~40°S and 40°N. At 70 km, MSISE00 shows a highly structured semidiurnal variation during June solstice and a weak diurnal variation during equinox; the local time variation in MSIS 2.0 at this altitude is relatively weak in both seasons (<5 K).
Figure 9 shows mean residuals of logpressure from the reanalysis, after subtracting out the water vapor partial pressure from the data. The residuals with respect to MSISE00 become increasingly negative above 15 km as a result of the lower temperatures in the data (see Figure 5) that imply a more contracted atmosphere. The residuals with respect to MSIS 2.0 are largely flat and mostly less than 2%, indicating that the overall pressure structure in the MSIS 2.0 lower and middle atmosphere is consistent with the reanalyses. The bias near the surface is zero by design (see section 10) but then shifts to +1.1% above 5 km. This is because the dryair MSIS atmosphere has a smallerscale height in the lower troposphere than the moistair reanalyses (the presence of water vapor decreases the mean molecular mass; the lighter air is more expanded in altitude, resulting in higher partial pressures of all species at a given altitude). The residual standard deviation of the logpressure residuals with respect to MSIS 2.0 is ~1.3% near the surface, increasing to ~5.5% in the lower mesosphere (not shown).
6.2 Atomic Oxygen
For upper atmospheric applications, O is perhaps the most important and the most challenging species to represent. O is the dominant neutral constituent in the atmosphere at thermospheric altitudes and is the primary source atom for the F region ionosphere. However, in the MLT region, O is a minor constituent and sensitive to photochemical production and loss, dynamical transport, and diffusion (e.g., Jones et al., 2014, 2017; Smith et al., 2010; Swenson et al., 2019). Furthermore, these processes that determine the MLT O distribution vary over a wide range of spatiotemporal scales. MSISE00 and earlier version provided an accurate representation of thermospheric O from mass spectrometer data but did not include any global satellite measurements for O in the MLT region. Rather, several rocket profiles were used to extrapolate the O profiles into the middle atmosphere and provide an estimate for the value of the O peak in the MLT. Not surprisingly, significant differences between MSISE00 and the newer MLT satellite data sets have been reported. For example, Sheese et al. (2011) suggested caution when using MSISE00 for MLT O if one wants accuracy to better than an order of magnitude. By incorporating SABER and OSIRIS O data within the MLT region, MSIS 2.0 significantly ameliorates those earlier issues and provides a seamless representation of O from 50 km through to the upper thermosphere.
Figure 10 shows mean residuals of log O density measurements from SABER and OSIRIS, as a function of altitude. With respect to MSISE00, the OSIRIS mean residuals are within 0.2 (~22%) of zero, and the SABER mean residuals are ~0.3–0.6 (35–82%) larger than OSIRIS. Below 80 km, the MSISE00 O density falls off much more rapidly than the measurements, so the mean residuals of both data sets are very large. With respect to MSIS 2.0, the difference between the OSIRIS and SABER mean residuals is smaller, ~0.2–0.3 (22–35%), suggesting that MSIS 2.0 is accounting for some O variations that affect the mean residual via differences between the OSIRIS and SABER sampling patterns (e.g., sunsynchronous vs. precessing). Other comparisons, such as with SCIAMACHY (Kaufmann et al., 2014; Zhu & Kaufmann, 2019), have suggested that SABER is too high by about 30%. Mlynczak et al. (2018) discussed how modifications to the OH(v') kinetics scheme could lessen the biases and produce a consistent global energy budget. Above 75 km, MSIS 2.0 falls between the SABER and OSIRIS data but leans more toward SABER, as a result of the weighting shown in Figure 4c. Below 70 km, where there is no OSIRIS data, MSIS 2.0 follows the SABER data closely, on average.
Figure 11 is an update to Figure 12 of Sheese et al. (2011), who pointed out major discrepancies in the seasonallatitudinal variation of MSISE00 (we note that the Sheese et al. figure showed results for 0700 local time, not 1900 as indicated in their caption). Figure 11 shows that MSIS 2.0 properly captures the equatorial minimum and semiannual oscillation seen in OSIRIS. At midlatitudes MSIS 2.0 now yields an annual variation with the same phase (summer maximum) as the OSIRIS observations. At northern polar latitudes, while improved relative to MSISE00, MSIS 2.0 still does not fully capture the summer maximum seen in OSIRIS, but it is not clear if this is a significant feature of the OSIRIS data. At southern polar latitudes, OSIRIS does not show a clear summer maximum, and MSIS 2.0 is in better agreement with OSIRIS than in the north.
Figure 12 illustrates the seasonal dependence of midlatitude O profiles from SABER, OSIRIS, MSISE00, and MSIS 2.0. MSIS 2.0 exhibits a narrower and larger summer O peak than MSISE00. A larger peak during summer is consistent with the SABER and OSIRIS data (and with Figure 11); the narrower peak follows from the seasonal phase reversal to larger winter O density in the upper thermosphere. The SABER data do not form a welldefined peak in summer: The SABER values near 95 km are larger than MSIS 2.0, and these values extend upward to 105 km with little or no gradient. This feature of the SABER data is possibly associated with daytime overestimation of O_{3} at these altitudes; a corrective scheme is currently under development. SABER data above 100 km were excluded from the MSIS 2.0 fit.
During winter, MSIS 2.0 and MSISE00 show a very similar structure at the peak, but MSIS 2.0 is greater by almost an order of magnitude near 80 km. Above ~105 km, MSIS 2.0 and MSISE00 are fairly similar with MSIS 2.0 O values smaller by ~10%, as a result of tuning to upper thermospheric mass density data (see section 16).
Figure 13 compares the local time dependence of SABER and OSIRIS O with MSIS 2.0, at selected altitudes in the MLT. As one moves downward into the mesosphere, O transitions from being under dynamical control to under photochemical control. O photochemistry is driven by rapid daytime photolysis of ozone followed by rapid recombination after sunset. This midmesospheric diurnal variation has previously been analyzed and discussed by Siskind et al. (2015). MSIS 2.0 includes diurnal harmonics and a solar zenith angle transition function to represent the transition from dynamical control to photochemical control. At 65–70 km (Figure 13d), the model very accurately represents the daynight differences. At 75–85 km (Figures 13c and 13b), the SABER data begin to show a rounded daytime local time dependence suggesting the growing influence of transport; MSIS 2.0 captures this feature, although the model densities are ~50% smaller than SABER and ~30% smaller than OSIRIS in the postdawn sector. Near the O peak at 93–98 km (Figure 13a), there is no clear local time dependence in the data. As noted above, the SABER summer density near the peak is larger than both MSIS 2.0 and OSIRIS.
6.3 Atomic Hydrogen
With the inclusion of SABER H data, MSIS 2.0 provides a more accurate description of hydrogen variability in the MLT than MSISE00 (see Table 2 and Data Set S4) One important aspect of this variability is the reversal from a summer maximum at the mesopause (cf. Siskind et al., 2018) to a winter maximum in the upper thermosphere, commonly referred to as the “winter bulge” (Keating & Prior, 1968). Consistent with the known variability of light species, MSISE00 does have a winter maximum in the upper thermosphere but has very little seasonal variation in the MLT region (Qian et al., 2018). Figure 14 illustrates the improved seasonal variation by comparing MSISE00 and MSIS 2.0 in a format similar to Figure 5 of Qian et al. (2018). The figure shows binned averages of the SABER data and corresponding MSIS profiles extending from the mesosphere to the upper thermosphere. Both SABER and MSIS 2.0 show a Northern Hemisphere summer maximum (Southern Hemisphere results, not shown, are very similar) in the MLT; MSIS 2.0 has a winter maximum in the upper thermosphere that it inherited from MSISE00. The crossover altitude is about 150 km, in approximate agreement with the WACCMX results presented by Qian et al. (2018). However, there is currently no data available to accurately determine where this crossover occurs, which underscores the need for global neutral constituent measurements between 100 and 300 km.
An interesting aspect of the SABER data discussed by Mlynczak et al. (2014) is the inverse relationship between MLT H and solar activity. Mlynczak et al. (2014) ascribe this difference to relative roles of decreasing temperatures with decreasing solar activity and changes in the O/O_{3} ratio. Qian et al. (2018) point out that this solar cycle change was present in MSISE00 at all altitudes above 80 km, but not in WACCMX, which showed an alternating positivenegativepositive effect between 80 and 130 km. The solar cycle variation in MSIS 2.0 is generally consistent both with MSISE00 and SABER and differs from the WACCMX results shown by Qian et al. (2018). This is shown in Figure 15, which presents solar minimum and maximum averages and their differences. The SABER and MSIS 2.0 differences (Figure 15b) are consistently negative at all altitudes above 80 km.
6.4 Middle Thermosphere Temperature and Composition
Although several new middle and upper thermospheric temperature data sets have become available since MSISE00 was developed, we found that there is considerable variation among their mean residuals with respect to MSISE00. Thermospheric species and mass densities are highly sensitive to the entire thermospheric temperature profile, and any temperature adjustments to the model need to be consistent with observed density residuals (such as accelerometerderived mass densities). With some data sets, the mean temperature residuals show strong local time and/or latitude dependences that are not evident in available density data. On the other hand, available density data are insufficient to constrain the thermospheric temperature profile. Because of these issues, we chose to largely retain the MSISE00 thermosphere in MSIS 2.0 and to defer a major thermospheric upgrade until we can accurately reconcile the various historical thermospheric temperature and density data sets, ideally with new measurements that can constrain the problem. In this section, we explore some of the aspects of this challenge.
There are few contemporary data sets of middle thermospheric (~120–200 km) temperature. Figure 16 shows mean residuals, with respect to MSIS 2.0, of Envisat/MIPAS and Millstone Hill ISR temperatures as a function of altitude. Because the MSIS 2.0 thermospheric temperature above 120 km is largely identical to MSISE00, the results are nearly the same if MSISE00 is used as the reference model. Nighttime MIPAS and daytime Millstone Hill ISR data (Figures 16b and 16a, respectively) suggest that MSIS temperatures are 30–50 K too high above 120 km, which may be associated with longterm trends that have been reported in the ISR data (Zhang et al., 2011; Zhang & Holt, 2013). However, daytime MIPAS average temperatures (Figure 16a) are within 10–20 K of MSIS 2.0. Thus, the daytime MIPAS and ISR residuals differ by up to 30 K. The biases depend on altitude, and the MIPAS and ISR profile shapes are different. These biases appear minor from a total temperature profile perspective (Figure 16d), but they strongly affect middle and upper thermospheric density via thermal expansion or contraction: A 30 K decrease in temperature above 120 km produces an ~25% decrease in mass density at 400 km. Heightdependent biases are not necessarily problematic, since they can possibly be corrected by tuning the Bates profile parameters, and the Bates profile is not an exact representation of thermal balance. However, the MIPAS and ISR mean residuals appear to be discontinuous with those of the upper mesospheric data sets (which are tightly clustered within ~5 K near their upper bound of 105 km), and it is not immediately clear how this discontinuity should be resolved.
Figure 17 similarly shows MIPAS and Millstone Hill ISR mean residuals as a function of local time. Near 120 km, the ISR residuals are fairly consistent over local time and in different seasons. Between 150 and 180 km, however, the ISR residuals depend strongly on local time and season, and the MIPAS nighttime residuals also depend strongly on season. It may be possible to tune MSIS to better match these patterns, but this would require commensurate tuning of the MSIS species densities, since these bias patterns are not evident in accelerometer measurements of upper thermospheric mass density.
We note here that MSISE00 assimilated 1988–1997 Millstone Hill ISR temperatures between 100 and 130 km altitude, which may account for the relatively flat mean residuals in Figures 17d–17f (which are based on 2002–2015 ISR data). However, the MSIS thermospheric temperature profile and variations are also strongly determined by mass spectrometer measurements, from the 1970s and 1980s, of N_{2} density. This reliance on density data to estimate thermospheric temperature resulted in accurate middle and upper thermospheric N_{2} density, but it did not distinguish the contribution of loweraltitude temperatures to the N_{2} density, which in MSISE00 and earlier versions was effectively encoded in a poorly resolved combination of the reference thermospheric N_{2} density at 120 km and the Bates temperature and gradient at 120 km.
The cooler stratospheric and mesospheric temperatures in MSIS 2.0 (section 19), compared to MSISE00, have direct and strong implications for thermospheric N_{2} density. In MSIS 2.0, thermospheric N_{2} density is now coupled to the underlying temperature via the effective mass profile. Figure 18 shows the difference between the MSIS 2.0 and MSISE00 global average temperature profiles and the resulting change in the N_{2} density profile. Although the temperature change is relatively small, the effect of the thermal contraction accumulates with altitude via the hydrostatic term in Equation 2. The resulting MSIS 2.0 thermospheric N_{2} density is ~20% lower than MSISE00. This result is not sensitive to the choice of effective mass profile parameters, because the mass shift of N_{2} is quite small (28.97 to 28 Da). Thus, even without new data in the thermosphere, the highly robust lower and middle atmosphere temperature data constrain the N_{2} density in the thermosphere. Other thermospheric species, especially O_{2}, Ar, and He, are likely also affected by the cooler stratospheric and mesospheric temperatures, but in the absence of middle thermospheric data to constrain their effective mass profile parameters (to which the densities are much more sensitive than in the case of N_{2}), the thermospheric densities of these species remain uncertain, and in MSIS 2.0 they relax to MSISE00.
For users who desire the legacy thermospheric N_{2} profile and its variations, the software includes an option to turn on the relaxation to MSISE00. For the global average profile, this relaxation is illustrated by the dashed line in Figure 18c. A consequence of turning this adjustment on is that the model becomes hydrostatically imbalanced between ~120 and 200 km, due to the added N_{2} mass, as shown in Figure 18d. This panel shows the gradient of model pressure with respect to geopotential height divided by mass density, which is equal to the reference gravitational acceleration if the model is in hydrostatic equilibrium. Both MSISE00 and MSIS 2.0 are hydrostatically balanced in the fully mixed region below 70 km and the diffusive equilibrium region above ~200 km. In the transition region, imbalances are typically less than 0.1 m/s^{2}; with the legacy N_{2} adjustment turned on, the imbalance is ~0.4 m/s^{2}.
Empirical models of the thermosphere are sometimes referred to as “static” models (e.g., Jacchia, 1971), because the parameters of these models (including MSIS 2.0) are fixed and the formulations do not include explicit time dependence. However, the results discussed in this paper and studies of atmospheric trends (e.g., Garcia et al., 2019; Laštovička, 2017; Randel et al., 2016; Solomon et al., 2018) demonstrate that the climatological behavior of the thermosphere and underlying layers is not stationary. This poses additional challenges for the MSIS thermosphere, since its thermospheric composition is based primarily on mass spectrometer data that are now 35–50 years old, and available contemporary data consist mainly of in situ mass density measurements, which, while highly valuable, do not by themselves resolve the influences of temperature and individual species dynamics. Ultraviolet remote sensing of N_{2}, O_{2}, and O densities above 130 km (Meier et al., 2015) is a relatively new and promising technique, but there are some outstanding discrepancies between those retrievals and mass density derived from orbital drag (e.g., Emmert et al., 2014, Figure 17). Further complicating the challenge of updating the MSIS thermosphere is that measurement techniques often use MSIS itself as an initial guess or for ancillary parameters needed in the retrieval (e.g., BermejoPantaleón et al., 2011; Meier et al., 2015).
We also note that species densities in the middle thermosphere are not known to the accuracy needed to fully understand the critical transition from a fully mixed atmosphere to a diffusively separated one. In physicsbased models, subgridscale mixing parameterizations typically have to be tuned to produce the observed upper thermosphere (e.g., Qian et al., 2009). Observations needed to constrain lower and middle thermospheric physics are scarce, and the 100–200 km region can perhaps be termed the new “ignorosphere,” an epithet previously applied to the mesosphere, which by comparison is now well measured and understood. The lack of 100–200 km observations is due in part because it is a difficult region to probe: Emissions that can be exploited by remote sensing are relatively weak, and orbital in situ access is hampered by large satellite drag.
Considering these challenges, we judge that advances in climatological specification of the thermosphere would be greatly facilitated by (1) new in situ mass spectrometer measurements of species densities to recalibrate the thermosphere to the current epoch; (2) new techniques for, and extensive measurements of, heightresolved temperature and species densities in the 100–200 km region; and (3) a concerted effort to identify and reconcile systematic biases among existing and new temperature and composition data sets, taking into account the strong coupling between temperature and species densities as well as longterm changes in thermospheric climate. The recently launched ICON and GOLD missions partly address the need for new measurements in the 100–200 km region. ICON/MIGHTI (Englert et al., 2017) is measuring temperatures up to ~140 km (Stevens et al., 2018). GOLD is measuring O_{2} density profiles from ~130 to 250 km (Eastes et al., 2017). Daedalus (Sarris et al., 2020) is a proposed European Space Agency mission to make in situ heightresolved measurements in the 100–200 km region from an eccentric lowperigee orbit; if selected, this mission could make a strong contribution to middle thermospheric physics and specification in the 2028–2030 timeframe.
6.5 Upper Thermosphere Mass Density
Although MSIS 2.0 does not include a major revision of the thermosphere, it does address some welldocumented aspects of upper thermospheric mass density that are not accounted for in MSISE00. As described in section 16, we tuned the O intraannual variation, including solar activity modulation thereof, to orbitderived mass density. Figure 19 shows bin average residuals as a function of day of year; unlike earlier plots, these residuals are modelminusdata, since in this case we are superposing results from different models and showing individual data sets in separate panels. The MSIS 2.0 densities are ~10% lower overall than MSISE00, in part because of the ~7% downward revision of the TLEderived density described in section 7 and supported by the ballistic coefficient calculations of Pilinski et al. (2011).
In Figures 19a–19f, the MSIS 2.0 residual means with respect to the TLE density data set are flatter than MSISE00 across all levels of solar activity not just for the 1986–2005 time interval used to tune the model but also for the independent time period 1971–1985. This indicates that the tuning and the addition of a solar cycle modulation are robust and are consistent with the findings of Emmert and Picone (2010). The MSIS 2.0 residuals are also somewhat flatter with respect to the CHAMP and GOCE data sets (Figures 19g–19k), suggesting that the tuning is also supported by these independent data sets.
At the lower altitude of the GOCE observations (225–300 km), N_{2} is a significant contributor to the mass density (~20–60%) and the overall difference between the MSIS 2.0 and MSISE00 mass density is larger here (~16%) due to the lower N_{2} density in MSIS 2.0 (see section 22). Overall, MSIS 2.0 mass densities are ~2% larger than GOCE and ~9% larger than CHAMP (Data Set S7).
Figure S6 is the same as Figure 19 but additionally shows results from the Global Average Mass Density Model (Emmert, 2015a). GAMDM 2.1 is based solely on the 1986–2005 TLEderived densities and therefore performs better than MSIS with respect to this data set.
Figure 20 illustrates the intraannual variation of MSISE00 and MSIS 2.0 global average mass density at 400 km, as a function of F_{10.7}. The models depict the wellknown semiannual oscillation with equinoctial maxima and an annual oscillation with an overall minimum near June solstice. The righthand panel shows the log ratio of the two models. With the addition of a solar cycle modulation to the O density, at solar maximum MSIS 2.0 has a larger March equinox peak and a deeper June solstice minimum. This is consistent with Bowman et al. (2008) and Emmert and Picone (2010; Figure 5).
7 Summary and Future Development
Like its predecessors, NRLMSIS® 2.0 is an empirical atmospheric model that estimates the average observed behavior of temperature, eight species densities (N_{2}, O_{2}, O, He, H, Ar, N, and anomalous O), and mass density as a parametric function of location, day of year, time of day, solar activity (via the F_{10.7} index), and geomagnetic activity (via the ap index). The model incorporates physical constraints of hydrostatic equilibrium in the wellmixed lower atmosphere (below ~70 km altitude), speciesbyspecies hydrostatic equilibrium (similar to diffusive equilibrium) above ~200 km, and relaxation of thermospheric temperature to an asymptotic exospheric temperature (via the Bates temperature profile). Fortran 90 software to run the model is available in the supporting information and in the repository listed in the acknowledgments.
NRLMSIS 2.0 is a major upgrade to the previous version, NRLMSISE00 (Picone et al., 2002), with fundamental changes to the formulation and the assimilation of extensive new measurements in the middle and lower atmosphere. The formulation changes include the following:
 Thermosphere species densities are now fully coupled to the entire temperature profile from the ground to the exosphere, via the introduction of an effective mass profile that approximates the transition from fully mixed to diffusive separation for each species.
 Geopotential height is now used internally as the vertical coordinate of the model; previous versions used geopotential differences among geometric altitude reference levels.
 Modeled O density now extends down to 50 km altitude, via the introduction of cubic splines between 50 and 85 km that are decoupled from temperature.
 Thermal diffusion of He, H, and Ar, which was applied in the thermosphere in previous versions, has been removed.
MSIS 2.0 development focused primarily on altitudes below 100 km. To estimate the parameters of the reformulated model, we assimilated extensive new measurements of temperature in the troposphere, stratosphere, and mesosphere covering the years 2002–2018. The data types used are numerical weather prediction reanalyses, microwave limb sounding, solar occultation, groundbased Na lidars, and infrared passive remote sensing. We also assimilated mesospheric infraredbased measurements of O and H, as well as upper thermospheric mass density derived from satellite orbits.
The temperature data sets in the mesosphere and below are mutually highly consistent, with only a few exceptions that we addressed with appropriate exclusions from the fitting process. The temperature performance of MSIS 2.0 is considerably improved compared to MSISE00, based on residual analyses of independent samples of the temperature data sets. Biases among MSIS 2.0 and the data sets are typically less than 3 K in the mesosphere and smaller in the stratosphere and troposphere. Residual standard deviations are typically 10–15 K in the upper mesosphere, 6–8 K in the lower mesosphere, and 5–6 K in the stratosphere and troposphere. MSIS 2.0 is warmer overall than MSISE00 in the upper troposphere and cooler in the stratosphere and mesosphere, which is consistent with the assimilation of contemporary data sets and firstprinciples simulations of longterm changes in the atmospheric temperature.
MSIS 2.0 mesospheric O and H density predictions are also improved compared to MSISE00. In particular, at midlatitudes both species now transition from a winter maximum in the upper thermosphere to a summer maximum in the upper mesosphere, which is consistent with previous data analyses and modeling.
In the fully connected thermosphere and lower atmosphere of MSIS 2.0, the cooler stratospheric and mesospheric temperatures produce N_{2} densities in the thermosphere that are ~20% lower than MSISE00. The software includes an option to recover the MSISE00 thermospheric N_{2} density, but this comes with a large hydrostatic imbalance in the middle thermosphere.
The O density in the MSIS 2.0 upper thermosphere is ~10% lower overall than MSISE00, mainly as a result of a downward revision of orbitderived mass density based on theoretical ballistic coefficient modeling. Additionally, the global intraannual variation of thermospheric O now includes a solar activity modulation consistent with previous findings of increased annual and semiannual oscillations at solar maximum.
Besides the changes to N_{2} and O, the MSIS 2.0 thermospheric output is largely the same as MSISE00. Because of difficulties reconciling new thermospheric temperature and density data sets, as well as combining the new data sets with historical mass spectrometer measurements, we have deferred a major thermospheric upgrade of the model. We have concluded that significant advances in climatological specification of the thermosphere require new in situ mass spectrometer measurements of species densities, new techniques for heightresolved temperature and species densities in the 100–200 km region, and a concerted effort to identify and reconcile systematic biases among temperature and composition data sets, taking into account longterm changes in thermospheric climate.
We are currently developing a nitric oxide (NO) component for MSIS, which is slated for inclusion in the next release. Other future plans for the model include the addition of nonmigrating tides, carbon dioxide, and explicit time dependence to account for longterm changes.
Acknowledgments
Work at NRL was supported by the Office of Naval Research (including via the BSION program) and NASA (Grants NNH16ZDA001NHSR/ITM16_20013, NNH14ZDA001NGIODDE14_2/NNH15AZ72I, and interagency agreements to support D. Siskind's participation on the TIMED (NNG17PX04I) and AIM (S50029G) programs). M. Jones Jr. acknowledges support from NASA's Early Career Investigator Program (Grant NNH18ZDA001NECIP/18ECIP 2801 0018). M. G. Mlynczak acknowledges the NASA Heliophysics Division TIMED Project for continued support enabling collaborations such as these. The ACE mission is funded primarily by the Canadian Space Agency. The University of Colorado STAR Na Doppler lidar work was supported by the National Science Foundation Grants AGS1136272 and AGS1452351. X. Chu acknowledges the contributions of Wentao Huang, Weichun Fong, Zhibin Yu, John A. Smith, and Cao Chen to the STAR lidar data collection and analysis. B. Funke acknowledges financial support from the State Agency for Research of the Spanish MCIU through Project ESP2017–87143R, the “Center of Excellence Severo Ochoa” award to the Instituto de Astrofísica de Andalucía (SEV20170709), and EC FEDER funds. Millstone Hill ISR data products and access through the Madrigal distributed data system are provided to the community (http://www.openmadrigal.org) by the Massachusetts Institute of Technology (MIT) under support from the U.S. National Science Foundation Grant AGS1952737. F. V. acknowledges NSF Grant 1759573 for the project “Collaborative Research: Instabilities and Turbulence in Gravity Wave Dissipation and Formation of Thermospheric Sodium Layers above the Andes”. Work at the Jet Propulsion Laboratory, California Institute of Technology, was done under contract with the National Aeronautics and Space Administration. B. P. W.'s work was supported by NSF AGS1829138. T. Yuan acknowledges the following funding in support of the CSU/USU lidar over the years: the National Science Foundation Grants AGS1041571, AGS1135882, AGS1734333, and AGS1136082. The authors are grateful to the following colleagues for helpful discussions and/or beta testing: J. Tate, M. LópezPuertas, M. S. Dhadly, M. H. Stevens, S. D. Eckermann, S. E. McDonald, and R. R. Meier.
List of Acronyms

 ACE

 Atmospheric Chemistry Experiment

 AIM

 Aeronomy of Ice in the Mesosphere (satellite mission)

 ALO

 Andes Lidar Observatory

 ALOMAR

 Arctic Lidar Observatory for Middle Atmosphere Research

 CFSR

 Climate Forecast System Reanalysis

 CHAMP

 Challenging Minisatellite Payload

 Envisat

 Environmental Satellite

 FTS

 Fourier Transform Spectrometer

 GOCE

 Gravity Field and SteadyState Ocean Circulation Explorer

 GOLD

 Globalscale Observations of the Limb and Disk (satellite mission)

 HALOE

 Halogen Occultation Experiment

 ICON

 Ionospheric Connection Explorer

 ISR

 Incoherent Scatter Radar

 MERRA2

 ModernEra Retrospective analysis for Research and Applications version 2

 MIGHTI

 Michelson Interferometer for Global HighResolution Thermospheric Imaging

 MIPAS

 Michelson Interferometer for Passive Atmospheric Sounding

 MLS

 Microwave Limb Sounder

 MLT

 Mesosphere and lower thermosphere

 MSIS®

 Mass Spectrometer Incoherent Scatter radar

 NRL

 Naval Research Laborotory

 ODRPACK

 Orthogonal Distance Regression Package

 OSIRIS

 Optical Spectrograph and InfraRed Imager System

 SABER

 Sounding of the Atmosphere using Broadband Emission Radiometry

 SCIAMACHY

 SCanning Imaging Absorption SpectroMeter for Atmospheric CHartographY

 SOFIE

 Solar Occultation For Ice Experiment

 STAR

 Student Training and Atmospheric Research

 TIMED

 Thermosphere Ionosphere Mesosphere Energetics and Dynamics (satellite mission)

 TLE

 TwoLine Elements

 UARS

 Upper Atmosphere Research Satellite

 UHF

 Ultra high frequency

 WACCMX

 Whole Atmosphere Community Climate Model with thermosphere and ionosphere extension

 WGS

 World Geodetic System

 WMO

 World Meteorological Organization
Appendix A
1 Geopotential Height
Note that for altitudes near the surface, the difference in the numerator of Equation A3 is small compared to the values of U and U_{0}. Therefore, we carry out the geopotential height calculation in double precision.
Figure A1a shows the difference between geodetic altitude h and latitudinally averaged (areaweighted) geopotential height, as a function of geodetic altitude. The difference increases nonlinearly with altitude; a geodetic altitude of 800 km corresponds to a geopotential height of ~710 km. Figure A1b illustrates the latitude dependence of geopotential height, relative to its latitudinally averaged value at each geodetic altitude. The magnitude of the deviation increases with increasing altitude; at 800 km, the maximum deviation is ~3 km.
The difference between the geopotential height calculated from WGS84 and the true local geopotential height relative to the geoid is less than ~0.1 km (National Imagery and Mapping Agency, 2000), with the largest differences near the surface. The density scale height of the atmosphere is almost everywhere greater than 5 km, so the simplified geopotential could induce density errors of at most 2%. However, some of that error will be corrected by the spherical harmonic expansion of the pressure and density parameters in the model (section 5).
Open Research
Data Availability Statement
NRLMSIS 2.0 Code and all data samples used in this work are available at https://map.nrl.navy.mil/map/pub/nrl/NRLMSIS/NRLMSIS2.0. Raw CFSR Versions 1 and 2 data were obtained from http://nomads.ncdc.noaa.gov/modeldata/cmd_pgbh/ and https://nomads.ncdc.noaa.gov/modeldata/cfsv2_analysis_pgbh/, respectively. MERRA2 data were obtained online (https://goldsmr5.gesdisc.eosdis.nasa.gov/data/MERRA2/M2I3NVASM.5.12.4/). Groundbased lidar and ISR data were obtained from http://www.cedar.openmadrigal.org website. USU Lidar data are also available online (https://doi.org/10.15142/T33H26). MIPAS data used in this study are available for registered users at http://www.imkasf.kit.edu/english/308.php website. CHAMP and GOCE accelerometer densities were obtained from http://thermosphere.tudelft.nl website.