Mutual information between geomagnetic indices and the solar wind as seen by WIND: Implications for propagation time estimates
Abstract
[1] The determination of delay times of solar wind conditions at the sunward libration point to effects on Earth is investigated using mutual information. This measures the amount of information shared between two timeseries. We consider the mutual information content of solar wind observations, from WIND, and the geomagnetic indices. The success of five commonly used schemes for estimating interplanetary propagation times is examined. Propagation assuming a fixed plane normal at 45 degrees to the GSE x-axis (i.e. the Parker Spiral estimate) is found to give optimal mutual information. The mutual information depends on the point in space chosen as the target for the propagation estimate, and we find that it is maximized by choosing a point in the nightside rather than dayside magnetosphere. In addition, we employ recurrence plot analysis to visualize contributions to the mutual information, this suggests that it appears on timescales of hours rather than minutes.
1. Introduction
[2] An important problem in space weather forecasting and magnetospheric physics is the determination of solar wind effects on Earth, given measurements from a satellite. In particular we consider a spacecraft in orbit around the sunward libration point L1. Solar wind plasma passing this point takes around 60 minutes to reach the Earth, providing sufficient advance notice of solar wind conditions to facilitate real-time space weather prediction.
[3] Early work on this subject [Russell et al., 1980; Kelly et al., 1986] demonstrated that an L1 monitor is a good predictor of solar wind conditions near Earth, assuming a planar description of the solar wind propagation. Studies of techniques to estimate plane orientation typically fall into two categories: 1) propagation of bulk properties [Collier et al., 1998; Richardson et al., 1998; Blanchard and Bankston, 2002; Weimer et al., 2003]; 2) propagation of specific features such as discontinuities [Ridley, 2000] or sudden southward turnings of the magnetic field [Horbury et al., 2001]. In case 1, correlation coefficients are computed between time-lagged solar wind timeseries from spatially separated spacecraft, in order to quantify the success of a particular time-lagging method. In case 2, a specific class of feature is identified in the timeseries from each spacecraft, and the difference between predicted and observed arrival times at the downstream spacecraft quantifies the degree of success.
[4] In the present paper we adopt the first approach, considering also the related problem of what happens once the solar wind has propagated through interplanetary space and is incident on the Earth's magnetosphere. We take solar wind measurements from the WIND spacecraft [Lepping et al., 1995; Ogilvie et al., 1995], but in contrast to previous work we look for correlation with detectable geomagnetic effects observed via the AE geomagnetic indices [Davis and Sugiura, 1966] on Earth. Estimation of propagation delays in interplanetary space has been well studied (see above). However, from an operational perspective the correlation between effects on Earth and data from an L1 monitor is also of direct interest, and is the focus of this paper. The method is not restricted by the requirement that a second spacecraft is in the solar wind during the same time period as the first, and hence we can expect to find more data available and thus obtain better statistics. Using L1 and Earth also provides a large distance over which to estimate the propagation, and hence greater contrast between the propagation estimates given by different methods.
2. Data
[5] For the Earth's response to the solar wind we use provisional AE index data from 1995, around solar minimum. We use WIND data for the first half of 1995, from day 24 to day 200. During this period the spacecraft was in the vicinity of the L1 point. One advantage of using geomagnetic indices instead of a second spacecraft is that we are able to use a long period of data and are hence certain of obtaining a representative sample of the solar wind. We take magnetic field measurements from the MFI experiment [Lepping et al., 1995], with 60s sampling, and proton bulk velocities and density from the SWE experiment [Ogilvie et al., 1995] with around 98s sampling.
3. Method


[8] We employ the following hypotheses concerning the plane normal vector n: 1) n is oriented along the x-axis; 2) n is perpendicular to B and lies in the ecliptic plane; 3) n is parallel to the vector (ByBz, BxBz, BxBy) [Ridley, 2000]; 4) n is in the ecliptic plane and is fixed perpendicular to the average magnetic field orientation, at 45° to the x-axis (the Parker spiral angle); 5) n is perpendicular to B, obtained using the minimum variance analysis (MVA) of Weimer et al. [2003], note that this is not a standard MVA (D. R. Weimer, private communication, 2004). For the projected timeseries we use the following solar wind parameters: 1) Bx; 2) By; 3) Bz; 4) B; 5) vxBz; 6) vx; 7) ε = vB2 sin4 tan−1(∣
∣); 8) ρ; 9) ρv; 10) ρv2; here ρ is the proton number density and the data are taken in the GSM (Geocentric Solar Magnetic) coordinate system.



4. Results
[10] The mutual information is computed between the projected solar wind timeseries and the AE index, and expressed as a function of additional time-lag Δt′. After applying the time-lagging, the peak in mutual information as a function of Δt′ has a higher value, indicating improved correlation. Figure 1 shows a comparison between the time-lagging methods using eight bins. We consider the maximum mutual information obtained for each method, and conclude that the Parker Spiral method (method 4) in this case yields the best results with a peak in mutual information at Δt′ = 33 ± 5 minutes, with a peak value of 0.360 ± 0.002 bits, where the uncertainty is estimated following [Roulston, 1999]. This value should be compared with the information contents (entropy) of the timeseries considered individually: 3 bits when using eight bins. We have also considered other constant plane angles but find that the Parker Spiral angle is optimal. The total magnetic field method (method 3) yielded the lowest mutual information. Interestingly, taking a different perspective, Ridley [2000] found the opposite to be true when considering the error in predicting the arrival times of discontinuities. Of the ten different solar wind parameters considered, those which show a clear enhancement of the mutual information versus additional time-lag Δt′ are those involving Bz and vx, with the highest being vxBz; the peak mutual information between ε and AE is 0.0771 ± 0.0007 bits. We observe that the density ρ does not show any significant correlation with the AE index.

[11] Figure 2 shows the mutual information between the three geomagnetic timeseries AE, AU and AL and vxBz, results for the Dst index are also shown. The greatest mutual information is found for the AE index, defined as the difference between AU and AL. We note that the peak in mutual information for AU occurs at a smaller additional time-lag Δt′ than for AL. AU thus reacts earlier than AL, implying earlier transport of information into the dayside magnetosphere. Furthermore, the fact that AE responds more strongly than either AL or AU implies that AL and AU each possess different information about the solar wind driver.

[12] The construction of the time-lag timeseries requires a target position PE, see equation (1). For targeting Earth we use the origin of the GSE coordinate system. Weimer et al. [2003] use a position near the northern cusp, (8,0,4)RE where RE denotes one Earth radius. We assume for simplicity that once the solar wind impacts on the magnetosphere, the velocity information is lost and subsequent developments are best represented by a constant additional time-lag, although in reality we would expect this parameter to vary with magnetospheric conditions. Figure 3 shows the peak mutual information obtained between vxBz and the AE index, using the x-axis propagation estimate, as a function of PE(x). As the target is moved along the x-axis, a different additional time-lag Δt′ is required to maximize the mutual information. This maximum mutual information is not at a maximum around the northern cusp, however, in fact we observe stronger correlation than the cusp value in a region around 10 to 220 Earth radii downstream. Since we neglect the compression and deceleration of the solar wind as it impacts on the magnetosphere, our target position should be considered as the location of where the solar wind would have propagated to if the Earth had not been present. The peak in mutual information occurs with a target around 125 Earth radii downstream. Comparison with models of the dynamic magnetosphere might elucidate whether this is consistent with dayside reconnection being the dominant effect on Earth.

5. Visualization of Mutual Information

[14] Figure 4 shows recurrence plots for a short period (days 68 and 69 of 1995) of the AE index and vxBz timeseries after time-lagging using the Parker spiral method. While there are many small scale differences between the plots, the overall large scale structure is similar. This shared pattern is quantified by the mutual information measurement, which can be related to counting the number of coordinates with black dots that are common to both plots [March et al., 2005]. Each black or white area corresponds to specific instants in time, so we can assess which parts of the timeseries contribute most strongly to the mutual information. In the present case it appears that the shared structure on the plots in Figure 4 derives from periods of both high and low activity, with structures which appear to be on a timescale of order hours.

6. Conclusions
[15] We have quantitatively distinguished between hypotheses concerning time delay estimation using geomagnetic data. The mutual information quantifies the degree to which the far upstream solar wind shares information with, and hence presumably can be used to predict, AE and has also been used to estimate the response time of the magnetosphere to solar wind input. In this work we have assumed that the magnetospheric portion of the propagation can be represented by a constant time-lag. Since this assumption is employed for each propagation estimation method it should not affect the conclusions of the work.
[16] Using this technique it is possible to examine the assumptions typically employed in time-lag estimation to Earth, which are inaccessible to multi-spacecraft studies. Here we have considered the target position and have shown that a tailward target seems to be optimal. Furthermore, using the technique of recurrence plot analysis, the nature of the correlation between the solar wind and geomagnetic timeseries as measured by mutual information can be visualized. This demonstrates that the correlation is dominated by phenomena on timescales of the order of a few hours, suggesting that substorms may be the dominant information carrier. An open question is whether the mutual information between AE and the solar wind input is strongly correlated with activity levels. Ordering the mutual information calculated over one day intervals with activity levels reveals a large amount of scatter which obscures any such trend. Elucidating this possible trend is thus a subset of future work. The mutual information technique on one interval alone cannot distinguish between a situation where there is a weak correlation present at all times or a situation where a strong correlation is present intermittently. However, the variability over day-long intervals and the common features on the recurrence plots (apparent on a variety of scales) imply the latter situation.
Acknowledgments
[17] The authors acknowledge support from the United Kingdom Engineering and Physical Sciences Research Council and the Particle Physics and Astronomy Research Council. We thank R. P. Lepping, A. Szabo and K. Ogilvie for provision of WIND MFI and SWE datasets, and the Kyoto WDC for provision of geomagnetic index data.





