We start with the concept of an MT Station and time series of calibrated Fourier coefficients
No discussion in this module is provided for how these are obtained
In a noise free world, at a fixed frequency, under the plane wave assumption, the measured fields at any fixed location should obey the following equation:
The fundamental TF relationship is defined by equation \eqref{eq:1}. The elements of the 3x2 tensor Zij and Ti are (theoretically) invariants for a fixed measurement location for simultaneous measurements of $E.
*Note that we can regress on Hx,Hy as if they were “inputs” and the Ex,Ey,Hz were “output” even though physically the Hx,Hy may be some combination of input and output.
This is three equations in six unknowns, but each of the three equations can be solved separately for pairs of tensor elements (rows of Equation 1).
Extracting the first equation from the system we write:
[Ex]=[ZxxZxy][HxHy](2a)
In the “E_x” equation. the values for E, H are themselves vector observations
Here the data are organized into row vectors. Ex is 1 x N, Z is 1 x 2, and H is 2 x N. 1
The standard (Ordinary Least Squares -- OLS) way to solve this is by multiplying H on the right by H† , the N x 2 conjugate transpose, yielding a square2 matrix, and then multiplying on the right by the inverse of this newly formed square matrix.
The next few equations parameterize the OLS solution in terms of cross powers, but they boil down to the following, 2d solves 2c for OLS:
The equation is now in a form where we can call numpy or scipy’s solve functions, but note that since the solution is only 2x2 we can also solve explicitly.
The 2x2 on the right (we can call it HH†)is basically the covariance matrix of the magnetic field data.
N.B. We recover the solutions for Zyx and Zyy simply by substition of Ey for Ex in the above equations, and similarly Tx and Ty by substition of Hz for Ex. Proof is left as an exercise for the reader.
We have shown that the elements of the impedance tensor can be derived from the cross-powers of the various channels.
Note that this has all been one under the assumption that the measurements are being made at a single location (that is what the TF equation is relating -- Ex, Ey, Hz, to local Hx, Hy.
Mathematically, any channels A, B can be used to invert the matrix, provided that HC is invertable, which is the same as saying it’s determinant is non-zero.
For example, we cannot allow A equal B.
TODO: There is a stronger condition which also must be obeyed, which is that the columns of C must be coherent with H, E.
Sims et al. evaluate each of the 6 pairings of Ex, Ey, Hx, Hy (from a single station).
They point out that pairings (Ex, Hx) and (Ey, Hy) are unstable -- this is because tend to be co-linear, so the determinant of HC can be very close to 0.
The remaining combinations are all “legal”, with two of them tending to overestimate and two tending to underestimate the impedance tensor elements, ... this can be seen by auto-power and cross power arguments (TODO add more detail here -- AST).
A corralary to this that seems worth mentioning is that any rotated version of the channels A, B would still result in an invertable HC. i.e.
C is Nx2, so we can muliply it by any 2x2 rotation matrix and we still get a linear-independent Nx2. Say that we represent the rotation matrix by R,
then (HCR)−1 = ((HC)(R))−1 = (R)^{-1}(HC)^{-1}$, i.e. we will recover the same solution no matter if the reference channels are rotated.
What if the data are not noise-free? What can we say about the derived solutions?
That would depend on the details of the noise, so start with a simple noise model based on a few assumptions:
Let any channel of data, aobs(t) = as(t)+an(t). 3
The subsripts s,n are for the “signal part” and the “noise part” of the observed data respectively.
Let’s evaluate the role of noise under some Assumptions:
Assumption 1: Noise is incoherent from all signal
<asig(t),bn(t)>=0 for any channels a, b. We haven’t really formlalized what is signal and what is noise yet, so this is a pretty weak constraint. For now say that signal plane wave, that obeys equation 1. So all noise is assumed incoherent with the signal source.
Assumption 2: Noise is incoherent between channels
<an(t),bn(t)>=0 for any channels a, b. So the non-plane-wave part of the observed data that does not obey equation 1 is incoherent on all channels.
Relaxing the noise-free assumption:
(technically should incorporate noise around Equation A13b, and carry the noise through the math above -- but for now, just look at Equation tmp2 above, substituting the signal+noise model):
But this says that the denominator is biased up by the noise autopower in Hy, so Zxy will be smaller than it ought to be.
Other channel combinations will yield biasing up by Hy in the numerator, or by E in the numerator and denominator respectively.
Proof left as an exercise.
Aside: since noise levels in H tend to be lower than in E, if you have to have bias it maybe better to have it from H. That said, you can also compute the biased estimates and average them -- this was the recommended approach until the Remote Reference method.
The above approach motivates the RR approach as an unbiased estimator. WLOG we can replace channels A and B in the matrix C with the remote Hx and remote Hy respectivley. If we assume that the remote reference channels are parallel to the channels at our station then all the assumptions 0,1,2 can be applied and we simply wind up with
tmp4d, but in this case:
And by Assumption 2 the second term in the denominator is 0 and we recover an unbiased estimate:
Zxy=⟨Hy,sRy,s∗⟩⟨Ex,sRy,s∗⟩(tmp4f)
This “cross-power trick” is a common enough in geophysics (and other fields) that it is worth looking at -- If we had started from first principles on a noise-free homogeneous earth we could have derived that
Zxy=HyEx(tmp4g)
Proof left as an exercise
but when noise is present in the measurements, we can use the “cross-power trick” to get an unbiased estimate ... this basically selects the part of the signal that is coherent with Ry. As long as Ry is relatively quiet, and the noise it does see is incoherent with the local station noise, then we are effectively projecting the local channels onto the coherent signal. It can be thought of as is something like
Zxy=∣Hx∣∣Ry∣cos(θ)∣Ex∣∣Ry∣cos(θ)(tmp4h)
where cos(θ) is like the angle between the real answer vector and the answer+noise, although for complex numbers the inner product is not quite as geometrically straightforward to sketch. The same trick can be used in controlled source applications for example to reduce noise in the receiver by taking
Before moving on, consider a 3-D earth ... then in 13b_with_R, we would not have had much simplification -- we cannot say in general that Zxx is 0 for one thing, so we would never have been able to strike the Hx autopowers like we did in tmp2 -- because we would not have been able to “zero-out” the first terms in the sums of numerator and denominator as was done in tmp0, tmp1.
We would have a bigger, messier algebraic expression to track.
But what about Assumptions 1 & 2.
Assumptions 1 is a topic for discussion. It would seem that if you have noise that is effectively plane wave, as long as it is present at both local and remote, you should be able to treat is as signal -- it walks like a duck -- (CSMT, CSAMT).
Assumptions 2 is not the same -- incoherent channel noise is rather optimistic. A ground sqirrel chewing a cable is one wat this can happen, so is a faulty electronic component, but for a real physical process that generates EM Noise, assuming that it effects only one channel is not realistic. For one thing, E, H are coupled, so it is only plausible that noise in an H would have a corresponding noise in E (Discuss electric fences), and also, it woudl be very lucky if a noise source was so precisely polarized that it coupled into only one of say Hx or Hy, more likely it would be in both -- and ditto for electrics.
The RR method will still work well for 2 or 3D, BUT STILL ASSUMES that channel noise is uncorrelated with other channel noise to get an unbiased estimate. So-- N.B. If the noise is not incoherent, i.e. say some channels at either site have noise correlated with noise at the same site, we do not get the friendly cancelations of the previous section, rather the noise contaminates the estimates. The details of working through that are somewhat algebraically involved, and it is probably better to build a toy model with coherent noise in various channel combinations and look at the bias rather than derive it -- at least for now.
For incoherent, Gaussian noise, we get an unbiased estimate of the TF
these converge to the true value as the number of observations increases
Sims, W.E., F.X. Bostick, and H.W. SMITH. “THE ESTIMATION OF MAGNETOTELLURIC IMPEDANCE TENSOR ELEMENTS FROM MEASURED DATA?” GEOPHYSICS 36, no. 5 (1971). Sims et al. (1971).
There is one more difference with Vozoff91, which is that V91 Equation 42b has a typo: The very last term in bra-kets in the denominator (above HxA∗) is unfortunately, incorrectly transcribed in V91 as (above HyA∗).
Sims, W. E., Bostick, F. X., & Smith, H. W. (1971). THE ESTIMATION OF MAGNETOTELLURIC IMPEDANCE TENSOR ELEMENTS FROM MEASURED DATA. GEOPHYSICS, 36(5), 938–942. 10.1190/1.1440225