What are wave buoys actually measuring and how confidently?

Hello I am Arthur, postdoc at the University of Western Australia, where I work in a group seeking to improve numerical wave forecast using data assimilation or machine learning. In both cases we use data retrieved by directional wave buoy hence the following question:

What are wave buoys actually measuring and how confidently ?

Motivation

The question is motivated by the assimilation of wave buoy measurements into spectral model like WW3. As a brief reminder, data assimilation aims at estimating an initial condition combining observation and model information, to later produce a forecast.

The link between model and observation space is classically described by the following observation equation:

Y = H(X) + Îľ

H: Observation operator
X: state (model space)
Y: observation
Îľ: observational noise

In the case of interest, X=E(f, θ) is a directional wave spectrum and Y is an in-situ wave buoy measurement. It is often considered in the literature that the first-5 directional Fourier coefficients [a0, a1, b1, a2, b2](f) of E(f, θ) can be observed so that:

H(X) = int_θ{ E(f, θ) [1, cos(θ), sin(θ), cos(2θ), sin(2θ)] dθ }  (un-normalized convention)

But [a0, a1, b1, a2, b2]=Y are actually estimated from measurements. The estimation involves a chain of various processes and assumptions. So the observational noise Îľ could arise from different sources and errors may accumulate and grow along the chain, which makes Îľ difficult to describe. Refining the question I am interested in:

What is the likelihood of Y knowing X?
i.e. what are the statistics of Îľ, which characterize the confidence in the observation and so act as weighting in the assimilation process?

Identifying error sources

The series of operations to go from raw sea state measurements to estimated coefficients can be quite complex as illustrated in [1] (fig. 2). From my understanding, there are three majors operations at which errors might be introduced:

1) Sensing
Raw data are produced by calibrated instruments. Calibration and intrinsic instrumental sensitivity introduce an error.

2) Response Amplitude Operator inversion
The Response Amplitude Operator expresses how a specific buoy-sensor system turns the true sea-surface particle motion into the buoy’s measured quantities. In the process, the RAO is inverted to infer the equivalent surface-particle. The operator is not perfectly known so it introduces an error.

To the best of my knowledge, commercial pitch-roll buoys assume an approximately flat RAO in the 0.03–0.5 Hz band and therefore omit an explicit inversion step. I keep the RAO error source as (i) it reminds that the assumption can break in different see condition, and (ii) the same concept re-appears whenever we assimilate data from sensors with non-flat response.

3) Fourier analysis
The corrected surface-particle time series are windowed, detrended, correlated and Fourier-transformed to estimate auto- and cross-spectra. From such spectra, under linear wave and stationarity assumption, directional coefficients [a0,a1,b1,a2,b2](f) can be deduced. The spectral density estimation process, by definition, introduces an error.

Quantifying observation uncertainty i.e. covariance(Îľ)

Unfortunately most of these errors seem untraceable. However what has been done in the article [2] is to use the covariance of the spectral density estimation. Calculation of such covariance can be found in [3] or [4], it is relatively heavy, even working with un-nornmalized directional Fourier coefficients.

I agree that’s it is better than nothing to use the spectral density estimation uncertainty. For instance if you consider a0(f) alone, the variance is proportional to a0**2(f), which translates into “the more energy in a frequency bin, the less certain the energy estimation”. In the classical Gaussian Data Assimilation framework, my understanding is that it means more flexibility for the model to fit high energy bins but less for low energy ones, which seems reasonable.

Finally, the same question becomes:

Is there a way to characterise Îľ more accurately or usefully than by relying solely on the covariance of spectral-density estimates ?

[1] Steele et al. (1992) Wave direction measurements using pitch and roll buoys. Ocean Engineering
[2] Crosby et al. (2017) Assimilating global wave model predictions and deep-water wave observations in nearshore swell predictions. Journal of Atmospheric and Oceanic Technology
[3] Borgman et al. (1982) Statistical precision of directional spectrum estimation with data from a tilt-and-roll buoy. Topics in Ocean Physics
[4] Long et al. (1980) The statistical evaluation of directional spectrum estimates derived from pitch/roll buoy data. Journal of Physical Oceanography

2 Likes

Seemanth, M., Remya, P.G., Kumar, R. et al. Impact of assimilating satellite and in-situ buoy observed significant wave height on a regional wave forecasting system in the Indian Ocean. J Earth Syst Sci 133, 144 (2024). Impact of assimilating satellite and in-situ buoy observed significant wave height on a regional wave forecasting system in the Indian Ocean | Journal of Earth System Science

M. Seemanth, P.G. Remya, Suchandra Aich Bhowmick, Rashmi Sharma, T.M. Balakrishnan Nair, Raj Kumar, Arun Chakraborty,Implementation of altimeter data assimilation on a regional wave forecasting system and its impact on wave and swell surge forecast in the Indian Ocean,Ocean Engineering,Volume 237,2021,109585,
Redirecting.

In a practical sense the biggest flaw of buoy observations is the mapping from a finite set of directional components to a directional spectrum. I quite liked the idea of mapping to the observables instead of something else as advocated by Sean and Bob (to note they map boundary and not initial conditions). That is why we used that as they basis for our operational forecasting using the Sofar network (See Houston et al).

further - in terms of practical DA the measurement error is really lost in the noise. If I set the buoy error to 0 in our DA system I really get a similar answer.

Hi Pieter and thank you for answering.

I agree that correcting model background directly from observational errors removes the need for such mapping, as done in [2] (Crosby et al, 2017). I actually came across the reference in the article you are mentioning, for those interested:

[5] Houghton et al. (2022), Operational Assimilation of Spectral Wave Data from the Sofar Spotter Network, Geophysical Research
Letters

If I understand correctly, in this work you use for the covariance of observational errors R=σ*Id which means you assume the observational errors uncorrelated with equal variance between different buoys, directional components and frequency bins.

Similarly the formulation for R in [2] (Appendix) also assumes un-correlation as the matrix is diagonal. The difference I see is the variances change for every buoys, directional components and frequency bins. Very briefly, I think that switching to this R would impact assimilation results only when spatial areas of correction from buoys are overlapping.

Do you mean setting σ=∞? which would be equivalent to not running the Optimal interpolation step (section 2.3 in [5]) and directly doing the ‘directional reconstruction’ step (section 2.4 in [5]) from observations?

(Some people from our team will be attending the International Workshop on Waves in Santander next September. If you plan on joining, I would be pleased to meet you there!)

Hello Arthur,

this is a very interesting thread. Obviously you want to work with the first 5 (a0=E(f) … ) or the co and cross-spectra of the measurements: directional spectra from buoys are just “pretty pictures” with no confidence whatsoever. In practice, different buoys have different responses and error levels (see e.g. O’Reilly et al. JTECH 1996 or Guimaraes et al. 2018: OS - A surface kinematics buoy (SKIB) for wave–current interaction studies ). Now, starting with a0 = E(f), the variance is proportional to the mean (see Young 1986, J. Waterways … ) … due to sampling alone. Measurement errors is another story but it is often negligible. Depending on the type of buoys the co and cross-spectra of the measurements can be analyzed in the same way.

Now, if you are interested in altimetry data, here are some further thoughts on uncertainties (which depend on the retracking method …) : De Carlo, M., & Ardhuin, F. (2024).
Along‐track resolution and uncertainty of altimeter‐derived wave height and sea level: Re‐defining the significant wave height in extreme storms. Journal of Geophysical Research: Oceans, 129, e2023JC020832. https://doi.org/10.1029/2023JC020832

Hi Fabrice and thank you for answering.

Yes my goal is to correct model output (background) directly from [a0, a1, b1, a2, b2](f). I agree that directional spectrum estimation from the ‘first-5’ do not add any information and just fill the blank according to a preferred assumption, for instance maximum entropy.

Thank you for pointing this out, there is typo in the original post (**2 to be ignored)
→ a0(f) alone, the variance is proportional to a0(f)

I am understanding it may be reasonable to quantify the first-5 uncertainty only using the variance of the spectral density estimation. I guess at the end of the day this choice can only be validated empirically: Is the forecast better with the preferred observation uncertainty.

People I work with are actually assimilating altimetry data so I will share this, I am personally focused on buoy. That being said the idea of simulating a realistic forward model to estimate uncertainties is interesting and definitely transferable to buoy.