Re: how to measure entropy of music?

From: Andor (an2or_at_mailcircuit.com)
Date: 12/04/04


Date: 4 Dec 2004 01:28:10 -0800

glen herrmannsfeldt wrote:

> I would think a pure sine wave should be low complexity, so maybe
> you should compress the Fourier transform.

I think a vocoder does something similar: linear predictive coding
tries to fit an AR model to the input, effectively searching for the
formants (resonant peaks) of the "vocal tract" transfer function. If
you initialize an IIR filter with the LPC coefficients and the states
(delays) with the signal, you can let it "ring" like an oscillator -
the output will converge to a sum of pure damped sine waves at the
estimated frequencies of the formants. However, instead of storing the
sine waves, the vocoder stores the LPC coefficients and the residue
(prediction error).

To take this back to entropy, the energy of the prediction error could
well be regarded as the entropy of a vocal signal - it gives a good
measure on how unexpected the signal is, given its past history.

Recent work has shown that this method is also suited for general
audio signals, not just speech. The order of the AR model just
increases drastically (for interpolation, orders of 1000 - 3000 are
used). The same applies here for the prediction error and the entropy.

> Has anyone ever done an FFT of a whole CD?

If you take a music CD, I would expect a linear trend of the form
1/f^a. This is a common model for music signals, and it agrees well
with findings of long-range correlated time series.

For speech, I would guess a rectangular pulse response. The formants
vary across a certain frequency range, but are limited from above and
below.

>
> -- glen

Regards,
Andor



Relevant Pages

  • Re: Other "tested" techniques for detecting bias
    ... If you know that an algorithm was used then you don't need to ... You don't "know" until you test your prediction against what actually ... This is the restriction that SETI ... ways to conceal their signals so that we could not decipher them. ...
    (talk.origins)
  • Re: Is the Curt net a kind of decision tree?
    ... only prediction it needs to make. ... maximizing function on it's own, and as a result, there was no need for ... output of the amplifier. ... input signals, to a different set of output signals. ...
    (comp.ai.philosophy)
  • Re: motion sensor signal compression/prediction
    ... I have a bunch of huge motion sensor signals to lossy compress. ... need random access to the samples block based compression schemes are ... Along with this prediction ...
    (comp.dsp)
  • Re: limitations of Hawkins top-down predictive model
    ... each area receiving FB signals from fully *HALF* of all the other ... including many at the same "level" of the hierarchy besides from ... using information from elswhere to improve it's prediction, ... Why wouldn't both lower level ...
    (comp.ai.philosophy)