SRA © 2007 - Roughness Estimation Model

Roughness Calculation Model (from Vassilakis, 2001 & 2005)

BACKGROUND

Numerous roughness calculation models have been proposed over the last ~100 years (e.g. Helmholtz, 1885; Plomp & Levelt, 1965; Kameoka & Kuriyagawa, 1969a&b; Hutchinson & Knopoff, 1978; Sethares, 1998), largely overlooking the two principal studies (von Békésy, 1960; Terhardt, 1974) that have systematically examined the relationship between a signal’s amplitude fluctuation degree and roughness. They have been employed in studies that attempt to link auditory roughness to auditory/sensory consonance (e.g. Vos, 1986; Bigand et al., 1996; Dibben, 1999), demonstrating a relatively low degree of agreement between calculated and experimental data.
All the above models (review in Vassilakis, 2001):
(a) overestimate the contribution of sound pressure level (i.e. absolute amplitude values of the interfering signals) to roughness,
(b) underestimate the contribution of the degree of amplitude fluctuation (i.e. relative amplitudes values of the interfering signals) to roughness, and
(c) often misrepresent the relationship between roughness and register.

SRA incorporates a new roughness calculation model, outlined below. Perceptual experiments testing the model indicate that it reliably and validly represents the perception of roughness, and performs better than previous roughness calculation models (Vassilakis, 2001, 2005).

ROUGHNESS OF SINE-PAIRS

Note: As is the case with all roughness calculation models, the absolute roughness values calculated by the model are arbitrary and are only useful for roughness comparisons among signals that have been analyzed using consistent analysis parameters.

The roughness R of a signal whose spectrum has two sinusoidal components with frequencies f₁, f₂ and amplitudes A₁, A₂, where f_min = min(f₁, f₂), f_max = max(f₁, f₂), A_min = min(A₁, A₂), A_max = max(A₁, A₂), is:

R = X^0.1*0.5(Y^3.11)*Z where

X = A_min*A_max
The term X^0.1 represents the dependence of roughness on intensity (related to the amplitude of the added sines). It is based on Terhardt (1974), adjusted (Vassilakis, 2000, 2001) to account for the quantitative difference between modulation depth (used in Terhardt, 1974) and amplitude fluctuation degree (the signal parameter influencing roughness).

Y = 2A_min/ (A_min+A_max )
The term Y^3.11 represents the dependence of roughness on amplitude fluctuation degree (related to the amplitude difference of the added sines). It, too, is based on Terhard (1974), adjusted (Vassilakis, 2000, 2001) to account for the quantitative difference between modulation depth and amplitude fluctuation degree.

Z = e^{-b1s(fmax - fmin)} - e^{-b2s(fmax - fmin)},
[b1 = 3.5; b2 = 5.75; s = 0.24/(s1f_min + s2); s1 = 0.0207; s2 = 18.96]
The term Z represents the dependence of roughness on amplitude fluctuation rate (frequency difference of the added sines) and register (frequency of the lower sine). It is based on Sethares's (1998) modeling of the roughness curves in Figure 1, below, curves that have been derived from multiple perceptual experiments examining the roughness of pairs of sines (Plomp & Levelt, 1965; Kameoka & Kuriyagawa, 1969a&b).

Figure 1: Roughness curves plotting the observed roughness (arbitrary measure - y axis) of a pair of sines (equal amplitudes) as a function of frequency separation (x axis) and frequency of the lower sine (after Sethares, 1998: 45). The roughness curves have been derived from multiple perceptual experiments examining the roughness of pairs of sines (Plomp & Levelt, 1965; Kameoka & Kuriyagawa, 1969a&b).

ROUGHNESS OF SIGNALS CORRESPONDING TO SPECTRA WITH MORE THAN TWO COMPONENTS

The roughness of signals corresponding to spectra with more than two sine components is calculated by summing the roughness of all sine-pairs in the spectrum. Although it has been argued that, depending on the relative phase of the respective amplitude fluctuations, the total roughness can be less than the sum of the roughness values for individual sine-pairs (von Békésy, 1960: 350-351), several studies (e.g. Terhardt, 1974; Lin & Hartmann, 1995) and pilot experiments (in Vassilakis, 2001) indicate otherwise. More specifically, Lin & Hartmann (1995) concluded that the total roughness is summed over all auditory filters. In addition, since roughness modeling is meaningful to roughness comparisons among multiple signals, rather than to roughness calculations of isolated signals, any such phase effects are more likely to be diffused across the signals of interest, the more complex the signals (see also below).

ROUGHNESS CALCULATION AND PHASE OF A SIGNAL'S SPECTRAL COMPONENTS

The phase of a signal's spectral components is not included as a parameter in the roughness calculation. According to Pressnitzer & McAdams (1999), the relative phase of the components of a three-component spectrum influences the complex signal's overall envelope shape and/or amplitude fluctuation degree, consequently influencing the signal's roughness, especially when three or more sine components fall within the same critical band.
In spite of this observation, the absence of the phase parameter from the model does not significantly distort the model's calculations. For the types of signals that will be submitted to the calculator (synthetic signals, where the phase relationship of the components can be controlled and remain the same for all, or natural signals from polyphonic passages, where the phase relationships are more likely to be random than systematic), differences in the roughness phase effects among the signals to be examined are either controllable or defused. This supports valid comparisons of the resulting relative roughness values.
As is the case with all roughness calculation models, the absolute roughness values calculated by the model are arbitrary and are only useful for roughness comparisons among signals that have been analyzed using consistent analysis parameters.

ROUGHNESS CALCULATION AND LOUDNESS

The model does not adjust for equal loudness.

ROUGHNESS MODEL CALCULATIONS VS PERCEPTUAL ROUGHNESS RATINGS

The roughness calculations of the model correlate very well (r = 0.98) with roughness ratings obtained in a set of perceptual experiments (Vassilakis, 2001 & 2005), better than predictions of models by Helmholtz, 1885 (r = 0.73) and Hutchinson and Knopoff, 1978 (r = 0.87).

REFERENCES
(with some links to the sources)

von Békésy, G. (1960). Experiments in Hearing. New York: Acoustical Society of America Press (1989).

Bigand, E., Parncutt, R., and Lerdahl, F. (1996). "Perception of musical tension in short chord sequences: The influence of harmonic function, sensory dissonance, horizontal motion, and musical training," Perception and Psychophysics 58: 125-141.

Dibben, N. (1999). "The perception of structural stability in atonal music: The influence of salience, stability, horizontal motion, pitch commonality, and dissonance," Music Perception 16(3): 265-294.

Helmholtz, H. L. F. (1885). On the Sensations of Tone as a Physiological Basis for the Theory of Music (2^nd edition). Trans. A. J. Ellis. New York: Dover Publications, Inc. (1954).

Hutchinson, W. and Knopoff, L. (1978). "The acoustic component of Western consonance," Interface 7: 1-29.

Kameoka, A. and Kuriyagawa, M. (1969a). "Consonance theory, part I: Consonance of dyads," J. Acoust. Soc. Am. 45(6): 1451-1459.

Kameoka, A. and Kuriyagawa, M. (1969b). "Consonance theory, part II: Consonance of complex tones and its calculation method," J. Acoust. Soc. Am. 45(6): 1460-1469.

Lin J. Y. and Hartmann, W. M. (1995). "Roughness and the critical bandwidth at low frequency," J. Acoust. Soc. Am. 97(5): 3274-797.

Plomp, R. and Levelt, W. J. M. (1965). "Tonal consonance and critical bandwidth," J. Acoust. Soc. Am. 38(4): 548-560.

Pressnitzer D. and McAdams, S. (1999). "Two phase effects in roughness perception," J. Acoust. Soc. Am. 105(5): 2773-2782.

Sethares, W. A. (1998). Tuning, Timbre, Spectrum, Scale. London: Springer-Verlag.

Terhardt, E. (1974). "On the perception of periodic sound fluctuations (roughness)," Acustica 30(4): 201-213.

Vassilakis, P.N. (2000). "Amplitude modulation depth versus degree of amplitude fluctuation: implementation error, adjustment and implications," J. Acoust. Soc. Am. 108(5/2): 2597

Vassilakis, P. N. (2001). Perceptual and Physical Properties of Amplitude Fluctuation and their Musical Significance. Doctoral Dissertation. Los Angeles: University of California, Los Angeles; Systematic Musicology.

Vassilakis, P. N. (2005). Auditory roughness as means of musical expression. Selected Reports in Ethnomusicology 12 (Perspectives in Systematic Musicology): 119-144.

Vos, J. (1986). "Purity ratings of tempered fifths and major thirds," Music Perception 3(3): 221-258.