-
-
p a p e r s
|
Summary
Qualitative evidence supporting this hypothesis:
The
general inability of place (spectral) theories of pitch perception to account for residue
phenomena has been the main justification for the introduction of various periodicity
theories, which are often favored despite their own inability to explain phenomena
such as: The place theory of
pitch perception can explain the above phenomena and, as the present study has indicated,
it may also explain residue phenomena. In its most
common form (von Bekesy, 1963a,b) this theory states that pitch is determined by the
place of maximum excitation along the basilar membrane and is not dependent on the firing
rate of neural discharges. This independence on firing rate allows the place theory
to eliminate one of the greatest problems faced by periodicity theories: their
inability to explain how pitch and loudness are separated in the auditory nervous system. On the other hand, the dependence on the place of maximum excitation seems to be challenged
by the fact that the pitch of complex tones does not necessarily correspond to the
component of a vibration that has maximum energy. Von Bekesy (1963a,b; 1972) has answered
this challenge by arguing that the place of maximum excitation along the basilar membrane
differs from the vibration maximum as a consequence of time delay and nervous inhibition,
nonlinearities that are introduced by the inner ear and result in the amplification of the
lowest component present in the cochlea (whether it was present in the original stimulus
or it was introduced as the difference tone). An
alternative answer to this challenge is suggested below. The hypothesis is that the pitch is determined not by the place of maximum excitation along the
basilar membrane but by the place along the basilar membrane where the change in the slope
of excitation changes direction [a somewhat similar
suggestion has been made by Evans, (1975; in Campbell & Greated,
1987)]. For complex
stimuli there are more than one such places (corresponding to successive components lying
within separate critical bands) fact that may explain the difference between synthetic
and analytic pitch. According to the above hypothesis, synthetic and analytic modes of
listening may be seen as describing the same operation, with the only difference depending
on whether a listener focuses on the place with the most prominent change in slope
direction (lowest component) or on one of the other available such places (first 4-5
components). The advantage of such an explanation over the one based on the
maximum-excitation hypothesis is that it explains the importance of the fundamental
(lowest component) to the perceived pitch of harmonic complex tones. Studies on the pitch of pure tones seem to support the above
hypothesis. Von Bekesy (1960, in von Bekesy
1963a) has shown that there are consistent deviations between the anatomically determined
frequency localization (place of maximum excitation along the basilar membrane) and the
pitch localization deriving from psychological experiments. More specifically, when
compared with the anatomically determined frequency localization, pitch localization along
the basilar membrane is shifted towards the helicotrema, with a shift that decreases
as we
move from low to high frequencies. Since the excitation pattern along the basilar
membrane in response to low frequencies spreads over a larger area than it does in
response to high frequencies, the observed localization shift may be explained in terms of
the place difference between the point of maximum excitation and the point of change in
the slope direction, a difference that is greater for low frequencies. A modified place theory of pitch
perception based on changes in the slope direction of the excitation pattern along the
basilar membrane has interesting implications regarding the pitch of inharmonic complex
tones. In such tones, interference between the original components introduces (according to the present study) new
components that do not in general match the frequency of
the original components or of each other. The resulting concentration of more than one
components on each critical band may smooth out the slopes of the excitation pattern along
the basilar membrane, making pitch determination difficult, ambiguous, or impossible,
depending on the total number of components and on the uniformity of energy/frequency
distribution [for example, infinite number of components and complete uniformity
(i.e. white noise) yields
no pitch]. REFERENCES von Békésy, G. (1963a). Hearing theories and complex sounds. J.
Acoust. Soc. Am., 35(4): 588-601. van den Brink, G. (1970). Two experiments on pitch perception:
diplacusis of harmonic AM signals and pitch of inharmonic Licklider, J. C. R. (1954). "Periodicity" pitch and
"place" pitch. J. Acoust. Soc. Am., 26: 945. Campbell, M. and Greated, C. (1987). The Musician's Guide to
Acoustics. New York: Schirmer Books. Houtsma, A. J. M. and Goldstein, J. L. (1972). The central origin
of the pitch of complex tones: evidence from musical Schouten, J. F. (1938). The perception of subjective tones. Proceedings
of Koninklijke Nederlandsche Akademie van |