• No results found

Simulation of auditory analysis of pitch : an elaboration on the DWS pitch meter

N/A
N/A
Protected

Academic year: 2021

Share "Simulation of auditory analysis of pitch : an elaboration on the DWS pitch meter"

Copied!
11
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Simulation of auditory analysis of pitch : an elaboration on the

DWS pitch meter

Citation for published version (APA):

Scheffers, M. T. M. (1983). Simulation of auditory analysis of pitch : an elaboration on the DWS pitch meter. Journal of the Acoustical Society of America, 74(6), 1176-1725. https://doi.org/10.1121/1.390280

DOI:

10.1121/1.390280

Document status and date: Published: 01/01/1983 Document Version:

Publisher’s PDF, also known as Version of Record (includes final page, issue and volume numbers) Please check the document version of this publication:

• A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website.

• The final author version and the galley proof are versions of the publication after peer review.

• The final published version features the final layout of the paper including the volume, issue and page numbers.

Link to publication

General rights

Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain

• You may freely distribute the URL identifying the publication in the public portal.

If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please follow below link for the End User Agreement:

www.tue.nl/taverne Take down policy

If you believe that this document breaches copyright please contact us at: openaccess@tue.nl

(2)

Simulation of auditory analysis of pitch: An elaboration

on the DWS pitch meter

Michael T. M. Scheffers

Institute for Perception Research, P.O. Box 513, 5600 MB Eindhoven, The Netherlands

{Received 11 November 1982; accepted for publication 11 July 1983)

A model was developed for estimating the pitch of complex sounds that are partially masked by background sound. Our ultimate aim is to obtain a model that can separate two simultaneous sounds on the basis of the harmonic structure of at least one of the sounds. The MDWS model is an extension of the Duifhuis, Willems, and Sluyter pitch meter (DWS} [J. Acoust. See. Am. 71, 1568-1580(1982}] which is a practical implementation of Goldstein's optimum processor theory of pitch perception [J. Acoust. See. Am. 54, 1496-1516 (1973}]. The main modifications incorporated in MDWS consist of a more faithful modeling of auditory frequency analysis and of an alteration to the criterion used to decide which fundamental best fits a set of resolved

components. Effects of the latter modification were investigated in a comparison between model estimates of the pitch of inharmonic complex signals and results obtained for humans.

Furthermore, the accuracy of model estimates of the pitch of periodic signals (among which were synthesized vowel sounds}, partially masked by noise, was compared with the just noticeable difference of fundamental frequency of these sounds for human observers. The results of these two tests show that the model estimates come close to human perception.

PACS numbers: 43.66.Hg, 43.66.Ba [FLW]

INTRODUCTION

In many normal speech communication situations the voice of the speaker reaches the listeners' ears somewhat disturbed by background sound. In his classical study on this "cocktail party problem" Cherry {1953) found that human listeners have a surprising ability to separate the target sound from the background, even when the background sound is the voice of another speaker. As part of our research effort to investigate perceptual separation of simultaneous sounds by exploring the auditory and cognitive processes involved, an attempt will be made to model this human abi- lity. In this research we are primarily interested in the role of pitch differences between the two sounds in the separation process (e:g., Brokx and Nooteboom, 1982). For the model we therefore examined whether an existing model of pitch perception could be modified to serve our purpose. One of the most important characteristics of such a model should be that it is largely indifferent to signal components that are not harmonics of the target signal: simultaneous sounds will comprise many such spurious components viz. those of the second signal.

Most

models

are

related

to one

of three

recent

theories

of pitch perception, i.e., Wightman's pattern transformation theory (Wightman, 1973), Goldstein's optimum processor theory {Goldstein, 1973; Gerson and Goldstein, 1978), and Terhardt's virtual pitch theory {Terhardt, 1974, 1979, 1980). These three theories share the concept of a central formation of pitch on the basis of information derived from the fre- quencies of components resolved in the auditory frequency analysis. All three theories have at present reached a state in which they can be {or even have been) translated into a com- puter program. Goldstein's optimum processor theory is in our opinion the most promising for our purpose. The sto- chastic nature of this theory makes it most apt for quantita-

tive predictions of the pitch perceived in periodic signals. A practical implementation of this theory is made in the DW$ pitch meter ofDuifhuis et al. {1979, 1982). The occurrence of spurious components is explicitly mentioned and dealt with in their description of the determination of pitch on the basis of a set of aurally resolved components. Preliminary tests showed that this meter can successfully extract the pitch of speech signals at low signal-to-noise ratios. This means that it can discriminate between signal components and noisy ones, which can be considered as a first step in separating a signal from the background noise.

The DW$ meter, however, is optimized for the analysis of the pitch of normal speech sounds. It can make errors in determining the residue pitch of signals comprising only a few high {harmonic number above 5) harmonics. When a speech signal is disturbed by background sound, the lower harmonics might be masked. Some modifications are there- fore proposed to reduce these errors. They consist of{1) a more faithful modeling of the auditory frequency analysis, which also led to a different component detection procedure and {2) a modification to the decision on the "best fitting fundamental" of the set of resolved signal components..

This paper introduces a model--which will be referred

to as the MDWS model {Modified DWS)--which is an elaboration on the DWS meter. Results of a test of the per- formance of the model on complex signals comprising a few

high harmonics will be described. Second, a comparison will

be made between model estimates of the pitch of signals in a

noise background and human pitch perception. The results of a listening experiment on discrimination of fundamental frequency of periodic sounds in a noise background {Schetfers, 1982) will to this end be compared with the accu- racy of model estimates of the pitch of the experimental stim-

uli.

(3)

I. DESCRIPTION OF THE MDWS MODEL A. Introduction

Because MDWS is essentially a modification of the DWS meter, a brief outline of the latter will be given first,

followed by an argumentation for the proposed modifica-

tions.

Three stages can be discerned in the DWS procedure: (1) a frequency analysis of a short time frame of the acoustic signal, (2)detection of signal components, and (3)estimation of the fundamental frequency that optimally fits the set of

resolved component frequencies. According to the Gold- stein theory this fundamental frequency corresponds to the perceived pitch.

In the first stage a 128-point FFT amplitude spectrum is calculated for frequencies from 0 to 2.5 kHz of a 10- to 40-

ms frame of the signal. The resolution bandwidth of this

analysis is about 20 Hz.

In the second stage the spectrum is scanned for peaks

from low to high frequencies. Each peak is checked (a) to be

above an absolute threshold, (b) to be above a masked thresh- old that depends on the nearest accepted peak with a lower frequency, and (c) to have an acceptable shape. If a peak fulfills these conditions it is accepted as a signal component and a more accurate estimate of its frequency than provided by the FFT resolution is made by a quadratic interpolation on the three FFT points that define the peak. The estimated frequency values of the lowest six detected components form the input to the third stage.

In this

final

stage

a harmonics

sieve

sifts

out

thos•

com-

ponents that bear a harmonic relation to one another and

labels them with harmonic numbers. From the frequencies of the components that are accepted as harmonics and the harmonic numbers assigned to them, an optimum estimate is made of the best fitting fundamental frequency. This proce- dure will be more extensively described in Sec. I D.

As mentioned earlier, the DWS meter is optimized for a fast analysis of the pitch of speech signals. This necessitated some restrictions on, e.g., the frequency range, the maxi- mum number of resolved components, and the lowest accep- table harmonic number. Because of these restrictions the meter may make errors in estimating the residue pitch of signals consisting of only a few high harmonics. The reason for stressing this point is that similar problems arise in esti- mating the pitch of, for example, a vowel sound of which the lower harmonics are masked by another signal. A modifica- tion is therefore proposed in the harmonics sieve procedure to reduce these errors.

Secondly, the calculated masked threshold for a com- ponent is only determined in the DWS meter by the ampli- tude of the nearest detected component with a lower fre- quency. Masking is in reality a combined effect of all near components. It is at least more elegant if not more effective for the analysis to incorporate a more realistic simulation of auditory frequency analysis. Such a simulation would lead to fewer noisy components. It would also rule out the necessity of restricting the number of components to be detected be- cause high harmonics fuse in the auditory frequency analysis and are not separately detectable (cf. Plomp, 1964). For the

reasoning behind the choice of most parameter values for the

model the reader is referred to the paper by Duifhuis et al.

(1982).

B. Simulation of auditory frequency analysis

The signal is sampled at 10 kHz in the MDWS model, using a 12-bit •esolution. A 256-point FFT power spectrum from 0-5 kHz is calculated of a 40-ms frame of the signal which is band-pass filtered from 50 Hz-4.5 kHz, shaped by a Hamming window and supplemented with 11.2 ms of si- lence. The levels of this spectrum are roughly corrected for sensation levels by shifting the whole spectrum upwards or downwards according to the sensation level of components around 1000 Hz. Auditory frequency analysis is then simu- lated by convolving this spectrum with stylized pure tone excitation patterns. This procedure is similar to the one ap- plied in Wightman's pitch model (Wightman, 1973}.

The excitation patterns are derived from the masking patterns used by Terhardt in his virtual pitch model {Ter- hardt, 1979; Terhardt et al., 1982}. They have a triangular form on a dB versus log-frequency plot. For component fre- quencies above 400 Hz, the low-frequency slope equals 120 dB?oct and the high-frequency slope equals -- ( 108-Lc }dB? oct with Lc the component level in dB SL. For frequencies below 400 Hz these slopes are adjusted to obtain a constant bandwidth in Hz of the patterns reflecting the frequency- independent critical bandwidth for lower frequencies {cf. Zwicker and Feldtkeller, 1967}. The pattern of a pure tone is a convolution of the spectrum of the Hamming window and the stylized pattern. It is a little broader than the latter, has a rounded top, and a noise floor at about 43 dB {e.g., Harris, 1978} below the top. The bandwidth of a pattern will vary with the component level. For levels from 30 to 70 dB SL the equivalent rectangular bandwidth {ERB} for components with frequencies below 400 Hz is 70-120 Hz and the 3-dB bandwidth is 45-80 Hz. The ERB is 12%-24% of the fre- quency and the 3-dB bandwidth is 8%-16% for components with frequencies above 400 Hz. These properties are in close agreement with recent data on the width of the auditory filter {e.g., Patterson, 1976; Patterson and Nimmo-Smith, 1980; Pick, 1980}. The simulation is illustrated in Fig. 1.

Because the shape of a pattern is not constant but de-

pendent on the level and frequency of the component, it is

more correct to describe the procedure as replacing the pow- er level at each FFT point by the stylized excitation pattern

of a pure tone with the same frequency and a corresponding

sensation level. The excitation level at each point is then calculated by adding the power contributions of the excita- tion patterns for all other FFT points. Calculations are per- formed in floating point precision with a mantissa of 23 bits. The level resolution for the spectrum is therefore fully deter- mined by the resolution of the A?D conversion {12 bits}, yielding a value of less than -- 60 dB.

As usual in this kind of processing, phase is ignored because pitch perception seems to be largely ii•dependent of the phase relation between the individual components. How- ever, this is true only for those components that can be re- solved in the auditory frequency analysis. Because the corn-

(4)

9O 60 30 0 0 a o 4o t (ms) ! ! ! I 2 3 4 5 f (kHz) 9O 6O 3O i i i i i .5 I 2 5 10 2O f/fc

90

/ ....

d

0 I I I I 0 I 2 3 4 5 f (kHz)

FIG. 1. Schematic illustration of the construction of the masked spectrum.

The speech signal (a) is low-pass filtered at 5 kHz and sampled at 10 kHz. A 512-point FFT analysis is performed on a 40-ms frame, shaped by a Ham- ming window. The 256-point FFT power spectrum (b) is convolved with stylized excitation patterns (c), producing a masked power spectrum for fre-

quencies from 0 to 5 kHz (d).

ponents of a periodic sound are correlated, phase effects might influence the detectability of a component. In a sec- ond version of the model, therefore, the excitation patterns are provided with corresponding minimum-phase charac- teristics in a first order approximation of the phase behavior within the pattern (cf. Goldstein et al., 1971 }. After calculat-

ing both FFT amplitude and phase spectrum each--now

complex--point is replaced by its complex pattern. This leads to a set of amplitude and phase values at each individ- ual FFT sampling frequency f,. that reflect the influence of all components on the excitation waveform at this frequen- cy. The time course of the waveform is reconstructed from these values on the assumption that all components are (qua- si-} stationary sinusoids. Next, the rms value of the wave- form is calculated over ten cycles off,.. In this way the audi- tory system's increase in temporal resolution with increasing

frequency is included. Figure 2 depicts the spectrum for fre- quencies from 0-5 kHz constructed from the rms values of

these waveforms. 9O -- ß i i i

E

3O

0 i i I I 0 I 2 3 4 5 f (kHz)

FIG. 2. By convolving the amplitude and phase FFT spectra with stylized

excitation patterns to which minimum-phase characteristics are added, out- put waveforms of the simulated auditory filters can be constructed. The rms values of these waveforms, calculated over 10 cycles of the center frequency of each filter, are depicted for the same speech frame as in Fig. 1.

From a comparison between the power spectrum in Fig. l{d} and the rms spectrum of Fig. 2 it can be seen that there is only a minor difference between the two versions. It was therefore judged not to be necessary to incorporate phase for the analysis of the pitch of signals with a single periodicity. Preliminary tests on signals consisting of two simultaneous periodic sounds showed major differences between the spec- tra produced by the two versions. This is probably because the harmonics of each of the sounds in signals obtained by adding two sounds with different fundamental frequencies are correlated, whereas in general no correlation will exist between the harmonics of the one sound and those of the

other. This means that it might be useful to incorporate

phase in an extension of the model to extract pitches of si- multaneous sounds. It would also enable us to extract tem- poral information on the pitch from the excitation wave- forms.

Combination tones (CT's} are not incorporated in the standard version (without phase effects} of the model be- cause they will usually be masked in speech sounds by the lower harmonics. They are, however, introduced after the component detection stage for some signals such as de-

scribed in Sec. II A. IfCT's would be included in the version

of the model that incorporates phase effects, it should be done at the convolving stage. Because that version of the

model is not used in the tests to be described, no effort is

made for the moment to incorporate CT's there. C. Detection of components and estimation of their

frequencies

Because the excitation patterns are derived from mask-

ing patterns, an aurally resolvable component should pro-

duce a maximum in the convolved spectrum. In the detec-

tion stage that spectrum is therefore scanned for local

maxima and minima. The level of each maximum is com-

pared with the levels

of adjacent

minima and when both

peak-to-valley ratios exceed the--rather arbitrarily cho-

sen-value of 1 dB the maximum is considered as being

caused by the presence of a signal component. Through parabolic interpolation on the levels at the three FFT points

that form the maximum the peak frequency is estimated with

an accuracy which is an order of magnitude better than the

FFT resolution and more in accordance with that of the

auditory system (Goldstein, 1973}. The interpolation also

(5)

gives information about the peak shape so that very shallow peaks can be rejected because they are in general caused by more than one component. Maxima with a level more than 43 dB below the absolute maximum in the spectrum are also rejected because they cannot be distinguished from side lobes of the Hamming window used in the frequency analy- sis. Only the estimated frequency values of components re- solved in this way are retained for the pitch estimation stage.

This procedure leads to estimated frequency values which differ somewhat from the "real" component frequen- cies, owing to masking and the level of the component. These effects are•qualitatively at least--the same as the pitch shifts incorporated in the virtual pitch model (Terhardt, 1979). In view of the order of magnitude of these effects and their variance between subjects the qualitative presence is judged sufficient for MDWS.

D. Pitch estimation

In determining the pitch of the set of resolved compo- nents two problems arise. First of all, it must be decided whether a component is a genuine harmonic or a spurious component due to an interfering sound. Secondly, when a component is accepted as a harmonic, the corresponding fundamental must be found. In other words, the component must be given the correct harmonic number. For this the harmonics sieve procedure is introduced in the DWS meter. Below, a general description of this DWS procedure will be given, followed by a proposed modification.

The harmonics sieve has meshes at the harmonic fre- quenciesfn = nfo with n = 1 to N. Each mesh has a width of 8% of the frequency to which it corresponds in order to allow for frequency shifts up to 4%. Successive meshes are not allowed to overlap. This restricts the number N of meshes to 12. The set of harmonic numbers that best fits the set of resolved component frequencies, can be determined by using the sieve. To this end the sieve is successively set at a

number of positions in respect to the components. Each posi-

tion is fully characterized by the fundamental of the sieve which varies from 50-500 Hz. A step size between successive positions of 3% of the fundamental frequency is chosen so that there is a slight overlap to minimize the chance of a component being missed.

At each position {i) a criterion value is calculated for the match of the sieve in this position to the set of components.

This criterion is basically a measure of the difference

between the pattern of the sieve and that of the resolved components. The value is diminished for each component that passes a mesh and augmented {a) for each component that cannot pass a mesh {a spurious component) and {b) for each mesh through which no component passes {a missing harmonic}. The last decision cannot be taken for compo- nents with frequencies above the frequency to which the

12th mesh corresponds. These components are therefore dis- regarded in the criterion. The same is done for empty meshes above the highest one through which a component passes. The mathematics underlying the criterion are extensively

described by Duifhuis et al. {1982). The criterion • is given in

Eq. {1}.

C, = [ (Hi -- Pi ) + R , -- P, ) ] /P, . (1)

In this equation Pi equals the number of components that pass the sieve in this position i, Hi equals the harmonic num- ber of the highest mesh through which a component passes and R i equals the number of resolved components minus the number of components with a frequency above that of the 12th harmonic at position i. Note that (Hi -- Pi) equals the number of missing harmonics for the fundamental of the sieve in this position and that (Ri -- Pi )equals the number of components that are rejected as harmonics of that funda- mental. That position of the sieve which obtains the lowest criterion value is regarded as the best fit. Each signal compo- nent that passes a mesh of the sieve in this position is labeled with the harmonic number of that particular mesh. A more accurate estimate of the best fitting fundamental than the fundamental of the sieve at the best fitting position, is then calculated by means of the maximum likelihood estimate of Eq. (2)(Goldstein, 1973).

• Nj•j__•

fo

= •l Xj

j=

N;.

{2)

In this equation Xj equals the estimated frequency of the component that passes the mesh with harmonic number Nj

and P equals the number of components that are accepted as harmonics.

The criterion described above, works well on signals in which three or more low harmonics can be detected. How- ever, errors occur on signals in which a number of low har- monics are missing, e.g., typical psychoacoustical stimuli, "telephone" speech, or vowel sounds that are partially masked by noise. It can be seen from Eq. {1} that for such signals a higher pitch than the correct one is favored by the criterion in order to reduce the number of empty meshes. The C value of a complex comprising the harmonics 6, 7, and 8 for example, equals 1.67 for the fundamental but 1.5 for the octave. These errors can be avoided by incorporating not only the highest harmonic number {H } assigned to a compo- nent, but also the lowest one {L }, and considering the maxi- mal number of harmonics to be found: M = H- L + 1. To indicate the difference with respect to the criterion of the DWS meter a "quality measure" Qi of a fit is introduced at

this point. 2 It is given in Eq. {3}. Now, that position which

1 .8 ci .2 o 50 I 2 3 4 56789.. I I ! I I I I i i i i II 'i i i i i i I I I I I i

!

II

.•o ,c;o 200 i I I !!111;111111 I I I I I I I I I I I

I Ill,

5•o ,ooo 2o•o 5ooo

f (Hz) 1OO 200 500

p

FIG. 3. The harmonics sieve procedure. For 80fo values between 50 and 500

Hz {p), the mesh frequencies of the sieve are matched to the frequencies of

components resolved from the spectrum of Fig. l(d). A quality measure of

the fit is calculated for each fo value. The normalized quality values

Qmax) are drawn in the left-hand panel. The fo candidate that obtains the highest Q value is considered to be the "best fit." Three settings of the sieve

(6)

obtains the highest Q value is considered as the best fit.

Q, = Pi /(M, + R, ). (3)

Figure 3 gives an illustration of the matching of the sieve to a set of component frequencies. The normalized Q values (Qi/Qmax ) are shown in the left-hand graph.

In most cases only one optimal fit of the sieve is found. When the signal is heavily disturbed, however, two or more fits of the sieve sometimes obtain the highest Q value. The lowest estimate is in these cases taken for the pitch. The other estimates, however, are also available in the output of this stage because pitch ambiguities can also occur in the

perception in such conditions. The model does not make an

estimate of the residue pitch when less than three compo- nents can be detected but produces the estimated frequencies of those components.

II. TESTING THE MODEL

A. Residue

pitch of inharmonic

signals

In order to test the performance of MDWS on signals comprising only a few harmonics, the pitch shift experiment by Schouten et al. (1962) was simulated on the computer. Stimuli in this experiment consisted of an AM signal with a modulation frequency fm of 200 Hz. Pitch matches were recorded as a function of the cartier frequencyfc which var- ied from 1200 to 2400 Hz. Because it is generally assumed that aural combination tones affect the pitch perceived in this type of stimuli and cause the so-called second effect of pitch shift (Goldstein, 1973), the first two odd-order combi-

nation tones 2f• -f2 and 3fl -- 2f2 were computed after the

component detection stage of the model and added to the set of resolved components. Results of the simulation are shown in Fig. 4 together with the regression lines from Schouten et al. (1962, Fig. 2). When two or more positions of the sieve receive the maximum Q value only the lowestfo estimate is normally produced by the model. Because we are in this case looking for pitch ambiguities, allfo estimates based on fits of

i ! 240 fmod: 200 Hz

220 /

200 180

160

, ,' ,

1.2 1.4 1.6 1.8 2.0 2.2 2.4 carrier frequency (kHz)

FIG. 4. Model estimates of the pitch of AM tones as a function of the carrier

frequency and pitch matchings by human observers. The lines in this figure are the regression lines from the data for listeners of Schouten et al. (1962, Fig. 2). The dots give the estimates of the best fitting fundamental by

MDWS.

the sieve that received the same (highest) Q measure are plot- ted.

It should be mentioned that other "high quality" fits were generally found for pitch values near the edges of the sawtooth pattern in this figure. Schouten et al. (1962) also reported dusters of pitch matchings in those regions. The results show that the errors that DWS would make for car- rier frequencies above 1400 Hz are avoided by the use of the new quality measure for a fit. The estimates by MDWS ap- pear to be in close agreement with the experimental data. The deviations from the averaged data obtained for humans are of the same order of magnitude as the differences between the individual results.

B. Analysis of the pitch of periodic signals in a noise background

To test the performance of MDWS in noisy conditions, pitches of stimuli from an experiment on the discrimination by human listeners of fundamental frequency of periodic sounds in noise (Scheffers, 1982) were analyzed by the mod- el. The stimulus set of this experiment contained synthesized vowel sounds and pulse trains with fundamental frequencies of 75, 150, and 300 Hz, and pure tones with frequencies of

150, 300, and 1000 Hz. The signals were masked by pink noise (3 dB/oct attenuation). For the verification of the mod- el the means and standard deviations of pitch estimates by MDWS were determined. They were calculated from pitch estimates of 20 different presentations of the same stimulus at a fixed S/N ratio. This was done separately for the three vowel sounds and the pulse train at five S/N ratios, viz. 20, 15, 10, 5, and 0 dB S/Nlo (the signal-to-noise ratio measured in a 10% band centered at the frequency where the highest 10%-band signal level was found, cf. Scheffers, 1982). The sd's will be compared with the just noticeable differences of

fundamental

frequency

(jndso)

for humans

that were

mea-

sured in the listening experiment. 3 A similar comparison will

be made between the frequency estimates of the pure tones

by the component detection stage and the jnds's as a func-

tion of S/N ratio obtained for listeners.

After preliminary tests it was decided to modify the model further to bring it more into line with human percep- tion. First, the accuracy with which human observers per- ceive characteristics such as the pitch and the loudness of a stationary sound increases with the duration of the signal (e.g., Cardozo, 1962; van den Brink, 1964; Henning, 1970). In the case of pitch this integration of information can be observed up to a duration of about 200 ms. The stationary part of the signals (which had a duration of 180 ms) was therefore divided in seven 40-ms frames, which overlapped by 20 ms. The "auditory" power spectra of these seven frames were computed and averaged. The component detec- tion was performed on this averaged spectrum.

Second, it became clear that the accuracy of the model in estimating the component frequencies and thus the resi-

due pitch was too good compared with that of human ob-

servers [see also Fig. 5(a)]. Goldstein (1973)has incorporated this accuracy in his theory by including the uncertainty in the conveyance of information on the component frequen- cies through stochastic channels. Similarly, an internal noise

(7)

factor was therefore added to the frequency values of detect- ed components in the present model. A random frequency value was drawn for each component from a Gaussian distri- bution with the estimated frequency of the component as mean and a standard deviation calculated from the sd (f) function of Eq. (4)(Goldstein, 1973, Fig. 7).

sd(f) = (f/10) ø'5 (4)

(sd and fin Hz).

Third, it is very probable that subjects used a priori knowledge on the pitch, because presentations started in the experiment at a high S/N ratio and the fundamentals were always in a small region during a run. Apart from the pitch estimation described in Sec. I D, the model therefore also made pitch estimates restricting the candidate fundamentals to regions from 10% below to 10% above the three stimulus fundamentals used viz. 75, 150, and 300 Hz. When analyzing signals of which the fundamental is not specifically known beforehand, the pitch estimate of the previous frame could, e.g., be used as the center frequency of such a "window." The width of the wind, ow could be chosen on the basis of the Q value for that estimate (as a measure of its reliability) such as is done in the DWS meter in a similar way. Therefore, the application of these preference regions will henceforth be re- ferred to as tracking. The value of 10% is based on two find- ings: (1) On the results of the listening experiments which indicated thatfo differences greater than about 5 % could be discriminated as soon as the signals were just detectable, and (2) on the fact that if two or more candidate fundamentals obtained the highest Q value, they always differed more than 12% from each other. The present value avoids the necessity of a decision what to do with such multiple estimates. It sets at the same time a theoretical upper limit of 6% (for a uni- form distribution of estimates) to the relative sd's to be found, which is in good agreement with the perceptual ef-

fects mentioned before. It is mentioned here that a value,

larger than 12% would in general yield a few more estimates to be found for the lower S?N ratios. This would result in a somewhat greater sd, but would also give rise to multiple estimates. I found in a pilot study that multiple estimates occurred in about 5% of the signals if a value of 15% was chosen and in 15% for a value of 20%.

The standard deviations of the pitch estimates of the pulse train are plotted in Fig. 5 (filled symbols, solid lines}

together

with the corresponding

jndœo'S

obtained

for human

listeners (open symbols, dashed lines}. The data points are connected with lines only if the average number of pitch estimates on the basis of which the sd was calculated was at least 15. A number near a data point indicates the number of estimates on which the sd is based if this number does not equal 20. Figure 5(a} shows the results for estimates without the internal noise factor and without tracking. Figure 5(b} gives the results in the condition where internal noise was added to the component frequencies but no tracking was applied. In Fig. 5{c} the results are plotted in the condition where internal noise was added and tracking was used. For Fig. 5(a} and (b} the pitch estimates for each S/N ratio were sorted out on "the same pitch" on the basis of a 10% toler-

ance of the value. 4 Thus, 75.3 and 77.9 for example were

considered as estimates of the same pitch, while e.g., 73.8 and 112.5 (a typical MDWS error} were considered as esti- mates of different pitches. The mean and the standard devi- ation of each subdivision of estimates obtained in this way were calculated. From the first results it appeared that the sd for the same set of stimuli could vary by a factor of about 1.5, depending on the actual values of the internal noise. This was partly due to the fact that the pitch estimates were not al- ways based on the same set of component frequencies be- cause of the internal noise. Each measurement was therefore repeated 20 times, each time with different samples of the internal noise for each detected component. The sd's plotted in Fig. 5{b} and {c} are the averages of these 20 measurements.

The results show that the sd's are far smaller than the

10x16 2 .2 .05 .02 a b c

i f q, 'pulse

', ' ß o: 75 Hz

train

• • '"• '• lSOHz

ß []: 300 Hz xx' , 15 15 ... listeners - -. model i S/N10 ratio (dB)

with internal noise -5 0 5 I I 20

\ \18 I •

xx x I xx,

\x i 'x

with tracking with internal noise

i i

-; •) ; 10 15 2'0

S/N10 ratio (dB)

FIG. 5. Comparison between jnd/o of pulse

trains in noise for listeners (dashed lines,

open symbols) and sd's of model predictions of pitch (solid lines, closed symbols). Panel (a) gives the model predictions without the

addition of internal noise to the frequencies

of resolved components and without the ap- plication of tracking. In panel (b)the average

sd's of pitch estimates are plotted in the con-

dition where internal noise is added to the

frequencies of resolved components but no tracking is used. Panel (c) gives the average sd's of pitch estimates when internal noise is added and tracking is applied. A number near a data point indicates on how many es-

(8)

jnd/o'S

f(•r

humans

when

no

internal

noise

factor

is added.

The sd's

come

close

to the

jnd/o'S

when

the uncertainty

in the

internal representation of the component frequencies is tak- en into account. The S/Nlo ratio at which MDWS can still make a reasonable amount of "correct" estimates lies then about 9 dB above the S/Nlo ratio where human listeners can only discriminate fo differences greater than 5 %. The use of tracking for the pitch estimates lowers this limit by about 6

dB.

Figure 6 shows the sd's of pitch estimates of the three vowel sounds in the condition when internal noise was added and tracking was applied. Each sd is again the average of 20

measurements.

The results show that the model cannot make pitch esti- mates for the vowel sounds at low S/Nlo ratios. At higher S/

Nlo ratios

a conformity

between

the sd's

and the

jnd•o'S

can

be seen for the vowel/a/with fundamental frequencies of 150 and 300 Hz and for the vowel/u/with fundamentals of

75 and 150 Hz. For the vowel/i/, however, the sd's differ an

order of magnitude from thejnd's. Even at the higher S/Nlo

ratios the fundamental of this vowel cannot always be esti- mated by the model.

In discussing the experimental results obtained with lis- teners (Scheffers, 1982) it was argued that for the vowel sounds in particular, there were two other cues besides resi- due pitch on the basis of which the subjects could discrimi- nate the differences in fundamental frequency. First, the pitch of a strong low-frequency harmonic could be domi- nant. Second, due to the fact that the sampling frequency at which the vowel sounds were generated was varied to obtain differences in fundamental frequency, the spectral envelope of the signal was stretched when the fundamental frequency was increased. This could result in a perceptible change of the vowel quality.

For the vowel/u/the pitch of the strong 300-Hz com- ponent appeared to be dominant. The change in this pitch was probably used by the subjects to discriminate the differ- ences in fundamental frequency below an S/N•o of about 20 dB. For the vowel/i/the pitch of a harmonic in the first formant {225 Hz) or the change in the spectral envelope in the region of the second to fourth formant (2280-3200 Hzl could be used as a clue. For the vowel/a/the pitch of a harmonic in the first formant {750 Hz) was probably used at

low S/N•o ratios. I therefore altered the model to produce the estimated frequencies of the detected components as well. These values were again sorted out on "estimate of the same component" with a 10% tolerance. The means and sd's of these subdivisions were calculated. This of course raises a problem for the high harmonics that differ less than 10% in frequency. This problem could only partly be solved, viz. when a number of successive harmonics was consistently detected in those cases.

A comparison between the sd's calculated from Eq. (4)

and of the jnd• of pure tones (e.g., Moore, 1973) reveals a

difference of an order of magnitude. This reflects the as- sumption that the component frequency analysis takes place at a lower level in the auditory system than the more central processing of pitch. The sd of the internal noise factor was therefore altered to that of Eq. (5) which is estimated on

jnd/'s for pure tones

measured

by Harris (1952),

Walliser

(1968), Moore (1973), and Wier et al. (1977) (for a review see Hoekstra, 1979).

sd(f) = 0.03 xf ø'6

(sd and fin Hz). ß

The sd's of the frequency estimates of the pure tones and of the most dominant component of the vowel sounds

are plotted in Fig. 7 together with the jnd•'s for humans. The

component that was detected most often at each S/N ratio was taken as the most dominant for a vowel sound. The pure tones were also analyzed at S/N ratios near subjective masked threshold in order to gain insight on the detection thresholds for the model.

If we consider the results for the pure tones first, we see that the model can detect the signal consistently down to a few dB above masked threshold (indicated on the top hori-

zontal

axis).

The sd's

follow

a curve

similar

to thejnd/s, but

are almost exactly a factor of 2 smaller.

Table I gives the number of harmonics, that are consis- tently detected by MDWS in each of the 20 stimuli used for determining the sd's for the vowel sounds. The table also gives the number of harmonics that are detected in at least 15 of the 20 stimuli and the frequency of the most dominant harmonicfa, viz. the harmonic that was detected most often down to the lowest S/Nlo ratios.

10x1C• = 1

,(•,,

vowel/U/

•:\ 14 11 ß o: 75 Hz *•,: 150 Hz ß [•: 300 Hz listeners model S/N10 ratio (dB) with tracking with internal noise

FIG. 6. Comparison between jnd•,, of vowel

sounds in noise for listeners (dashed lines,

open symbols) and the average sd's of pitch

estimates by MDWS (solid lines, closed symbols). Pitches are estimated in the condi-

tion where internal noise is added and track-

ing is applied. A number near a data point

indicates from how many estimates the sd is

calculated on average if not from 20.

(9)

10X1(• 2 5 o

'• 2

-a 1

•.5

.2 10X1(• 2

vowel

It]/

g ,•,

vowel

lil

? •

vowel/Cl/

i:[•

'•

11•,

',

16 ' ' ' ' o 2o -5 5 m.t. pure tone ', ', •=f. 150Hz 5/N10 ratio(dB) ' ,, ', m•:f- 3OOHz 11' •7• '• * •= f =1OOO Hz' i i i i i i 0 -5 0 5 10 15 2 S/N10 ratio (dB)

FIG. 7. Comparison between jnd/of pure

tones in noise for listeners (dashed lines, open symbols) and sd's of frequency esti- mates by MDWS (solid lines, closed sym-

bols) and between jnd/o of vowel sounds in

noise and sd's of frequency estimates by

MDWS of the most dominant harmonic of

the vowels. The vertical lines near the top horizontal axis of the graph for the pure tones indicate subjective masked thresholds (m.t.) of these signals. The number near a data point indicates on how many estimates

the sd is based if not 20.

The 300-Hz harmonic was found to be the most domi- nant for the vowel/u/. The sd's of the frequency estimates of this harmonic follow a curve, more or less parallel to the experimental results. There is a difference of about a factor of 2 between the sd's and the jnd's, as is also found for the pure tones.

A similar observation can be made for the 225-Hz har- monic of the vowel/i/with an fo of 75 Hz. The 300-Hz harmonic is the most dominant for the other two fundamen- tals. The sd's of the frequency estimates of this harmonic come close to the jnd's. The typical difference of a factor of 2 between the model predictions and the experimental results

cannot, however, be observed for these two fundamentals.

The 750-Hz harmonic was detected most often by the model for the vowel/o/with fundamental frequencies of 75 and 150 Hz. The 900-Hz harmonic appeared to be dominant for the 300-Hz fundamental. Again the sd's of the frequency estimates of these components differ by about a factor of 2 from the results obtained for listeners.

III. DISCUSSION

The MDWS model was developed from the DWS pitch meter. Two main modifications were proposed in this paper, viz. a better approximation of auditory frequency analysis by convolving a high-resolution FFT spectrum with stylized excitation patterns and a modification to the cri,terion used to decide on the "best fitting fundamental" of a set of re- solved components. This second modification was tested in a comparison between model estimates of the pitch of inhar- monic complexes comprising three components and results obtained for humans. The results of this test show that errors

induced by the absence of low harmonics are indeed re-

moved. The model estimates are comparable to human per- ception. It is noteworthy that this practical model gives the same estimates as were predicted by Goldstein (1973) on a more theoretical basis. A limited test on normal speech ut- terances showed only minor differences between predictions

by DWS and by MDWS.

TABLE I. Number of harmonics of the vowel sounds that are detected by MDWS in each of the 20 stimuli (above the oblique), or in at least 15 of the 20 stimuli (below the oblique) and the frequency fa of the most dominant harmonic.

vowel /u/ /i/ /a/

fo in Hz 75 150 300 75 150 300 75 150 300 20 dB S/Nio 2/3 2/4 2/2 1/2 2/5 2/5 2/2 5/5 5/5 15 dB S/N•o 1/3 2/3 1/2 1/1 2/3 2/3 2/2 2/4 4/4 10 dB S/N•o 1/1' 1/2 1/1 1/1 0/1 2/3 1/2 1/3 3/3 5 dB S/Nio 1/1 1/1 1/1 1/1 0/1 0/1 1/1 1/1 2/3 0 dB S/N•o 0/1 !/1 0/1 0/1 0/0 0/1 1/1 1/1 1/2 fa in Hz 300 300 300 225 300 300 750 750 900

(10)

The performance of the frequency detection stage of

MDWS was investigated

by a comparison

between

the jndf

of pure tones in noise for humans and the sd's of the frequen- cy estimates of these signals by the model. The results of this test show that MDWS can identify pure tones in noise al- most as far as subjective masked threshold. We consider the detection stage to be a good simulation of the auditory analy- sis at least for the signal durations that are used, particularly because in the listening experiment the noise onset preceded the signal onset by 300 ms, which facilitated the detection for the listener but was of no advantage for the model. The dif- ference of a factor of 2 between the sd's of the model esti-

mates

of the frequency

of the pure tones

and the jndf's for

humans are solely owing to the choice of the amount of inter-

nal noise.

In view

of the variance

ofjnd• measurements

(see

Hoekstra, 1979} one should not attribute too much impor- tance to this difference. As for the effects of internal masking {the masking of a signal component by other components} the reader is referred to the papers by Terhardt { 1979, 1982).

The results for the pitch estimates of the pulse trains and the vowels show that when the signal spectrum contains a number of harmonics which {a} are above the calculated masked threshold, {b} have successive harmonic numbers, and {c} lie within the existence region {Ritsma, 1962, 1963}, the model can successfully predict the residue pitch and the

sd's

of the model

estimates

come

close

to the jndfo'S

for lis-

teners. These conditions are met down to S?N•o ratios near

subjective

ma•ked

threshold

for pulse

trains.

Higher

S?N•o

ratios are required for vowel sounds to bring enough compo- nents above'masked threshold so that the residue pitch can be estimated, because of the peaked spectrum of these sig- nals. Residue pitch can therefore be estimated down to lower S?N•o ratios for a vowel with a rather fiat spectrum like the vowel/o/than for the vowels/u/and/i/which have a more peaked spectrum or a fiat part at frequencies outside the existence region.

One should note that even when the subjects in the ex- periment could perceive the residue pitch of a signal at a certain S?N•o ratio, this does not necessarily mean that they used this pitch to discriminate the presented difference in fundamental frequency. In fact, subjects often reported per- forming the discrimination task on the pitch of a prominent harmonic rather than on the residue pitch in particular for the vowel sounds at low S?Nlo ratios. The sd's of the esti- mates by MDWS of the frequencies of dominant harmonics show that the results for the listeners can indeed often be predicted on that basis.

The results indicate two points which may need further investigation. First, the rather strong dependence of the sd values on the actual value of the internal noise. It appeared that the internal noise could be so strong that a harmonic was wrongly rejected by the harmonics sieve. This might be caused by the "all-or-nothing" decision of the sieve•viz. the shape of the meshes•r by the width of the meshes {espe- cially for low frequencies}. In view of the sd values of the internal representation of component frequencies as calcu- lated from Eq. {5), one could argue that the meshes are too narrow for harmonics with frequencies below 250 Hz. Sec- ond, the results for the vowels with a fundamental frequency

of 75 Hz are poorer than were expected from the perceptual impressions. It is possible that the internal masking effects in the model are too strong. This may be caused by the use of a

convolution followed by the component detection stage, as

opposed to a detection stage followed by the incorporation of masking effects as is done in Terhardt's virtual pitch model {Terhardt, 1979; Terhardt et al., 1982}.'The convolution adds 10 to 20 Hz to the bandwidth of the excitation patterns which may result in too poor a resolving power for low fre- quencies. It is also possible that the level dependence of the high-frequency slope of the excitation patterns is too great.

IV. CONCLUSIONS

The modifications of the DWS pitch meter (Duifhuis et

al., 1979, 1982) incorporated in MDWS have led to a practi- cal model that can predict the pitch perceived by human listeners in normal speech signals equally as well as in typical psychoacoustical stimuli comprising only a few high har- monics.

The performance of MDWS on pulse trains that are partially masked by noise comes close to human perception. When tracking is applied, the model can successfully esti- mate the pitch of these signals down to about 3 dB above the S?N ratio where listeners apparently can just perceive resi- due pitch. Standard deviations of these pitch estimates are

comparable

with the

jndfo'S

for humans.

Pure

tones

in noise

can be identified down to about subjective masked threshold. The comparison between the sd's of estimates by MDWS of the residue pitch of vowel sounds in noise and the

jnd•o'S

of these

signals

for humans

appears

to be quite

com-

plex. Depending on the vowel sound, the model can predict the pitch of these signals down to 10 to 30 dB above masked

threshold.

It appeared

that

the

jnd•o'S

at lower'

S?N

ratios

could in many cases be predicted on the basis of frequency estimates of a prominent low-frequency harmonic. This agrees well with perceptual impressions reported by the lis-

teners

in the experiment

where

jndfo'S

were

measured.

The success MDWS has in discriminating between sig- nal components and noisy components leads us to believe that it will be a powerful tool in our study into the perceptual separation of simultaneous speech sounds.

ACKNOWLEDGMENTS

I would like to thank H. Duifhuis, S. G. Nooteboom,

and R. D. Patterson for their stimulating support and for their assistance in the preparation of this paper. This re- search was supported by the Netherlands Organization for the Advancement of Pure Research (Z.W.O.) through the Netherlands Psychonomics Foundation, grant number 15-

31-011.

lActually, Ci -- (Hi + Ri)/Pi is used in the DWS meter. This reduced form is mathematically simpler, but functionally equal to that of Eq. (1). The latter version is considered to be more transparent because it gives a direct

reference to the fit parameters.

2During the preparation of this paper, a similar solution was published by Sluyter, Kotmans, and Claasen [Proc. ICASSP 82, 188-191 (1982)]. 3The sd's should in fact be compared with x/• times the jndœ. In view of the

(11)

fact that the amount of internal noise that had to be added to the frequen- cies of resolved components to bring the model predictions into line with human perception is only a first approximation, this factor is neglected in the comparison.

4I realize that this is also a form of tracking. It was, however, necessary' in

order to separate the two or more "optimal fits" that can occur for the same

stimulus {see I D}. It also suppresses the enormous increase of the sd as a result of a single octave error. The value of 10% was chosen for reasons given at the end of the previous paragraph. The frequency with which the estimates were compared was in this case not fixed but was gradually built up during each series of measurements by adjusting it to the average of the estimates that were previously accepted.

Brokx, J.P. L., and Nooteboom, S. G. (1982}. "Intonation and the percep-

tual separation of simultaneous voices," J. Phonetics 10, 23-26. Cardozo, B. L, (1962). "Frequency discrimination ofthe human ear," Proc.

Fourth Int. Congress on Acoustics, 11-14.

Cherry, E. C. (1953). "Some experiments on the recognition of speech with

one and two ears," J. Acoust. Soc. Am. 25, 975-979.

Duifhuis, H., Willems, L. F., and Sluyter, R. J. (1979). "Pitch in speech: a hearing theory approach," in Speech Communication Papers 97th ASA

Meeting, edited by J. J. Wolf and D. H. Klatt (Acoustical Society of America, New York), pp. 245-248; J. Acoust. Soc. Am. Suppl. 1 65, S67 11979}.

Duifhuis, H., Willems, L. F., and Sluyter, R. J. (1982). "Measurement of pitch in speech: An implementation of Goldstein's theory ofpitch percep-

tion," J. Acoust. Soc. Am. 71, 1568-1580.

Gerson, A., and Goldstein, J. L. 11978•. "Evidence for a general template in central optimal processing for the pitch of complex tones," J. Acoust.

Soc. Am. 63, 498-510.

Goldstein, J. L. 11973•. "An optimal processor theory for the central forma-

tion of pitch of complex tones," J. Acoust. Soc. Am. 54, 1496-1516. Goldstein, J. L., Baer, T., and Kiang, N.Y.S. (1971). "A theoretical treat-

ment of latency, group delay and tuning characteristics for auditory- nerve responses to clicks and tones," in Physiology of the Auditory System,

edited by M. B. Sachs INational Educational Consultants, Baltimore), pp.

133-141.

Harris, F. J. {1978}. "On the use of windows for harmonic analysis with

Discrete Fourier Transform," Proc. IEEE 66, 51-83.

Harris, J. D. (1952). "Pitch discrimination," J. Acoust. Soc. Am. 24, 750-

755.

Henning, G. B. (1970}. "A comparison of the effects of signal duration on

frequency and amplitude discrimination," in Frequency analysis and peri-

odicity detection in hearing, edited by R. Plomp and G. F. Smoorenburg (Sijthoff, Leiden}, pp. 350-361.

Hoekstra, A. (1979}. "Frequency discrimination and frequency analysis in

hearing, a psyehophysieal study ofsome aspects ofthe normal and abnor-

mal auditory system," Doctoral dissertation, University of Groningen. Moore, B.C. J. (1973}. "Frequency difference limen of short-duration

tones," J. Acoust. Soc. Am. 54, 610-619.

Patterson, R. D. (1976}. "Auditory filter shapes derived with noise stimuli,"

J. Aeoust. Soc. Am. 59, 640-654.

Patterson, R. D., and Nimmo-Smith, I. (1980). "Oil-frequency listening and auditory filter asymmetry," J. Acoust. Soc. Am. 67, 229-245.

Pick, G. F. (1980}. "Level dependence of psyehophysieal frequency resolu-

tion and auditory fil{er shape," J. Acoust. Soc. Am. 68, 1085-1095.

Plomp, R. (1964}. "The ear as a frequency analyzer," J. Aeoust. Soe. Am.

36, 1628-1636.

Ritsma, R. J. (1962}. "Existence region of the tonal residue I," J. Aeoust.

Soc. Am. 34, 1224-1229.

Ritsma, R. J. (1963}. "Existence region of the tonal residue II," J. Aeoust.

Soc. Am. 35, 1241-1245.

Seheffers, M. T. M. (1982}. "Discrimination of fundamental frequency of synthesized vowel sounds in a noise background," submitted for publica-

tion.

Schouten, J. F., Ritsma, R. J., and Lopes Cardozo, B. (1962}. "Pitch of the

residue," J. Acoust. Soc. Am. 34, 1418-1424.

Terhardt, E. (1974}. "Pitch, consonance, and harmony," J. Aeoust. Soe.

Am. 55, 1061-1069.

Terhardt, E. (1979). "Calculating virtual pitch," Hearing Res. 1, 155-182.

Terhardt, E. (1980}. "Toward understanding pitch perception: problems,

concepts and solutions," in Psychophysical, physiological and behaoioral

studies in hearing, edited by G. van den Brink and F. A. Bilsen (Delft

U.P., Delft}, pp. 353-360.

Terhardt, E., Stoll, G., and Seewann, M. (1982}. "Algorithm for extraction of pitch and pitch salience from complex tonal signals," J. Acoust. Soe.

Am. 71, 679-688. ,

van den Brink, G. {1964}. "Detection of tone pulse of various durations in

noise of various bandwidths," J. Acoust. Soc. Am. 36, 1206-1211.

Walliser, K. {1968}. "Zusammenwirken yon Hiillkurvenperiode und Ton- heit bei der Bildung der Periodentonh6he," Doctoral dissertation, Uni- versity of Miinchen.

Wier, C. C., Jesteadt, W., and Green, D. M. {1977}. "Frequency discrimina- tion as a function of frequency and sensation level," J. Acoust. Soc. Am.

64, 178-184.

Wightman, F. L. {1973}. "The pattern-transformation model of pitch," J.

Acoust. Soc. Am. 54, 407-416.

Zwicker, E. C., and Feldtkeller, R. {1967}. "Das Ohr als Nachrichtenemp- fdnger {Hirzel, Stuttgart}, 2nd ed.

Referenties

GERELATEERDE DOCUMENTEN

The tweet fields we will use as features for the Bayes model are: timezone, user location, tweet language, utc offset and geoparsed user location.. When a field is empty we ignore

Het is wel van belang voor de praktijk, omdat het model wordt gebruikt bij diverse beleidsevaluaties.

geeft een auteur om een bepaalde reden het materiaal (nog) niet tot op het niveau van sub species een naam, maar slechts tot op het niveau van soort, terwijl het materiaal niet

We bepalen de optimale toepassing van substraten voor iedere gewenste teelt en bieden onze klanten en partners een platform voor de uitwisseling van kennis en ervaring bij het

Compared to older same sex drivers, both male and female young drivers in Europe report more frequently to engage in various traffic infringements, a preference for higher

Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of

La fouille de la maison de fief d' Azy illustre la modestie des feudataires unis aux families seigneuriales successives du bail1'4ge de Chassepierre et le déclin

de nieuwe drieslag reduceert Nederlands tot een vaardighedenvak. 2) Dan (2) de kennis: waar is de vakinhoud bij Nederlands, met name de taalwetenschappelijke en