• No results found

1 Binaural hearing aids

N/A
N/A
Protected

Academic year: 2021

Share "1 Binaural hearing aids"

Copied!
1
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Extension of the multi-channel Wiener filter with localization cues for noise

reduction in binaural hearing aids

S. Doclo, R. Dong, T. Klasen, J. Wouters, S. Haykin, M. Moonen

Katholieke Universiteit Leuven, Belgium - Dept. Elec. Engineering (SCD), Lab. Exp. ORL McMaster University, Canada - Adaptive Systems Laboratory

1 Binaural hearing aids

• Hearing impairment → reduction of speech intelligibility in background noise signal processing to selectively enhance/extract useful speech signal

many hearing impaired are fitted with a hearing aid at both ears

• Binaural auditory cues:

Interaural Time Difference (ITD) - Interaural Level Difference (ILD)

binaural cues, in addition to spectral and temporal cues, play an important role in binaural noise reduction and sound localization

• Bilateral system: independent processing → binaural cues are not preserved

• Binaural system: cooperation between left and right hearing aid Objectives/requirements for binaural processing:

1. SNR improvement: noise reduction, limit speech distortion

2. Preservation of binaural cues to exploit binaural hearing advantage 3. No assumptions about position of speech source and microphones

2 Overview of binaural noise reduction techniques

• Fixed beamforming: spatial selectivity + preservation of binaural speech cues maximize directivity index while restricting ITD error [Desloge, 1997]

superdirective beamformer using HRTFs [Lotter, 2004]

⊖ broadside array, limited performance, assumptions about geometry

• Adaptive beamforming: based on Generalized Sidelobe Canceller structure

divide frequency spectrum: low-pass portion unaltered to preserve binaural cues (ITD), high-pass portion processed using GSC [Welker, 1997]

⊖ low-pass: no noise reduction, high-pass: no preservation of binaural cues TF-GSC: minimize output energy, constraint: speech component in output

signal = speech component in reference mic signal (both ears) [Gannot, 2001]

⊖ binaural noise cues may be distorted

• Multi-channel Wiener filter (MWF) [Doclo, Spriet, Klasen, Wouters, Moonen]

MMSE estimate of speech component in reference mic signal at both ears → binaural speech cues are preserved, binaural noise cues may be distorted

Extension of MWF: preservation of binaural speech and noise cues without significantly compromising noise reduction performance

3 Binaural multi-channel Wiener filter

Z 1 (ω) Z 0 (ω)

W 0 (ω)

Y 1 ,0 (ω)

Y 1 ,1 (ω)

W 1 (ω)

Y 0 ,M

0

− 1 (ω) Y 1 ,M

1

− 1 (ω) Y 0 ,0 (ω)

Y 0 ,1 (ω)

• Configuration: microphone array at left and right hearing aid Y 0,m (ω) = X 0,m (ω) + V 0,m (ω), m = 0 . . . M 0 − 1

Y (ω) = h Y 0,0 (ω) . . . Y 0,M 0 −1 (ω) Y 1,0 (ω) . . . Y 1,M 1 −1 (ω) i T

• Cooperation between hearing aids: use all available microphone signals to

generate output signal at both ears → computation of filters W 0 (ω) and W 1 (ω) Z 0 (ω) = W 0 H (ω)Y(ω), Z 1 (ω) = W 1 H (ω)Y(ω), W (ω) =

"

W 0 (ω) W 1 (ω)

#

• SDW-MWF: estimate speech component in reference microphone signal at both ears; additional trade-off between noise reduction and speech distortion

J SDW,0 = E n |X 0,r 0 − W 0 H X | 2 o + µ 0 E n |W H 0 V | 2 o

J SDW (W) = J SDW,0 + J SDW,1 = P + W H RW − W H r − r H W ⇒ W SDW = R −1 r

P = P 0 + P 1 , r =

"

r x0 r x1

#

, R =

"

R x + µ 0 R v 0 M

0 M R x + µ 1 R v

#

, R x = R y − R v

• Binaural speech cues are generally preserved, noise cues may be distorted

4 Preservation of binaural noise cues

4.1 Partial estimation of noise component [Klasen, 2005]

• MMSE estimate of sum of speech component and scaled noise component J ¯ M SE,0 (W 0 ) = E n |(X 0,r 0 + λ 0 V 0,r 0 ) − W 0 H Y | 2 o

⊖ considerable reduction of noise reduction performance 4.2 Extension of SDW-MWF with binaural cues

• Add term related to ITD and ILD cue of noise component to SDW cost function J tot (W) = J SDW (W) + β |IT D out (W) − IT D des | 2

| {z }

J IT D (W)

+γ |ILD out (W) − ILD des | 2

| {z }

J ILD (W)

→ link computation of filters W 0 and W 1

• ITD: phase of cross-correlation between two signals

output: E{Z v0 Z v1 } = W 0 H R v W 1 → input: s = E{V 0,r 0 V 1,r 1 } = R v (r 0 , r 1 )

Cost function: cosine of phase difference φ(W) between cross-correlations J IT D (W) = 1 − cos  φ(W)  = 1 − s R (W 0 H R v W 1 ) R + s I (W 0 H R v W 1 ) I

q s 2 R + s 2 I q (W H 0 R v W 1 ) 2 R + (W 0 H R v W 1 ) 2 I

• ILD: power ratio of two signals output: E{|Z E{|Z v0 | 2 }

v1 | 2 } = W W 0 H H R v W 0

1 R v W 1 → input: P = E{|V E{|V 0,r0 | 2 }

1,r1 | 2 } = R R v (r 0 ,r 0 )

v (r 1 ,r 1 )

J ILD (W) =

"

W H

0 R v W 0 W H

1 R v W 1 − P

# 2

• Other possibility: specify desired angle θ v and use HRTFS:

s(ω) = HRTF 0 (ω, θ v ) HRTF 1 (ω, θ v ), P = |HRTF 0 (ω, θ v )| 2 /|HRTF 1 (ω, θ v )| 2

• Estimate noise cross-correlation/power during noise-dominated segments

• No closed-form expression for solution → iterative optimization techniques

• For a single noise source, controlling ITD and ILD corresponds to controlling Interaural Transfer Function (ITF) → interpretation for multiple noise sources ?

5 Experimental results

• Binaural recordings on KEMAR, 2 microphones at each ear (d = 1 cm)

• Speech source in front (0 ), multi-talker babble noise source at 45

• SNR = 0 dB, T 60 = 125 ms, f s = 16 kHz, FFT-size N = 256, µ 0 = µ 1 = 1

• Performance measures: SNR improvement (left/right), mean ITD and ILD cost function (speech/noise component)

• Partial estimation of noise component: ITD and ILD cost function of speech and noise components decrease, SNR improvement is significantly degraded

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

0 2 4 6 8 10 12

λ

∆SNR [dB]

Left ear Right ear

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

−4

−3.5

−3

−2.5

−2

−1.5

−1

−0.5 0

λ J ITD [dB]

Noise component Speech component

• Extension with ITD/ILD cost function: ITD and ILD cost function of noise component decrease, SNR improvement is practically not compromised

0.5 0 1.5 1

2

0 0.2 0.4 0.6 0.8

1 8.5

9 9.5 10 10.5

β

SNR improvement right ear

γ

∆SNR [dB]

0.5 0 1.5 1

2

0 0.2 0.4 0.6 0.8

1

−2.5

−2

−1.5

−1

−0.5 0

β

ITD cost function − noise component

γ J ITD [dB]

• Future work: better perceptual performance measures, tuning of frequency-

dependent weight factors β and γ, multiple noise sources

Referenties

GERELATEERDE DOCUMENTEN

It was previously proven that a binaural noise reduction procedure based on the Speech Distortion Weighted Multi-channel Wiener Filter (SDW-MWF) indeed preserves the speech

Klasen, 1 Simon Doclo, 1,2 Tim Van den Bogaert, 1 Marc Moonen, 2 Jan Wouters. 1 KU Leuven ESAT, Kasteelpark Arenberg 10, Leuven 2 KU

o Multi-channel Wiener filter (but also e.g. Transfer Function GSC) speech cues are preserved noise cues may be distorted. • Preservation of binaural

o Independent processing of left and right hearing aid o Localisation cues are distorted. RMS error per loudspeaker when accumulating all responses of the different test conditions

– Binaural cues, in addition to spectral and temporal cues, play an important role in binaural noise reduction and sound localization. (important to preserve

BINAURAL MULTI-CHANNEL WIENER FILTERING The multi-channel Wiener filter (MWF) produces a minimum mean- square error (MMSE) estimate of the speech component in one of the

Suboptimal techniques either using the front contralateral micro- phone signal or the output of a monaural MWF are presented, to- gether with an iterative distributed MWF scheme

(2001a) to predict the spectral integration rates for subcritical bandwidths for binaural conditions with IA or with a reduction in masker correlation suggests that the processing