Binaural multi-channel Wiener filtering for hearing aids:

(1)

Binaural multi-channel Wiener filtering for hearing aids:

Preserving interaural time and level differences

1,2 T.J. Klasen, ¹ Simon Doclo, ^1,2 Tim Van den Bogaert, ¹ Marc Moonen, ² Jan Wouters

1 KU Leuven ESAT, Kasteelpark Arenberg 10, Leuven ² KU Leuven Lab. ORL, Kapucijnenvoer 33, Leuven tklasen@esat.kuleuven.be

Introduction

•Hearing impaired persons localize sounds better without their bilateral hearing aids than with them.

•Current hearing aids are not designed to preserve localization cues

•Advantages of preserving localization cues – Visual cues ⇒ Improvement in intelligibility – Spatial separation ⇒ Improvement in intelligibility

Interaural localization cues

•Interaural time difference (ITD)

– ITD is difference in arrival of signal between ears – ITD cues reside in low frequencies < 1500Hz

•interaural level difference (ILD) – ILD is intensity difference between ears – ILD cues reside in high frequencies > 3000Hz

State of the art

•Binaural Wiener filter ⇒ Preserves speech ITD cues

•Controlled binaural Wiener filter ⇒ Preserves noise ITD cues at cost of noise reduction

•Extended cost function includes ITD and ILD terms ⇒ Iter- ative optimization techniques

System model

Speaker

Hearing aid user

Noise θ φ

YL0(ω) · · · YL_M−1(ω) YR0(ω) · · · YR_M−1(ω)

ZR1(ω) ZL0(ω)

WL(ω) WR(ω)

•Signals received at the mth microphone pair Y L

m

(ω) = X L

m

(ω)

| {z } Speech

+ V L

m

(ω)

| {z } N oise

Y R

m

(ω) = X R

m

(ω)

| {z } Speech

+ V R

m

(ω)

| {z } N oise

•2M-dimensional signal vector Y(ω) =

Y L

0

(ω) . . . Y L

_M−1

(ω)Y R

0

(ω) . . . Y R

_M−1

(ω) T Y (ω) = X(ω) + V(ω)

•Left and Right2M-dimensional filters W (ω) =

W _L (ω) W _R (ω)

=

"

W L

0

(ω) . . . W L

2M−1

(ω) T

W R

0

(ω) . . . W R

2M−1

(ω) T

#

Interaural transfer function (ITF)

•Input and Output ITFs (speech and noise) IT F X

des

= X L

0

X R

0

IT F V

out

(W) = W ^H _L V W ^H

R V

•Desired ITFs of the speech and noise components – In function of the desired angles θ X and θ V , and fre-

quency, ω

IT F X

des

= HRT F X

L

(ω, θ X ) HRT F X

L

(ω, θ X ) – As original ITFs

IT F X

des

= E n X L

0

X _R ^∗

₀

o E n

X R

0

X _R ^∗

₀

o IT F V

des

= E n V L

0

V _R ^∗

₀

o E n

V R

0

V _R ^∗

₀

o

•Preserve binaural cues ⇒ original ITFs as desired ITFs

Binaural Wiener filtering

•Original cost function J(W) = E



 

 

 

 

X L

0

− W ^H _L X X R

0

− W ^H _R X

2 | {z }

Speech Distortion + µ

W ^H _L V W ^H

R V

2 | {z }

Residual N oise



 

 

 

 

•Goal: Output speech and noise parallel to desired ITFs

R I

IT F^Vdes 1

W^HLV WH

RV

k to IT F^Vdes

1

WL^HV WR^HV

⊥ to IT FVdes 1

WL^HV WR^HV

•Add ITF terms to cost function minimize perpendicular part J(W) = E

(

X L

0

− W ^H _L X X R

0

− W ^H _R X

2 + µ

W ^H L V W ^H _R V

2 | {z }

Original SDW Cost F unction +

α

W ^H L X W ^H

R X

⊥ 2

+ β

W ^H L V W ^H

R V

⊥ 2

| {z }

Additional IT F T erms )

•Rewrite using definition of the cross product J (W) = E

(

X L

0

− W ^H _L X X R

0

− W ^H _R X

2 + µ

W ^H L V W ^H

R V

2 +

α W ^H

L X − IT F X

des

W ^H R X

2 IT F X

des

1 2 + β

W ^H

L V − IT F V

des

W ^H R V

2 IT F V

des

1

2 ) .

•Take derivative of J(W), set to zero, and solve for W W =

E

R _R

X

+ µR R

V

+ αR R

XC

+ βR R

V C

−1 E

r _X

where, r _X =

"

X _L ^∗

₀

X X ^∗ _R

₀

X

# R _X = XX ^H R _V = VV ^H

R _R

X

=

R _X 0 _2M 0 _2M R _X

R _R

V

=

R _V 0 _2M 0 _2M R _V

R _R

XC

=

R _X −IT F _X ^∗

des

R _X

−IT F X

des

R _X |IT F X

des

| ² R _X

R _R

V C

=

R _V −IT F _V ^∗

des

R _V

−IT F V

des

R _V |IT F V

des

| ² R _V

Simulations

Setup

• T ₆₀ = 0.76 sec, f s = 16 kHz, and FFT size = 256

•HINT speech at 345 degrees and HINT noise at 60 degrees

•Input SNR Left 2.8dB Right -6.8dB

•GN ReSound Canta behind the ear hearing aids on CORTEX MK2 artificial head

•Varied α and β from 0 to 100 with µ = 1 Performance measures

•ITD Error (N bins < 1500Hz) 1

N X N

i=1

1 − cos

6 E n X L

0

X R ^∗

0

o

− ⁶ E n

W ^H _L X (W ^H R X ) ^∗ o

•ILD Error (All N bins) 1 N

X N

i=1

10 log ₁₀ P L

in

(ω i )

P R

in

(ω i ) − 10 log ₁₀ P L

out

(ω i ) P R

out

(ω i )

•Improvement in speech intelligibility weighted signal-to- noise-ratio (SNR INT )

SNR _INT = X J

j=1 w j SNR j

Results

0 50 100 0

50 100 0

0.1 0.2 0.3 0.4 0.5

alpha ITD Error Speech Component

beta

ITD Error

0 50 100 0

50 100 0

0.1 0.2 0.3 0.4 0.5

alpha ITD Error Noise Component

beta

ITD Error

0 50 100 0

50 100 0

2 4 6 8 10 12

alpha ILD Error Speech Component

beta

ILD Error (dB)

0 50 100 0

50 100 0

2 4 6 8 10 12

alpha ILD Error Noise Component

beta

ILD Error (dB)

0 50 100 0

50 100 9

10 11 12 13 14

alpha Output Intelligibility Weighted SNR Left Microphone

beta

Intelligibility Weighted SNR (dB)

0 50 100 0

50 100 9

10 11 12 13 14

alpha Output Intelligibility Weighted SNR Right Microphone

beta

Intelligibility Weighted SNR (dB)

Conclusions

• Extended binaural Wiener filter two ITF terms

• Weights control emphasis speech and noise ITFs and noise reduction

• Preserve speech and noise ITFs ⇒ ITD and ILD cues and

improvement in signal-to-noise ratio.

Binaural multi-channel Wiener filtering for hearing aids: