• No results found

Frequency-domain Criterion for Frequency-domain Criterion for Speech Distortion Weighted Speech Distortion Weighted Multi-Channel Wiener Filtering Multi-Channel Wiener Filtering

N/A
N/A
Protected

Academic year: 2021

Share "Frequency-domain Criterion for Frequency-domain Criterion for Speech Distortion Weighted Speech Distortion Weighted Multi-Channel Wiener Filtering Multi-Channel Wiener Filtering"

Copied!
17
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Frequency-domain Criterion for Frequency-domain Criterion for

Speech Distortion Weighted Speech Distortion Weighted Multi-Channel Wiener Filtering Multi-Channel Wiener Filtering

Simon Doclo1, Ann Spriet1,2, Marc Moonen1, Jan Wouters2

1Dept. of Electrical Engineering (ESAT-SCD), KU Leuven, Belgium

2Laboratory for Exp. ORL, KU Leuven, Belgium

HSCMA-2005, 17.03.2005

(2)

Overview Overview

• Adaptive beamforming: GSC

o Not robust against signal model errors

• Spatially-preprocessed SDW-MWF:

o Increase robustness of adaptive stage by taking speech distortion into account

o Implementation: stochastic gradient algorithms o Frequency-domain criterion

o Experimental validation in hearing instruments

• Audio demonstration

• Conclusions

(3)

3

Hearing instruments Hearing instruments

• Hearing problems effect more than 10% of population

Digital hearing instruments allow for advanced signal processing, resulting in improved speech understanding

Major problem: (directional) hearing in background noise

o reduction of noise wrt useful speech signal o multiple microphones + DSP in BTE

o current systems: simple fixed and adaptive beamforming o robustness important due to small inter-microphone distance

hearing aids and cochlear implants

design of robust multi-microphone noise reduction scheme

Introduction

Adaptive beamforming

Experimental validation

Audio demo

Conclusions

(4)

GGSC = Adaptive MVDR-beamformerSC = Adaptive MVDR-beamformer

Avoids speech distortion

Relies on assumptions

known mic characteristics, known speaker position, no reverberation

Speech distortion !

distorted speech + noise

speech

+ noise

Filter w1 Filter w2

+

- -

Spatial pre-processor

(Fixed beamforming) Adaptive stage

(Adaptive Noise Canceller)

speech + noise speech reference

noise

noise references

Violated in practice

G

speech leakage+

Minimises output noise power

0 2

[ ] min[ ] [ ] [ ] [ ] output noise power

T

k k v k    k k

w w w v



Introduction

Adaptive beamforming -GSC

-SP-SDW-MWF -Implementation

Experimental validation

Audio demo

Conclusions

(5)

5

Robustness against model errors Robustness against model errors

• Spatial pre-processor and adaptive stage rely on

assumptions that are generally not satisfied in practice:

o Distortion of speech component in speech reference o Leakage of speech into noise references, i.e.

]

0[k x 0 x[k]

Speech component in output signal gets distorted ]

[ ] [ ]

[ ]

[k x0 k k k

zx wT x

• Design of robust noise reduction algorithm:

1. Reduce speech leakage contributions in noise references:

Robust fixed spatial filter [Nordebo 94, Doclo 03]

Adaptive blocking matrix [Van Compernolle 90, Hoshuyama 99, Herbordt 01]

Estimate relative acoustic transfer functions [Gannot 01]

2. Reduce effect of present speech leakage:

Only update adaptive filter during low-SNR periods/frequencies

Quadratic inequality constraint, leaky LMS [Cox 87, Claesson 92, Tian 01]

Take speech distortion explicitly into account, SDW-MWF [Spriet 04]

Introduction

Adaptive beamforming -GSC

-SP-SDW-MWF -Implementation

Experimental validation

Audio demo

Conclusions

(6)

Design of robust adaptive stage Design of robust adaptive stage

• Distorted speech in output signal:

Robustness: limit by controlling adaptive filter

] [ ] [ ]

[ ]

[k x0 k k k

zx wT x ]

[ ] [k k

T x

w w[k]

o Quadratic inequality constraint (QIC-GSC):

= conservative approach, constraint  f(amount of leakage)

]

[k w

o Take speech distortion into account in optimisation criterion (SDW-MWF)

1/ trades off noise reduction and speech distortion (1/ = 0  GSC, 1/ = 1  MMSE estimate)

Regularisation term ~ amount of speech leakage

 

0 2

 

2

]

[ 1 [ ] [ ]

] [ ] [ ]

[

min E v k T k k E T k k

k w v w x

w

noise reduction speech distortion

Limit speech distortion, while not affecting noise reduction performance in case of no model errors  QIC

Introduction

Adaptive beamforming -GSC

-SP-SDW-MWF -Implementation

Experimental validation

Audio demo

Conclusions

(7)

8

Implementation Implementation

Algorithms:

o Recursive matrix-based (GSVD, QRD) – too expensive

o Stochastic gradient algorithms (time vs. frequency domain)

Stochastic gradient algorithm (time-domain):

o Cost function

results in LMS-based updating formula

 

0 2

1

[ ] [ ]

2

] [ ] [ ]

[ )

( E v k k k E k k

J w wT v wT x

0

[ 1] [ ] [ ] 1

[ ] [ ]

[ ] T[ ] [ ] T [ ]

k k k v k k k k k k

    

w w v v w x x w

regularisation term Classical GSC

o Practical computation of regularisation term using data buffers o Reduce complexity by frequency-domain implementation [Spriet 04]

 Still large memory requirement due to data buffers

o Memory reduction by approximating FD regularisation term [Doclo 04]

Introduction

Adaptive beamforming -GSC

-SP-SDW-MWF -Implementation

Experimental validation

Audio demo

Conclusions

(8)

Frequency-domain criterion (1) Frequency-domain criterion (1)

• Extension of block-based frequency-domain criterion for multi-channel AEC [Benesty 01, Buchner 03]

• Set derivative wrt time-domain filter coefficients w to zero

 normal equations in FD

0 0

[ ] (1 ) m m i H[ ] [ ] 1 (1 ) [ ] [ ]

f v v

m m i H

x x x x

i

v v

i

J m i i i i

 e ee e

[ ] [ ], [ ] 0[ ] [ ] [ ]

[ ] [ ], [ ] [ ] [ ]

T

v L v v

T

x L x x

m m e k v k k k

m m e k k k

  

e F e w v

e F e w x

,2

1

, 10

2

1 [ ]

1 [ ]

[ ] [ 1] (1 ) [ ]

[ ] [ ] [ ]

v

H v

H

x x L

v L

x m

m m

m m m

m

m

  

Q

D e

w w G Q

D e

Recursive algorithm (details cf. book “Speech Enhancement”)

Introduction

Adaptive beamforming -GSC

-SP-SDW-MWF -Implementation

Experimental validation

Audio demo

Conclusions

(9)

10

Frequency-domain criterion (2) Frequency-domain criterion (2)

• Practical calculation of regularisation term  averaging [ ]m 1 x[m] [m 1], x[m] y[m] v[m]

r Q w Q Q Q

• Approximations for reducing the computational complexity:

o Approximate and by block-diagonal (or diagonal) correlation matrices :

 (block-)diagonal matrices can be easily inverted

 Ensure that is positive-definite:

eigenvalues of (block-)diagonal matrix can be easily computed

v[ ]m

Q Qy[ ]m

[ ] [ 1] (1 ) H[ ] [ ]/ 2

y m y m   y m y m

Q Q D D

[ ] [ ] [ ]

x m y m v m

Q Q Q

o Constrained vs. unconstrained update :

 corresponds to setting derivate wrt frequency-domain filter coefficients to zero

10

2NL / 2

G I w

,2

[ ]m w[m 1] (1)Λ[ ]m DHv [ ]m ev L[ ]m r[ ]m w

Introduction

Adaptive beamforming -GSC

-SP-SDW-MWF -Implementation

Experimental validation

Audio demo

Conclusions

(10)

Experimental results Experimental results

Configuration

3-mic BTE on dummy head (d = 1cm, 1.5cm)

Speech source in front of dummy head (0)

5 speech-like noise sources: 75,120,180,240,285

Gain mismatch = 4dB at 2nd microphone

Noise 1 Reverberation time

= 500 msec

H.A.

Noise 3

Noise 3 Noise 4

Noise 5

mic 1 mic 2

mic 3

2

Introduction

Adaptive beamforming

Experimental validation -Performance -Complexity

Audio demo

Conclusions

(11)

12

• Improvement in speech intelligibility

Performan

Performance measuresce measures

speech

noise

f

SNRi - [dB]

intellig

SNR iSNRi

i

I

Importance of i-th band for speech intelligibility [dB]

• Speech distortion

[dB]

f

SDi - [dB]

input speech

output speech

intellig

SD iSDi

i

I

Introduction

Adaptive beamforming

Experimental validation -Performance -Complexity

Audio demo

Conclusions

(12)

Experimental validation (1) Experimental validation (1)

SDR-GSC (unconstrained update)

o Results after convergence (L=32, =0.5, =0.995, BD/D stepsize) o GSC (1/ = 0) : degraded performance if significant leakage

o 1/ > 0 increases robustness (speech distortion  noise reduction)

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

4 5 6 7

1/

SNR [dB]

SDR-GSC (N=2), unconstrained update, = 0.50, = 0.9950

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 5 10 15

1/

SD [dB]

Algo 2 (U-BD), no mismatch Algo 4 (U-D1), no mismatch Algo 2 (U-BD), mismatch Algo 4 (U-D1), mismatch

GSC

Introduction

Adaptive beamforming

Experimental validation -Performance -Complexity

Audio demo

Conclusions

(13)

14

Experimental validation (2) Experimental validation (2)

Convergence behaviour:

o Convergence speed: block-diagonal step size > diagonal step size o large   fast convergence

o large   slow convergence, better performance upon convergence

2 4 6 8

2 3 4 5 6 7

= 0.99500

= 0.10

Algo 2 (U-BD) Algo 4 (U-D1)

2 4 6 8

2 3 4 5 6

7 = 0.50

Algo 2 (U-BD) Algo 4 (U-D1)

2 4 6 8

2 3 4 5 6 7

= 0.99875

Algo 2 (U-BD) Algo 4 (U-D1)

SNR (dB), SDR-GSC (N=2), L = 32, 1/ = 0.5, no mismatch

2 4 6 8

2 3 4 5 6 7

Algo 2 (U-BD) Algo 4 (U-D1)

Introduction

Adaptive beamforming

Experimental validation -Performance -Complexity

Audio demo

Conclusions

(14)

Complexity + memory Complexity + memory

Parameters: M = 3 (mics), N = 2 (a), N = 3 (b), L = 32, fs = 16kHz, Ly = 10000

Computational complexity:

Memory requirement:

Algorithm Complexity (MAC) MIPS

QIC-GSC (FD) (3M-1)FFT + 16M - 9 2.16

SDW-MWF (FD-buffer) (3N+5)FFT + 30N + 10 3.22(a), 4.27(b) SDW-MWF (FD-matrix-diag) (3N+2)FFT + 8N2 + 13N 2.46(a), 3.89(b) SDW-MWF (FD-matrix-BD) (3N+2)FFT + 14N2 + 10N + 12 2.94 (N=2 !)

Algorithm Memory kWords

QIC-GSC (FD) 4(M-1)L + 6L 0.45

SDW-MWF (FD-buffer) 2NLy + 6LN + 7L 40.61 (a), 60.80 (b) SDW-MWF (FD-matrix-all) 4LN2 + 6LN + 7L 1.12 (a), 1.95 (b)

Complexity and memory comparable to QIC-GSC

Introduction

Adaptive beamforming

Experimental validation -Performance -Complexity

Audio demo

Conclusions

(15)

16

Algorithm No deviations Deviation (4dB) Noisy microphone signal

Speech reference Noise reference

Output GSC (1/ = 0)

Output SDR-GSC (1/ = 0.5)

Audio demonstration Audio demonstration

Introduction

Adaptive beamforming

Experimental validation

Audio demo

Conclusions

(L=32, =10, =0.99875, block-diagonal stepsize, unconstrained update)

(16)

Spatially pre-processed SDW-MWF:

o Take speech distortion explicitly into account  improve robustness of adaptive stage

o Encompasses GSC and MWF as special cases

Implementation:

o Stochastic gradient algorithms in time- and frequency-domain o Frequency-domain criterion: block-based processing  natural

derivation of different adaptive algorithms

o Block-diagonal vs. diagonal, constrained vs. unconstrained o Comparable implementation cost as QIC-GSC

Experimental results:

o SP-SDW-MWF achieves better noise reduction than QIC-GSC, for a given maximum speech distortion level

o Faster convergence speed for block-diagonal step size matrix

Conclusions Conclusions

Introduction

Adaptive beamforming

Experimental validation

Audio demo

Conclusions

Referenties

GERELATEERDE DOCUMENTEN

It was previously proven that a binaural noise reduction procedure based on the Speech Distortion Weighted Multi-channel Wiener Filter (SDW-MWF) indeed preserves the speech

While the standard Wiener filter assigns equal importance to both terms, a generalised version of the Wiener filter, the so-called speech-distortion weighted Wiener filter (SDW-WF)

In this paper, a multi-channel noise reduction algorithm is presented based on a Speech Distortion Weighted Multi-channel Wiener Filter (SDW-MWF) approach that incorporates a

In this paper, a multi-channel noise reduction algorithm is presented based on a Speech Distortion Weighted Multi-channel Wiener Filter (SDW-MWF) approach that incorporates a

In this paper we establish a generalized noise reduction scheme, called the Spatially Pre-processed Speech Distortion Weighted Multi-channel Wiener filter (SP-SDW-MWF), that

Wouters, “Stochastic gra- dient based implementation of spatially pre-processed speech distortion weighted multi-channel Wiener filtering for noise reduction in hearing aids,”

In this paper we establish a generalized noise reduction scheme, called the Spatially Pre-processed Speech Distortion Weighted Multi-channel Wiener Filter (SP-SDW-MWF), that

This paper presents a variable Speech Distortion Weighted Multichannel Wiener Filter (SDW-MWF) based on soft out- put Voice Activity Detection (VAD) which is used for noise reduction