Single-Microphone Speech Dereverberation: Modulation Domain Processing and Quality Assessment

dc.contributor.authorZheng, Chenxien
dc.contributor.departmentElectrical and Computer Engineeringen
dc.contributor.supervisorChan, Wai-Yip Geoffreyen
dc.date2011-07-20 03:18:30.021
dc.date.accessioned2011-07-25T18:27:09Z
dc.date.available2011-07-25T18:27:09Z
dc.date.issued2011-07-25
dc.degree.grantorQueen's University at Kingstonen
dc.descriptionThesis (Master, Electrical & Computer Engineering) -- Queen's University, 2011-07-20 03:18:30.021en
dc.description.abstractIn a reverberant enclosure, acoustic speech signals are degraded by reflections from walls, ceilings, and objects. Restoring speech quality and intelligibility from reverberated speech has received increasing interest over the past few years. Although multiple channel dereverberation methods provide some improvements in speech quality/ intelligibility, single-channel dereverberation remains an open challenge. Two types of advanced single-channel dereverberation methods, namely acoustic domain spectral subtraction and modulation domain filtering, provide small improvement in speech quality and intelligibility. In this thesis, we study single-channel dereverberation algorithms. Firstly, an upper bound of time-frequency masking (TFM) performance for dereverberation is obtained using ideal time-frequency masking (ITFM). ITFM has access to both the clean and reverberated speech signals in estimating the binary-mask matrix. ITFM implements binary masking in the short time Fourier transform (STFT) domain, preserving only those spectral components less corrupted by reverberation. The experiment results show that single-channel ITFM outperforms four existing multi-channel dereverberation methods and suggest that large potential improvements could be obtained using TFM for speech dereverberation. Secondly, a novel modulation domain spectral subtraction method is proposed for dereverberation. This method estimates modulation domain long reverberation spectral variance (LRSV) from time domain LRSV using a statistical room impulse response (RIR) model and implements spectral subtraction in the modulation domain. On one hand, different from acoustic domain spectral subtraction, our method implements spectral subtraction in the modulation domain, which has been shown to play an important role in speech perception. On the other hand, different from modulation domain filtering which uses a time-invariant filter, our method takes the changes of reverberated speech spectral variance along time into account and implements spectral subtraction adaptively. Objective and informal subjective tests show that our proposed method outperforms two existing state-of-the-art single-channel dereverberation algorithms.en
dc.description.degreeM.A.Sc.en
dc.identifier.urihttp://hdl.handle.net/1974/6609
dc.language.isoengen
dc.relation.ispartofseriesCanadian thesesen
dc.subjectSpeech enhancementen
dc.subjectDereverberationen
dc.subjectSpeech signal processingen
dc.titleSingle-Microphone Speech Dereverberation: Modulation Domain Processing and Quality Assessmenten
dc.typethesisen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Zheng_Chenxi_201107_Master_Of_Applied_Science.pdf
Size:
679.99 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.64 KB
Format:
Item-specific license agreed upon to submission
Description: