Korea Maritime and Ocean University

Detection of Boundaries between Unvoiced Consonants and Noise using Histogram

Title
히스토그램을 이용한 무성자음과 잡음의 경계 추출 (Extraction of the Boundaries between Unvoiced Consonants and Noise Using a Histogram)
Alternative Title
Detection of Boundaries between Unvoiced Consonants and Noise using Histogram
Author(s)
朴正任
Issued Date
2001
Publisher
Graduate School, Korea Maritime and Ocean University
URI
http://kmou.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=000002174104
http://repository.kmou.ac.kr/handle/2014.oak/10846
Abstract
Voice activity detection (VAD), which separates the voice regions of an input speech signal from its silence or noise regions, is an indispensable pre-processing step in continuous speech recognition, speech coding, and noise estimation and reduction. Many successful studies have addressed continuous speech in noiseless environments or isolated words in noisy environments, but few VAD methods exist for continuous speech in heavy noise. Because unvoiced consonants have characteristics very similar to those of noise, performing VAD and the subsequent noise estimation and removal without special attention to unvoiced consonants may seriously distort them.


In this dissertation, assuming that the voiced sound regions have already been removed by a method developed in our lab, we propose a method that explicitly extracts the boundaries between unvoiced consonant regions and noise regions so that VAD can be performed more accurately. The proposed method is based on a frequency-domain histogram, which Hirsch used successfully for noise estimation, and on a similarity measure between the frequency components of adjacent frames. To evaluate the method, unvoiced consonant boundary detection experiments were carried out on noisy speech signals at 10 dB and 15 dB SNR. Across all seven kinds of noise, the overall rate of correct extraction was approximately 90%. The proposed algorithm could be used for VAD in speech recognition and speech coding, as well as for noise estimation and reduction in heavy noise environments.
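The two ingredients named in the abstract can be illustrated with a minimal sketch. This is not the thesis's algorithm: the abstract gives no formulas, so the function names, the bin count, the use of cosine similarity as the similarity measure, and the 0.9 threshold are all assumptions made for illustration. The input is assumed to be a list of per-frame magnitude spectra that have already been computed.

```python
import math
from typing import List, Sequence


def histogram_mode(values: Sequence[float], n_bins: int = 20) -> float:
    """Return the center of the most populated histogram bin.

    Assumption: the noise level in one frequency band is taken as the
    peak of the histogram of that band's magnitudes over recent frames.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return lo
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in values:
        # Clamp so that the maximum value falls into the last bin.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    peak = counts.index(max(counts))
    return lo + (peak + 0.5) * width


def noise_spectrum(frames: Sequence[Sequence[float]], n_bins: int = 20) -> List[float]:
    """Estimate a noise magnitude for each frequency band."""
    n_freq = len(frames[0])
    return [histogram_mode([f[k] for f in frames], n_bins) for k in range(n_freq)]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Hypothetical similarity measure between two magnitude spectra."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def candidate_boundaries(frames: Sequence[Sequence[float]], threshold: float = 0.9) -> List[int]:
    """Return indices of frames whose spectrum differs from the previous frame.

    Assumption: a boundary candidate is any frame whose similarity to its
    predecessor drops below an illustrative threshold.
    """
    return [
        i + 1
        for i in range(len(frames) - 1)
        if cosine_similarity(frames[i], frames[i + 1]) < threshold
    ]
```

Calling `noise_spectrum(frames)` gives a per-band noise estimate, and `candidate_boundaries(frames)` flags frames where the spectral shape changes abruptly. A real system would combine the two cues and tune the bin count and threshold on labeled data.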
Appears in Collections:
Department of Computer Engineering > Thesis
Files in This Item:
000002174104.pdf

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
