한국해양대학교

KMOU Repository 한국해양대학교 대학원 컴퓨터공학과 Thesis

Detailed Information

Metadata Downloads

히스토그램을 이용한 무성자음과 잡음의 경계 추출

Title: 히스토그램을 이용한 무성자음과 잡음의 경계 추출

Alternative Title: Detection of Boundaries between Unvoiced Consonants and Noise using Histogram

Author(s): 朴正任

Issued Date: 2001

Publisher: 한국해양대학교 대학원

URI: http://kmou.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=000002174104
http://repository.kmou.ac.kr/handle/2014.oak/10846

Abstract: Voice activity detection(VAD), which separates the voice region from silence or noise region of input speech signal, is one of the indispensable pre-processing steps in continuous speech recognition, speech coding and noise estimation/reduction etc. While many successful researches were conducted continuous speech in noiseless environment or for isolated words in noisy environment, there are few method of VAD for continuous speech in heavy noise environment. Since unvoiced consonant signals have very similar characteristics to those of noise signals, it may result in serious distortion of unvoiced consonants to estimate and remove the noise components if voice activity detection and thereafter noise estimation/removal is carried out without paying special attention on unvoiced consonants.

In this dissertation, assuming that the voiced sound regions are removed by a method developed in our lab, we propose a method to explicitly extract the boundaries between unvoiced consonant region and noise region so that more exact VAD could be performed. The proposed method is based on histogram in frequency domain which was successfully used by Hirsch for noise estimation, and also on similarity measure of frequency components between adjacent frames. To evaluate the performance of the proposed method, experiments on unvoiced consonant boundary detection was carried out on noisy speech signals of 10dB and 15dB SNR. For all seven kinds of noised, the overall rate of correct extraction resulted in approximately 90%. The proposed algorithm could be used for VAD for speech recognition and speech coding as well as for noise estimation and reduction in heavy noise environments.

Appears in Collections:: 컴퓨터공학과 > Thesis

Files in This Item:: 000002174104.pdf Download

메타데이터 전체 보기

qrcode

트윗하기

OAK

ywm85@kmou.ac.kr Tel: 051-410-4085

KMOU Repository는 국립중앙도서관 OAK Repository 보급사업으로 구축되었습니다.

한국해양대학교

Detailed Information

Browse