한국해양대학교

Login Library

KMOU Repository 한국해양대학교 대학원 제어계측공학과 Thesis

Detailed Information

Metadata Downloads

상대 엔트로피를 이용한 음성 특징벡터의 변별적 변환에 관한 연구

DC Field	Value	Language
dc.contributor.author	유강주	-
dc.date.accessioned	2017-02-22T06:17:42Z	-
dc.date.available	2017-02-22T06:17:42Z	-
dc.date.issued	2009	-
dc.date.submitted	56905-02-20	-
dc.identifier.uri	http://kmou.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=000002175129	ko_KR
dc.identifier.uri	http://repository.kmou.ac.kr/handle/2014.oak/9288	-
dc.description.abstract	Generally, the recognition rate of an automatic speech recognition (ASR) system depends largely on the discriminability of the feature vectors representing the input speech signal. To improve the recognition rate, it is therefore, desirable to increase the discriminating power of the feature vectors. In this thesis, we propose a linear transformation of the feature vector which aims to augment the recognition rate of the ASR by increasing the discriminating power of the feature vectors. By making use of the relative entropy of each phoneme (the unit of recognition), the proposed method tries to shorten the distances between within-class feature vectors, while lengthening the inter-class distances of the feature vectors. The method is based on the observation that as the relative entropy between two classes of feature vectors becomes larger, the dissimilarity increases, and so does the discriminating power between the classes. The proposed transformation matrix of the feature vector is derived as follows: Firstly, the objective function is defined as a function of the divergence which is the average of relative entropy between classes. Then, the objective function is maximized to give the optimal linear transformation matrix by an iterative learning algorithm, the natural gradient ascent method. To examine the effect on the discriminating power of the proposed method, two sets of experiments are performed using the TIMIT corpus: a simple phoneme classification experiment using Euclidian distance measure and a recognition experiment by an ASR system. The results are compared with those of the well known methods, such as PCA, LDA and Li’s method and shown at least 0.28% of improvement.	-
dc.description.tableofcontents	제 1 장 서론 = 1 1.1 연구의 배경 = 1 1.2 연구 방법 및 구성 = 5 제 2 장 음성 인식과정 = 6 2.1 음성 인식을 위한 특징벡터 = 7 2.1.1 선형 예측 계수 = 8 2.1.2 MFCC = 10 2.1.3 미분 계수 = 14 2.2 은닉 마르코프 모델을 이용한 음성인식 = 14 2.2.1 마르코프 프로세서 = 15 2.2.2 은닉 마르코프 모델 = 17 제 3 장 특징벡터의 변별적 변환 = 25 3.1 특징벡터의 변별적 변환을 이용한 음성 인식과정 = 25 3.2 주요 성분분석 = 26 3.3 선형 판별 분석 = 28 3.4 Li의 방법 = 31 제 4 장 상대 엔트로피에 기반한 특징벡터의 변별적 변환 = 35 4.1 상대 엔트로피 = 35 4.2 상대 엔트로피를 이용한 특징벡터의 변별적 변환 = 37 제 5 장 실험 및 고찰 = 44 5.1 음성 데이터 = 44 5.2 특징벡터의 변별적 변환 및 클러스터링 실험 = 46 5.3 음소 단위의 인식 실험 = 61 제 6 장 결론 = 72 참고문헌 = 75	-
dc.language	kor	-
dc.publisher	한국해양대학교 대학원	-
dc.title	상대 엔트로피를 이용한 음성 특징벡터의 변별적 변환에 관한 연구	-
dc.title.alternative	A Study on Discriminative Transformation of Speech Feature Vector based on Relative Entropy	-
dc.type	Thesis	-
dc.date.awarded	2009-02	-
dc.contributor.alternativeName	Yu	-
dc.contributor.alternativeName	Gang-Ju	-

Appears in Collections:: 제어계측공학과 > Thesis

Files in This Item:: 000002175129.pdf Download

메타데이터 간략히 보기

qrcode

트윗하기

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

OAK

ywm85@kmou.ac.kr Tel: 051-410-4085

KMOU Repository는 국립중앙도서관 OAK Repository 보급사업으로 구축되었습니다.

Browse

Library LOGIN