Full metadata record

| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | 박정식 | ko |
| dc.contributor.author | 박경미 | ko |
| dc.contributor.author | 오영환 | ko |
| dc.date.accessioned | 2015-12-17T00:02:38Z | - |
| dc.date.available | 2015-12-17T00:02:38Z | - |
| dc.date.created | 2015-11-13 | - |
| dc.date.issued | 2010-05 | - |
| dc.identifier.citation | IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, v.56, no.2, pp.1123-1129 | - |
| dc.identifier.issn | 0098-3063 | - |
| dc.identifier.uri | http://hdl.handle.net/YU.REPOSITORY/22500 | - |
| dc.identifier.uri | http://dx.doi.org/10.1109/TCE.2010.5506048 | - |
| dc.description.abstract | This paper proposes an online speaker segmentation approach based on Gaussian Mixture Model (GMM) adaptation for spoken document retrieval. In the conventional approach using the Bayesian Information Criterion (BIC), a single Gaussian model is constructed for each of the two speech streams obtained by dividing an analysis window, and the dissimilarity between the two models is estimated according to the BIC principle. This approach has been widely applied to speaker segmentation. However, its performance may deteriorate when speakers change frequently, since a single Gaussian model can hardly represent a speaker's explicit characteristics from short speech data. To overcome this limitation, we propose using adapted GMMs instead of single Gaussian models. The proposed method constructs a local universal background model (UBM) for the speech in an analysis window and adapts the local UBM to each of the two divided speech streams in the same window. Using the two adapted GMMs, the likelihood of each speech stream is estimated, and a speaker change is detected according to our criterion based on local maxima of BIC. In speaker segmentation experiments on HUB4, a well-known broadcast news corpus, the proposed method exhibited superior performance compared to the conventional approaches. | - |
| dc.language | English | - |
| dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
| dc.subject | GAUSSIAN MIXTURE-MODELS | - |
| dc.subject | AUDIO CONTENT-ANALYSIS | - |
| dc.title | GMM Adaptation based Online Speaker Segmentation for Spoken Document Retrieval | - |
| dc.type | Article | - |
| dc.identifier.wosid | 000281747600121 | - |
| dc.identifier.scopusid | 2-s2.0-77954754555 | - |
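
The conventional BIC test that the abstract above uses as its baseline can be sketched as follows. This is a minimal illustration of the textbook single-Gaussian ΔBIC, not the GMM-adaptation method the paper proposes, and it is not taken from the paper itself. The function names `delta_bic` and `best_split`, the penalty weight `lam`, the `margin` parameter, the small covariance ridge, and the synthetic features are all assumptions made for the example.

```python
import numpy as np


def delta_bic(window, split, lam=1.0):
    """Single-Gaussian delta-BIC for splitting `window` (frames x dims) at `split`.

    A positive value favors two separate Gaussians, i.e. a speaker change.
    """
    n, d = window.shape
    left, right = window[:split], window[split:]

    def logdet_cov(x):
        # Maximum-likelihood full covariance; the tiny ridge (an assumption)
        # keeps the determinant finite for short segments.
        cov = np.cov(x, rowvar=False, bias=True) + 1e-6 * np.eye(d)
        return np.linalg.slogdet(cov)[1]

    # Model-complexity penalty: d mean terms plus d(d+1)/2 covariance terms.
    penalty = 0.5 * lam * (d + 0.5 * d * (d + 1)) * np.log(n)
    return (0.5 * n * logdet_cov(window)
            - 0.5 * len(left) * logdet_cov(left)
            - 0.5 * len(right) * logdet_cov(right)
            - penalty)


def best_split(window, margin=20, lam=1.0):
    """Return (frame index, score) of the candidate split with the largest delta-BIC."""
    candidates = range(margin, len(window) - margin)
    scores = [delta_bic(window, s, lam) for s in candidates]
    k = int(np.argmax(scores))
    return candidates[k], scores[k]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-in for acoustic features: two "speakers" with shifted means.
    speaker_a = rng.normal(0.0, 1.0, size=(200, 4))
    speaker_b = rng.normal(3.0, 1.0, size=(200, 4))
    frame, score = best_split(np.vstack([speaker_a, speaker_b]))
    print("best split frame:", frame, "delta-BIC:", round(float(score), 1))
```

With only a few frames per segment, the covariance estimates in `logdet_cov` become unreliable, which is the short-segment weakness the abstract attributes to single Gaussian models.
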
Appears in Collections:
College of Engineering > Department of Mobile Information and Communication Engineering > Articles
Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
