Full metadata record

DC Field | Value | Language
dc.contributor.author | 박정식 | ko
dc.contributor.author | Park, Kyung-Mi | ko
dc.contributor.author | Bae, Jae-Hyun | ko
dc.contributor.author | Oh, Yung-Hwan | ko
dc.date.accessioned | 2015-12-17T02:02:22Z | -
dc.date.available | 2015-12-17T02:02:22Z | -
dc.date.created | 2015-11-13 | -
dc.date.issued | 2012-12 | -
dc.identifier.citation | INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, v.26, no.8 | -
dc.identifier.issn | 0218-0014 | -
dc.identifier.uri | http://hdl.handle.net/YU.REPOSITORY/26828 | -
dc.identifier.uri | http://dx.doi.org/10.1142/S0218001412600117 | -
dc.description.abstract | Speaker diarization detects speaker change points in spoken data and organizes speaker clusters so that each cluster contains one speaker's segments. This study aims to develop online speaker diarization for multimedia data retrieval on mobile devices. Researchers have proposed various methods of diarization, but most approaches thus far depend on an empirically determined threshold as a criterion or work in an offline manner that requires prior knowledge, such as the overall number of speakers. There are therefore clear drawbacks with mobile devices, on which various types of spoken data are frequently played and replaced. A new approach to online speaker segmentation and clustering is proposed for overcoming these drawbacks. The proposed segmentation method considers the temporal locality of an analysis window, assuming that each window contains only a small number of speakers. In accordance with this property, a local universal background model (UBM) is constructed in a window and the model is used to detect speaker change points. A cluster boundary-based dynamic decision criterion is proposed for speaker clustering. This approach estimates the internal characteristics of clusters and uses them to determine cluster boundaries. In experiments using a broadcast news corpus, our techniques exhibited superior performance compared to conventional approaches. | -
dc.language | English | -
dc.publisher | WORLD SCIENTIFIC PUBL CO PTE LTD | -
dc.subject | GAUSSIAN MIXTURE-MODELS | -
dc.subject | SEGMENTATION | -
dc.title | ONLINE SPEAKER DIARIZATION FOR MULTIMEDIA DATA RETRIEVAL ON MOBILE DEVICES | -
dc.type | Article | -
dc.identifier.wosid | 000315523100009 | -
dc.identifier.scopusid | 2-s2.0-84874397543 | -
Appears in Collections:
College of Engineering > Department of Mobile Information and Communication Engineering > Articles
Files in This Item:
There are no files associated with this item.

