Speaker Identification with Whispered Speech for the Access Control System

NCUIR > College of Electrical Engineering & Computer Science > Department of Computer Science and information Engineering > journal & Dissertation > Item 987654321/107288

Please use this identifier to cite or link to this item: https://ir.lib.ncu.edu.tw/handle/987654321/107288

Title:	Speaker Identification with Whispered Speech for the Access Control System
Authors:	王家慶;Wang, Jia-Ching;Chin, Yu-Hao;Hsieh, Wen-Chi;Lin, Chang-Hong;Chen, Ying-Ren;Siahaan, Ernestasia
Contributors:	資訊電機學院資訊工程學系
Keywords:	Access control;Approximation;Artificial intelligence;Chains;Density;Empirical mode decomposition;Empirical mode decomposition (EMD);Frequencies;Hilbert-Huang transform;instantaneous frequency;Kernels;Mathematical models;Probability;Speaker recognition;Speakers;Speech;Support vector machines;Transforms;Voice recognition;whispered speech
Date:	2015-10-01
Issue Date:	2026-04-23 14:06:10 (UTC+8)
Publisher:	Institute of Electrical and Electronics Engineers Inc.;New York: IEEE
Abstract:	摘要： This work presents an access control system, which is a speaker identification system based on whispered speech. Speaker identification is a main function of an access control system. Hence, a novel speaker identification system using instantaneous frequencies is proposed. The input speech signals pass through both signal independent and signal dependent filters firstly. Then, we derive the signal's instantaneous frequencies by applying the Hilbert transform. The analyzed instantaneous frequencies are proceeded to be modeled as probability density models. We use these probability density models as the feature in the proposed speaker identification system. In this work, we compare the use of parametric and nonparametric probability density estimation for instantaneous frequency modeling. Furthermore, we propose an approximated probability product kernel support vector machine (APPKSVM). In the APPKSVM, Riemann sum is applied in approximating the probability product kernel. The whisper sounds from the CHAIN speech corpus were used in the experiments. Results of the experiments show the superiority of the proposed speaker identification system. 其他題名： TASE 出版者： New York: IEEE 出版日期： 2015-10-01 出處： IEEE transactions on automation science and engineering, 2015-10, Vol.12 (4), p.1191-1199 資源來源： IEEE Xplore 版權： Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Oct 2015 識別號： ISSN: 1545-5955 識別號： EISSN: 1558-3783 識別號： DOI: 10.1109/TASE.2015.2467311 識別號： CODEN: ITASC7
Appears in Collections:	[Department of Computer Science and information Engineering] journal & Dissertation

Files in This Item:

File	Description	Size	Format
index.html		0Kb	HTML	13	View/Open

社群 sharing

Loading...