Speaker-dependent Isolated-Word Speech Recognition System Based on Vector Quantization

Publications

Share / Export Citation / Email / Print / Text size:

International Journal of Advanced Network, Monitoring and Controls

Xi'an Technological University

Subject: Computer Science , Software Engineering

GET ALERTS

eISSN: 2470-8038

DESCRIPTION

15
Reader(s)
29
Visit(s)
0
Comment(s)
0
Share(s)

SEARCH WITHIN CONTENT

FIND ARTICLE

Volume / Issue / page

Related articles

VOLUME 2 , ISSUE 3 (September 2017) > List of articles

Speaker-dependent Isolated-Word Speech Recognition System Based on Vector Quantization

Yinyin Zhao / Lei Zhu

Keywords : Speech recognition, LBG, MFCC, Vector Quantization

Citation Information : International Journal of Advanced Network, Monitoring and Controls. Volume 2, Issue 3, Pages 93-97, DOI: https://doi.org/10.1109/iccnea.2017.103

License : (CC BY-NC-ND 4.0)

Published Online: 12-April-2018

ARTICLE

ABSTRACT

Speaker-dependent speech recognition system requires the system should not only recognize speech, but also recognize the speaker of the segment. In this paper, two indicators are selected—short-time average zero-crossing rate and dual-threshold endpoint to test the signal endpoint through the study of speaker-dependent isolated-word speech characteristics, and MFCC parameters are taken as the characteristic parameters; based on vector quantization, template matching algorithms are designed, and one is adopted to improve LBG algorithm to increase the computing speed; speaker-dependent isolated-word speech recognition system is designed based on vector quantization technique and simulation experiments are conducted in the MATLAB platform under various backgrounds, which proves the system has better recognition effect.

Content not available PDF Share

FIGURES & TABLES

REFERENCES

Y. Wang , F. Tang ,J Zheng . Robust Text-independent Speaker Identification in a Time-varying Noisy Environment[J].Journal of Software,2012, 7(9):1975-1980.

 

Satyanand Singh, E.G Rajan. MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors[J]. International Journal of Computer Applications. 2011(6) :1-5.

 

Zhang Y, Long H, Shen S, et al. A novel codebook design with the LBG algorithm in precoding systems under spatial correlated channel[C]. Communications Circuits and Systems (ICCCAS), 2010 International Conference on . 2010:770-775.

 

Ashkan Parsi, Ali Ghanbari Sorkhi, Morteza Zahedi. Improving the unsupervised LBG clustering algorithm performance in image segmentation using principal component analysis[J].Signal, Image and Video Processing, 2016, 10 (2):301-309.

 

Chen Chen Huang, Wei Gong, Wen Long Fu, Dong Yu Feng.A Research of Speaker Recognition Based on VQ and MFCC[J]. Applied Mechanics and Materials, 2014, 3468 (644):4325-4329.

 

M Sahidullah, G Saha. A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition[J]. IEEE Sinal Processing Letters, 2013, 20(2):149-152.

 

Woo Yong, ChoiHwa Jeon, SongHoon Chung.I-vector Based Utterance Verification for Large-Vocabulary Speech Recognition System[C]. 2016 First IEEE International Conference on Computer Communication and the Internet,2016:334-337.

 

Lin CY, Prangjarote P, Yeh CH, et al. Reversible joint fingerprinting and decryption based on side match vector quantization[J]. Signal Processing, 2014, 98(1):52-61.

 

Jian BS.,Robust Multiple Antennas Cooperative Spectrum Sharing Design With Random Vector Qquantization[J]. 2014, 62(4): 486-492.

 

A Chaudhari,A Rahulkar, SB Dhonde. Combining dynamic features with MFCC for text-independent speaker identification[J]. International Conference on Information Processing,2016:160-164.

 

EXTRA FILES

COMMENTS