▍1. Speech-Signal-Processing
语音信号处理c语言程序-音高、加窗、MFCC、PLP、等(C programming language speech signal processing- pitch, add window, MFCC, PLP, etc)
语音信号处理c语言程序-音高、加窗、MFCC、PLP、等(C programming language speech signal processing- pitch, add window, MFCC, PLP, etc)
语音识别开源代码 sphinx 0.8 windows版本(sphinx 0.8,ASR)
互联网著名国产软件小沁语音信使源代码。使用TAPI接口及语音Modem,实现语音电话自动拨打,语音录制及播放,广泛适用于企事业广告营销。(The use of TAPI interface and voice Modem, realize voice telephone automatic dial, voice recording and playback, widely applicable to the business of advertising and marketing. )
该文件为基于DTW的孤立词识别系统(特定人、小词汇量,未采用鲁棒语音识别技术)(The document DTW for isolated word recognition system based on (a specific person, small vocabulary, did not use robust speech recognition technology))
模式识别k聚类算法源代码,c语言内容描述。(Pattern Recognition Chapter )
将wav格式文件转化为txt格式的程序,用于读入语音文件转换TXT格式,便于程序分析(Wav format files into txt format program)
将wav格式文件转化为txt格式的程序,用于读入语音文件转换TXT格式,便于程序分析(Wav format files into txt format program)
基于MFCC的利用LPCC做的说话人识别论文(Based on the MFCC the use of LPCC do the speaker identification papers)
语音通信控件源码点对点专用版简介VSession语音通信控件源码点对点专用版,版本2.0,此版本集成了G729A压缩算法,实时传输协议,话音清晰流畅!使用简单易懂方便!此控件源码是在本人以前发布的控件件源码VSessionn2.0版本的基础上,加入点对点通信时的呼叫,应答,挂断等通话前后的同步功能,使其用于点对点通信更加方便!如果您想要在程序源码中自定义点对点间的通话联系方式,或者有一对多,多对多的 (Voice communications control source peer-to-peer special edition Introduction VSession voice communication control source peer-to-peer special edition, version 2.0, this version integration the G729A compression algorithm, Real-time Transport Protocol, clear and smooth voice! Easy to understand and easy to use! This control source is in the I control previously released the parts source VSessionn2.0 version based on peer-to-peer communication call, answer, hang up, etc. the synchronization function before and after the call to make it more convenient for point-to-point communication ! If you want to program source from the definition of point-to-point calls between Contact, or one-to-many, many-to-many)
科大讯飞关于语音识别开发SDK库,支持语音转汉字功能及根据读汉字功能。有兴趣的可以好好研究研究(IFLYTEK speech recognition development SDK library are interested in a good studies)
语音合成程序源码!psalo频域基音同步叠加方法。它首先对原始语音信号进行短时频域变换,的到短时谱与短时谱包络。短时谱除以短时谱包络的到声源短时谱,对声源短时谱的实部与与虚部分别进行线性插值,就能达到改变变语音信号基频的目的,然后再进行频域反变换,可的到变换后的短时语音信号。短时谱包络部分也能独立改变,以达到改变音色的目的。 (Voice synthesis program source! psalo frequency domain pitch synchronous superposition method. It was first carried out on the original speech signal a short-time frequency-domain transform, to the short-time spectrum and short-time spectrum envelope. The short-time spectrum divided by the short-time spectrum envelope of the short time spectrum of the sound source, the short time spectrum of the real part of the sound source, and the imaginary parts of the linear interpolation, can achieve the purpose of changing the fundamental frequency of the alternating speech signal, and then then the inverse transform of the frequency domain, can be to the short-time speech signal after conversion. The short-time spectral envelope section can be varied independently, in order to achieve the purpose of changing the tone.)
这 里主要对LMS算法及一些改进的LMS算法(NLMS算法、变步长LMS算法、变换域LMS算法)之间的不同点进行了比较,,在传统的LMS算法的基础上发 展了LMS算法的应用。另一方面又从RLS算法的分析析中对其与LMS算法的不同特性进行了比较。 (Here the main difference between the LMS algorithm and improved LMS algorithm (NLMS algorithm, variable step size LMS algorithm, the transform domain LMS algorithm) comparison, the traditional LMS algorithm based on the development of the application of the LMS algorithm . On the other hand and from its different characteristics of the LMS algorithm of the analytical analysis of the RLS algorithm.)
这是自己毕业设计做的PCA和LDA的结合,训练和识别别过程都有,用的是ORL库里的图像,具有较高的识别率! (This is a combination of PCA and LDA graduation design, training and do the process of the identification with ORL library of images, has a high recognition rate!)
文中研究了6种常用数字调制信号识别的特征参数集,并采用决策树判别方法进行分类识别。仿真结果表明,在SNR≥5dB时,识别正确率在99 以上,且当SNR≥20dB时,识别正确率达到100 。其特点是,算法简单,识别正确率高,达到了自动分类识别的目的,并有利于实现识别的实时化。(In this paper, we study the set of characteristic parameters of the six kinds of commonly used digital modulation signal recognition, and decision tree method for classification. The simulation results show that SNR 鈮� 5dB, the correct rate more than 99 , and when SNR 鈮� 20dB, the correct rate of 100 . Which is characterized by simple algorithm to identify the correct rate, to achieve the purpose of automatic classification and recognition, and help to identify real-time.)
语音信号处理的lbg算法法在语音信号处理中非常重要 (Lbg of voice signal processing algorithms in speech signal processing is very important)
ITU P.563 语音质量评价,能只送入失真信信号,而不需要参考信号,便能评价失真信号的MOS分 (ITU P.563 voice quality assessment can only be sent to the distortion of the letter signals without the need for a reference signal will be able to evaluate distortion signal MOS score)
基于vc++开发,dtw语音识别,MFCC参数(DTW recognice based on vc++)
是一个语音识别的程序,有LPCC,MFCC等的代码,最后用SVM进行分类(Is a voice recognition program, there LPCC, MFCC and other code, the final classification with SVM)
LD3320 是一颗基于非特定人语音识别(SI-ASR:Speaker-Independent Automatic Speech Recognition)技术的语音识别/声控芯片。提供了真正的单 芯片语音识别解决方案。 LD3320 芯片上集成了高精度的A/D 和D/A 接口,不再需要外接辅助的 Flash 和RAM,即可以实现语音识别/声控/人机对话功能。并且,识别的关键词 语列表是可以动态编辑的。(Voice Recognition ;Speech Recognition ;Speech recognition technology )
语音识别,实现语音的识别功能,,特定人的语音识别。识别0~9的发音(Speech recognition, voice recognition function, a particular person' s speech recognition. Identify the pronunciation of 0 to 9)