▍1. cochleagram.tar
一个语音分析程序,包含gammatone滤波器组的分解和重组。对学习语音分析的很有帮助(A speech analysis program contains the gammatone filter group decomposition and reorganization. Helpful learning speech analysis)
一个语音分析程序,包含gammatone滤波器组的分解和重组。对学习语音分析的很有帮助(A speech analysis program contains the gammatone filter group decomposition and reorganization. Helpful learning speech analysis)
这是基于隐马尔可夫模型的连续语音识别代码,不同于dtw,这是个完整的工程,我把需要的voicebox也放进去了,一共用到9个子函数,包括模板的训练算法何识别算法,是完全可以使用的。(This is based on the hidden Markov model of continuous voice recognition code, different the dtw, which is a complete project, I need to put the voicebox into a total of nine sub-functions, including the template training algorithm to identify the algorithm, Is fully available.)
语音信号处理第三版 赵力等编著 配套程序(Codes of speech signal processing 3rd.)
做语音识别时用到DTW 算法。便于初学者的学习。(DTW algorithm is used for speech recognition. Easy for beginners to learn.)
优化小波阈值,对硬阈值和软阈值的改进,通常用于语音信号增强,去噪(Optimization of wavelet threshold, the hard threshold and soft threshold improvement, commonly used in speech signal enhancement and denoise)
matlab程序,功能是在语音中提取基音频率的程序。输入为.wav格式的语音文件,输出各帧基音频率。(matlab procedures, functions in the voice pitch frequency extraction procedure. Input. Wav format audio files, the output of the frame pitch.)
基音提取并换算成音高,matlab程序,加入了优化的功能使得结果更好(Pitch extraction and converted into pitch, matlab program, joined the optimization of the function of making the results better)
这是一个语音处理工具箱,提供了语音处理的基本函数,包括有分帧、能量计算、零交叉计算、多种方法的音调提取、共振峰提取。。。等等。(This is a speech processing toolbox provides the basic function of voice processing, including one of those who frame, energy calculation, zero cross-terms, various methods of pitch extraction, formant extraction. . . And so on.)
语音的识别的前端技术,语音的基音周期检测(Front-end voice recognition technology, Voice Pitch Detection)
短时平均能量和平均幅度差函数进行基音检测的方法(Short-term average energy and average magnitude difference function pitch detection method)
中国科学技术大学本科课程语音信号处理实验代码(speech processing code for USTCers)
基于ICA和PCA相结合实现的语音信号的采集、随机混合,再通过盲分离将混合后的语音信号分离(Speech signal acquisition and random mixing based on ICA and PCA, and then the mixed speech signal is separated by blind separation.)
熵值越大则每个符号包含的平均信息量越大。有研究发现,在有噪声的语音信号中,语音信号的熵和噪声信号的熵存在着较大的差异,对噪声信号来说在整个频带内分布相对平坦,熵值小,语音信号集中在某些特定频段内,熵值大。因此利用这个差异可以区分噪音段和语音段。(The greater the entropy is, the greater the average information of each symbol is. It is found that, in noisy speech signals, the entropy of speech signals and the entropy of noise signals are quite different. For noisy signals, the distribution is relatively flat in the whole frequency band, and the entropy value is small. The speech signal is concentrated in some specific frequency bands, and the entropy value is large. So the difference can be used to distinguish the noise segment and the speech segment.)
语音信号的短时分析,主要包括:分帧、短时能量、短时平均幅度、短时过零率、短时自相关函数、短时幅度差、倒谱、复倒谱、lpc系数、lpc谱估计等 绝对保证质量,是保研后导师布置的一些基础程序(Short-time speech signal analysis, mainly including: sub-frame, short-time energy, short-term average, short-time zero-crossing rate, short-time auto-correlation function, short-term rate of poor cepstrum, complex cepstrum, lpc coefficients, lpc spectral estimation, such as an absolute guarantee that the quality of instructors is the security arrangement after the inquest some of the basis of procedures)
实现对一个余弦信号的均匀量化和μ律非均匀量化,并将结果输出在图形界面上。(The uniform quantization and non uniform quantization of a cosine signal are realized and the results are output on the graphical interface.)