上海交通大学-电子信息与电气工程学院-English

中文

News

· Bin Dai's Team Unveils the Assembly Mechanism of β-Lactoglobulin Fibrils, Providing New Insights for the Development of Functional Nanomaterials
· Mingyi Chen’s research group has made important progress in the field of analog-to-digital converter chips for brain-computer interface
· Progress in the Development of Semiconductor Nanomaterials to Activate Pyroptosis for Cancer Therapy
· Jiamiao Yang’s team achieved the high precision optoelectronic reservoir computing based on complex-value encoding
· Significant Advancements in Resonator-Enhanced Quantum Sensing Achieved by Zenguihua's Team at the School of Sensing Science and Engineering

CS26001 Intelligent Speech Technology 智能语音技术

课程名称 (Course Name) ： Intelligent Speech Technology

课程代码 (Course Code)： CS26001

学分/学时 (Credits/Credit Hours)： 3.0/48

开课时间 (Course Term )： Spring

开课学院（School Providing the Course）: Computer Science Department

任课教师（Teacher）: Kai Yu

课程讨论时数（Course Discussion Hours）: 4

课程实验数（Lab Hours）: 4

课程内容简介(Course Introduction):

This course introduces the basic theory and tools for intelligent speech technology, with a focus on automatic speech recognition. It includes fundamental theories of pattern recognition and machine learning, acoustic modeling (hidden Markov model), language model (N-gram model), and advanced techniques of large vocabulary continuous speech recognition.

教学大纲(Course Teaching Outline):

1. Overview Intelligent Speech Technology

2. Probability and Stochastic Process

3. Pattern Recognition

4. Bayesian decision theory

5. Expectation Algorithm and Gaussian Mixture Model

6. Speech Processing and Feature Extraction

7. Hidden Markov Model

8. Statistical language model

9. Decoding algorithm

10. Issues of large vocabulary continuous speech recognition

课程进度计划(Course Schedule):

Week 1-5: Pattern Recognition and Bayesian decision theory

Week 6: EM algorithm and Gaussian mixture model

Week 7-8: Speech processing and feature extraction

Week 9-10: hidden Markov model

Week 11: Statistical language model

Week 12: Advanced techniques of LVCSR

课程考核要求(Course Assessment Requirements)：

Evaluation will be based on two course works and one group presentation.

参考文献(Course References)：

Xuedong Huang, Alex Acero, Raj Reddy and Hsiao-Wuen Hon, Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice Hall, 2001.

S. J. Young, D. Kershaw, J. J. Odell, D. Ollason, V. Valtchev, and P. C. Woodland. The HTK Book (for HTK version 3.0). Cambridge University Engineering Department, 2000.

预修课程（Prerequisite Course）

Calculus, Probability, Linear Algebra

[ 2015-11-26 ]

Next page: Notice on Arrangement and Enrollment of Graduate Courses for 2022 Fall...

Last page: 【学术报告会】Academic Seminar/Academic Reports

Students

News

CS26001 Intelligent Speech Technology 智能语音技术

Students

Faculty/Staff

Alumni

Vistors

Quick Links