Practical Exercise - Task 1
MFCCs
Speech Technology - COM4511/6511
12th March 2021
1 Introduction
The aim of this task is to learn how to use Python for speech processing. Several Python tutorials are available
online, in particular on NumPy and its use for signal processing. In this task you will implement a Python
program that converts a speech signal into Mel Frequency Cepstral Coefficients (MFCCs).
2 Infrastructure
A speech audio file from the TIMIT corpus, a Python helper script and example output are provided. These can be
found on MOLE in the folder named task1. They are also available for download at https://www.dcs.shef.ac.uk/~th/campus_only
3 Task
To help you start the task, the provided Python script contains all the necessary steps and a few utility functions
for this task. You are asked to fill in the missing parts indicated by ####.
1. From the link above, download the audio file “SA1.wav” and the Python script “task1_compute_mfcc.py”.
2. Take a look at the script and find out which parts need to be implemented.
3. Convert the complete audio file into a sequence of MFCC vectors using the following baseline configuration.
(a) A pre-emphasis filter with coefficient 0.97
(b) 10ms frame step, 25ms frame length (Hamming window)
(c) 26 Mel-filters
(d) 12 cepstral coefficients (C1-C12, omitting C0)
(e) 22-order liftering.
(f) Cepstral mean and variance normalisation (optional)
(g) Plot different coefficients and consider the implications of what you see
(h) Adopt any good coding style with meaningful comments. Use functions when possible.
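Steps (a) and (b) of the baseline can be sketched with NumPy alone. This is a minimal illustration, not the contents of the provided helper script: the function names `pre_emphasis`, `frame_signal` and `window_frames` are hypothetical, and padding the final frame with zeros is an assumption about how the tail of the signal is handled.

```python
import numpy as np


def pre_emphasis(signal, coeff=0.97):
    """Apply y[n] = x[n] - coeff * x[n-1]; the first sample is passed through unchanged."""
    signal = np.asarray(signal, dtype=float)
    return np.append(signal[0], signal[1:] - coeff * signal[:-1])


def frame_signal(signal, sample_rate, frame_len_s=0.025, frame_step_s=0.010):
    """Cut a 1-D signal into overlapping frames (25 ms long, 10 ms apart by default)."""
    signal = np.asarray(signal, dtype=float)
    frame_len = int(round(frame_len_s * sample_rate))
    frame_step = int(round(frame_step_s * sample_rate))
    if len(signal) <= frame_len:
        num_frames = 1
    else:
        num_frames = 1 + int(np.ceil((len(signal) - frame_len) / frame_step))
    # Assumption: zero-pad the tail so that the last partial frame is kept.
    pad_len = (num_frames - 1) * frame_step + frame_len
    padded = np.concatenate([signal, np.zeros(pad_len - len(signal))])
    # Row i of idx holds the sample indices that belong to frame i.
    idx = np.arange(frame_len)[None, :] + frame_step * np.arange(num_frames)[:, None]
    return padded[idx]


def window_frames(frames):
    """Multiply every frame by a Hamming window of the same length."""
    return frames * np.hamming(frames.shape[1])
```

With these defaults, one second of audio sampled at 16 kHz (an assumed rate; the real value should be read from the wav file) gives 99 frames of 400 samples each.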
You can (and should) use NumPy and SciPy for basic numerical computing and the FFT/DCT computation, but
no other packages (the helper script already provides all the necessary imports). If you are not sure, just raise
your hand and ask.
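The remaining baseline steps (c) to (f) can be written with NumPy and SciPy only, as in the sketch below. It starts from frames that have already been windowed. Several details are assumptions rather than requirements of the task: the FFT size of 512, the mel formula 2595 log10(1 + f/700), triangular filters spanning 0 Hz to half the sampling rate, an orthonormal type-II DCT, and the HTK-style lifter 1 + (L/2) sin(πn/L). All function names are hypothetical and may not match the helper script.

```python
import numpy as np
from scipy.fft import dct


def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(num_filters, nfft, sample_rate):
    """Triangular filters with centres equally spaced on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), num_filters + 2)
    bins = np.floor((nfft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((num_filters, nfft // 2 + 1))
    for m in range(1, num_filters + 1):
        left, centre, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, centre):      # rising slope
            fbank[m - 1, k] = (k - left) / (centre - left)
        for k in range(centre, right):     # falling slope, peak of 1 at the centre bin
            fbank[m - 1, k] = (right - k) / (right - centre)
    return fbank


def mfcc_from_frames(windowed_frames, sample_rate, nfft=512,
                     num_filters=26, num_ceps=12, lifter=22):
    """Power spectrum -> log mel energies -> DCT -> C1..C{num_ceps} -> liftering."""
    power = np.abs(np.fft.rfft(windowed_frames, n=nfft, axis=1)) ** 2 / nfft
    energies = power @ mel_filterbank(num_filters, nfft, sample_rate).T
    # Floor the energies so that log() never sees zero.
    log_energies = np.log(np.maximum(energies, np.finfo(float).eps))
    # Column 0 of the DCT output is C0, which the baseline omits.
    ceps = dct(log_energies, type=2, axis=1, norm="ortho")[:, 1:num_ceps + 1]
    n = np.arange(1, num_ceps + 1)
    return ceps * (1.0 + (lifter / 2.0) * np.sin(np.pi * n / lifter))


def cmvn(features):
    """Normalise each coefficient to zero mean and unit variance over the utterance."""
    mean = features.mean(axis=0)
    std = features.std(axis=0)
    return (features - mean) / np.maximum(std, np.finfo(float).eps)
```

Each row of the returned array is one MFCC vector, so a call such as `cmvn(mfcc_from_frames(frames, sample_rate))` yields one 12-dimensional vector per frame.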
4 Assessment
Note that for any module assignment, full marks will only be obtained for outstanding performance that goes
well beyond the questions asked. The marks allocated for each assignment are 20%. The marks will be assigned
according to the following general criteria, for every assignment handed in:
1. Fulfilling the basic requirements (5%)
Full marks will be given for fulfilling the work as described, in the source code and the results given.
2. Submitting high quality documentation (5%)
Full marks will be given for a write-up that meets the highest standard of technical writing and illustration.
3. Showing good reasoning (5%)
Full marks will be given if the experiments and the outcomes are explained to the best standard.
4. Going beyond what was asked (5%)
Full marks will be given for interesting ideas on how to extend the work that are well motivated and described.
In order to report this task as complete, 3 elements have to be submitted, in gzip form. The file name should have
the following format:
<lastname>_<firstname>_task1.gz
1. The MFCC-encoded reference audio file in text format.
Each line contains one MFCC vector; the values are separated by a single space.
2. A compiled LaTeX document (pdf) that contains 4 plots (matplotlib) of the MFCCs in the following configurations
(modifications to the baseline), with detailed comments on how each differs from the baseline configuration
as outlined above. Also briefly comment on the reasons for the effects.
(a) No Hamming window
(b) 40 MFCCs
(c) 80 Filterbanks, 40 MFCCs
(d) No Pre-emphasis.
3. The completed source code for the baseline configuration as outlined in Section 3.
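As an illustration of the text format in item 1 and the four variations in item 2, the sketch below writes one MFCC vector per line with `numpy.savetxt` and records the configurations as plain dictionaries. The six-decimal precision, the dictionary keys, the variant names and the function name `save_mfcc_text` are all assumptions made for this example. Treating "no Hamming window" as a rectangular window and "no pre-emphasis" as a coefficient of 0.0 is one possible reading of the task.

```python
import numpy as np

# Baseline settings from Section 3 (key names are hypothetical).
BASELINE = {"pre_emphasis": 0.97, "window": "hamming", "num_filters": 26, "num_ceps": 12}

# The four variations from item 2, each changing only what the task lists.
VARIANTS = {
    "no_hamming": {**BASELINE, "window": "rectangular"},
    "more_ceps": {**BASELINE, "num_ceps": 40},
    "more_filters": {**BASELINE, "num_filters": 80, "num_ceps": 40},
    "no_pre_emphasis": {**BASELINE, "pre_emphasis": 0.0},
}


def save_mfcc_text(path, mfcc):
    """Write one MFCC vector per line, values separated by a single space."""
    np.savetxt(path, np.asarray(mfcc, dtype=float), fmt="%.6f", delimiter=" ")
```

A file written this way can be read back with `np.loadtxt(path)` to check that it has one row per frame.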