SP Module 4 the Source-Filter Model

杨丝儿
Published 2022-11-10 15:35:22
Included in the column: 杨丝儿的小站

Harmonics

In the frequency domain, periodic signals have harmonic structure: they contain energy only at multiples of their fundamental frequency.

Voiced sounds, unlike unvoiced sounds, have a repeating pattern in the waveform: they are periodic. As a result, the peaks at the harmonics are easy to observe in the frequency domain.
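As a small illustration of harmonic structure, the sketch below repeats an arbitrary one-period pattern and computes its discrete Fourier transform directly from the definition. The pattern values, the 64-sample frame and the helper name `dft_magnitudes` are assumptions chosen for illustration; they are not part of the original module.

```python
import cmath

def dft_magnitudes(x):
    """Magnitude of the DFT of x, computed directly from the definition."""
    n = len(x)
    return [
        abs(sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n)))
        for k in range(n)
    ]

# One period of an arbitrary waveform (hypothetical values), 8 samples long.
period = [1.0, 0.5, -0.2, 0.3, -0.7, 0.1, 0.0, -0.6]
signal = period * 8  # 64 samples = exactly 8 periods

mags = dft_magnitudes(signal)

# With 8 periods in the frame, the fundamental falls in DFT bin 8.
fundamental_bin = len(signal) // len(period)

# Bins holding more than numerical round-off.
active_bins = [k for k, m in enumerate(mags) if m > 1e-9]
print(active_bins)
```

Every bin that is printed is a multiple of `fundamental_bin`; the bins in between contain only round-off error, which is the harmonic structure described above.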

Impulse train

An impulse train is the simplest periodic signal that has energy at all multiples of its fundamental frequency (and the energy is evenly distributed across them).
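The flat harmonic spectrum of an impulse train can be checked numerically. This is a minimal sketch, assuming a 64-sample frame with one impulse every 8 samples; the sizes and the helper name `dft_magnitudes` are illustrative choices.

```python
import cmath

def dft_magnitudes(x):
    """Magnitude of the DFT of x, computed directly from the definition."""
    n = len(x)
    return [
        abs(sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n)))
        for k in range(n)
    ]

n_samples, period = 64, 8  # assumed frame length and pitch period (in samples)
impulse_train = [1.0 if t % period == 0 else 0.0 for t in range(n_samples)]

mags = dft_magnitudes(impulse_train)

# Harmonics sit at multiples of n_samples / period = 8 bins.
harmonic_bins = list(range(0, n_samples, n_samples // period))
harmonic_mags = [mags[k] for k in harmonic_bins]
other_mags = [m for k, m in enumerate(mags) if k not in harmonic_bins]

print(harmonic_mags)    # all harmonics have (numerically) the same magnitude
print(max(other_mags))  # everything else is round-off
```

All harmonics come out with the same magnitude (the number of impulses in the frame), while the remaining bins are zero up to round-off.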

Spectral envelope

Varying the shape of the spectral envelope is the primary means by which a speaker transmits a linguistic message to a listener.

Spectral envelope: the smooth curve that traces the peaks of the harmonics in the frequency domain, i.e. the overall shape of the spectrum.

Resonant tube

To understand how the vocal tract modifies sound, we need to start with the concept of resonance.

Q: Increased damping in a resonant system…
A: increases the bandwidth of the frequency response.

Increased damping cannot increase gain (i.e. boost amplitude): more damping means the vibration of an object fades away sooner (i.e. loses amplitude), so we can rule out that answer. Bandwidth refers to the width of the frequency response curve (see M4: Vocal tract resonance and formants). A decreased bandwidth would indicate that fewer frequencies around the resonant frequency are boosted (and thus consume energy). This sort of narrow bandwidth is what allows a tuning fork to ring for a long time. An increased bandwidth means that more frequencies around the resonant frequency get boosted. This is associated with a lower peak amplitude (as the energy is spread across a wider band of frequencies). This is consistent with increased damping: energy is spread over more frequencies, so the oscillations due to resonance die out more quickly. (M4: Vocal tract resonance and formants; Wayland, Chapter 6: Damping. This is a very challenging question!)
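The relationship between damping, peak gain and bandwidth can be sketched numerically. The example below assumes a textbook damped second-order oscillator (not necessarily the model used in the course materials) and measures the peak gain and the half-power bandwidth on a frequency grid. The resonant frequency of 500 Hz, the two damping ratios and the function names are hypothetical.

```python
import math

def gain(f, f0, zeta):
    """Magnitude response of a damped second-order oscillator at f (Hz).

    f0 is the undamped resonant frequency, zeta the damping ratio.
    """
    w, w0 = 2 * math.pi * f, 2 * math.pi * f0
    return 1.0 / math.sqrt((w0**2 - w**2) ** 2 + (2 * zeta * w0 * w) ** 2)

def peak_and_bandwidth(f0, zeta, f_max=1000.0, step=0.1):
    """Return (peak gain, half-power bandwidth in Hz) measured on a grid."""
    freqs = [i * step for i in range(1, int(f_max / step))]
    gains = [gain(f, f0, zeta) for f in freqs]
    peak = max(gains)
    # Frequencies whose gain is within 3 dB of the peak.
    passband = [f for f, g in zip(freqs, gains) if g >= peak / math.sqrt(2)]
    return peak, passband[-1] - passband[0]

peak_low, bw_low = peak_and_bandwidth(f0=500.0, zeta=0.02)    # lightly damped
peak_high, bw_high = peak_and_bandwidth(f0=500.0, zeta=0.10)  # heavily damped

print(f"light damping: bandwidth {bw_low:.1f} Hz, peak gain {peak_low:.3e}")
print(f"heavy damping: bandwidth {bw_high:.1f} Hz, peak gain {peak_high:.3e}")
```

Under these assumptions the heavily damped resonator has the wider bandwidth and the lower peak gain, which matches the answer above.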

Vocal tract resonance & formants

A speaker can vary their vocal tract shape to change its resonant frequencies, and therefore the spectral envelope of the speech they are producing.

Formant frequencies: Frequencies around which acoustic energy is concentrated as a result of the filtering action of the vocal tract, visible as prominent peaks in a spectrum. (resonances of the vocal tract)

Each such peak is called a formant. Formants are properties of the vocal tract: $F_1$ is the first formant and $F_2$ is the second formant.

$F_0$, however, is the fundamental frequency: the rate of vibration of the vocal folds. It is not a formant.
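To make the link between vocal tract shape and formants concrete, here is a rough sketch using the standard idealisation of the vocal tract as a uniform tube closed at one end (the glottis) and open at the other (the lips), whose resonances fall at odd multiples of c / 4L. The tube lengths, the speed of sound of 350 m/s and the function name `tube_resonances` are assumptions for illustration; real vocal tracts are not uniform tubes.

```python
def tube_resonances(length_m, n_formants=3, speed_of_sound=350.0):
    """Resonant frequencies (Hz) of an idealised uniform tube, closed at one end.

    The k-th resonance is (2k - 1) * c / (4 * L).
    """
    return [
        (2 * k - 1) * speed_of_sound / (4 * length_m)
        for k in range(1, n_formants + 1)
    ]

# A hypothetical 17.5 cm vocal tract versus a shorter 14 cm one.
long_tract = tube_resonances(0.175)
short_tract = tube_resonances(0.14)

print([round(f) for f in long_tract])
print([round(f) for f in short_tract])
```

The shorter tube has higher resonances, so changing the tube changes the formants. $F_0$ does not appear anywhere in the calculation: it is set by the vocal folds, not by the tube.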

Filter

We now shift from an explicit physical model of the vocal tract as a resonating tube, to a more general model of the vocal tract as a filter operating on signals.

A filter is something that maps an input signal in domain $X$ to an output signal in domain $Y$, like a function in mathematics.
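The "filter as a function" view can be written down directly. The sketch below assumes a simple two-pole digital resonator as the filter; the centre frequency, bandwidth, sample rate and the name `make_resonator` are hypothetical choices, not something specified in the module.

```python
import math

def make_resonator(centre_hz, bandwidth_hz, sample_rate):
    """Return a function mapping an input signal (list) to an output signal.

    The filter is a two-pole resonator:
        y[n] = x[n] + a1 * y[n-1] + a2 * y[n-2]
    """
    r = math.exp(-math.pi * bandwidth_hz / sample_rate)  # pole radius
    theta = 2 * math.pi * centre_hz / sample_rate        # pole angle
    a1, a2 = 2 * r * math.cos(theta), -r * r

    def apply(x):
        y = []
        for n, sample in enumerate(x):
            y1 = y[n - 1] if n >= 1 else 0.0
            y2 = y[n - 2] if n >= 2 else 0.0
            y.append(sample + a1 * y1 + a2 * y2)
        return y

    return apply

resonator = make_resonator(centre_hz=1000.0, bandwidth_hz=100.0, sample_rate=8000)

input_signal = [1.0, 0.0, 0.5, -0.25, 0.0, 0.0]
output_signal = resonator(input_signal)
print(output_signal)
```

The filter takes one signal and returns another of the same length, and because it is linear, doubling the input doubles the output.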

Impulse response

If we want to characterise a filter in the time domain, we need to know its impulse response.

The impulse response is the output of the filter when its input is a single impulse.

If we narrow the analysis frame down to a single period of the waveform, we see the impulse response of the filter (left panel of the figure) and, from its spectrum, the frequency response of the filter (right panel).
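The two views of a filter can be reproduced in a few lines: feed in a single impulse to get the impulse response, then take the DFT of that response to get the frequency response. This is a sketch under stated assumptions: the filter is a hypothetical two-pole resonator at 1000 Hz, and the sample rate, bandwidth and frame length are arbitrary.

```python
import cmath
import math

SAMPLE_RATE = 8000    # Hz (assumed)
CENTRE_HZ = 1000.0    # resonance of the hypothetical filter
BANDWIDTH_HZ = 100.0
N = 256               # analysis frame length in samples

def resonator(x):
    """Two-pole resonator: y[n] = x[n] + a1*y[n-1] + a2*y[n-2]."""
    r = math.exp(-math.pi * BANDWIDTH_HZ / SAMPLE_RATE)
    theta = 2 * math.pi * CENTRE_HZ / SAMPLE_RATE
    a1, a2 = 2 * r * math.cos(theta), -r * r
    y = []
    for n, sample in enumerate(x):
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y.append(sample + a1 * y1 + a2 * y2)
    return y

# Time domain: the filter's output for a single impulse.
impulse = [1.0] + [0.0] * (N - 1)
impulse_response = resonator(impulse)

# Frequency domain: magnitude of the DFT of the impulse response (0 .. Nyquist).
frequency_response = [
    abs(sum(h * cmath.exp(-2j * cmath.pi * k * t / N)
            for t, h in enumerate(impulse_response)))
    for k in range(N // 2 + 1)
]

peak_bin = max(range(len(frequency_response)), key=frequency_response.__getitem__)
peak_hz = peak_bin * SAMPLE_RATE / N
print(peak_hz)
```

The impulse response is a decaying oscillation, and the peak of the frequency response lands within one DFT bin of the resonator's centre frequency: the same filter described in the time domain and in the frequency domain.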

Source-filter model

Finally, we arrive at a complete model of speech signals that can generate any speech sound.

We find the impulse response (or, equivalently, the frequency response) of the filter from the original sound.
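Putting the pieces together, a minimal source-filter sketch drives a filter with an impulse train: the source sets $F_0$ and the filter sets the spectral envelope. The single 1000 Hz resonator standing in for the vocal tract, the sample rate and all function names below are assumptions for illustration; a real vocal tract has several formants.

```python
import cmath
import math

def resonator(x, centre_hz, bandwidth_hz, sample_rate):
    """Two-pole resonator standing in for the vocal tract filter."""
    r = math.exp(-math.pi * bandwidth_hz / sample_rate)
    theta = 2 * math.pi * centre_hz / sample_rate
    a1, a2 = 2 * r * math.cos(theta), -r * r
    y = []
    for n, sample in enumerate(x):
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y.append(sample + a1 * y1 + a2 * y2)
    return y

def magnitude_at(x, freq_hz, sample_rate):
    """Magnitude of the Fourier transform of x at a single frequency."""
    return abs(sum(s * cmath.exp(-2j * cmath.pi * freq_hz * t / sample_rate)
                   for t, s in enumerate(x)))

def strongest_harmonic(f0_hz, formant_hz=1000.0, bandwidth_hz=100.0,
                       sample_rate=8000, n_samples=800):
    """Synthesise impulse train -> filter, return the strongest harmonic (Hz)."""
    period = round(sample_rate / f0_hz)
    source = [1.0 if t % period == 0 else 0.0 for t in range(n_samples)]
    speech = resonator(source, formant_hz, bandwidth_hz, sample_rate)
    harmonics = [k * f0_hz for k in range(1, int(sample_rate / 2 / f0_hz))]
    return max(harmonics, key=lambda f: magnitude_at(speech, f, sample_rate))

# Same filter, two different fundamental frequencies.
print(strongest_harmonic(100))
print(strongest_harmonic(200))
```

Changing $F_0$ moves the harmonics, but the strongest harmonic stays at the filter's resonance in both cases: the source and the filter are controlled independently.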

Phoneme

The source-filter model brings together our understanding of speech signals, speech production, and phonetics. It can generate any speech sound: any phoneme.

Summary


Origin: Module 4 the Source-Filter Model
Translate + Edit: YangSier (Homepage)

Originally published 2022-10-18 on the author's personal site/blog.
