问Python sounddevice.rec()，dtype =‘int8 8’量化为零问题
EN

Stack Overflow用户

提问于 2021-12-07 18:24:54

回答 1查看 280关注 0票数 0

我试图用不同的dtype(位/样本很明显)来绘制我的语音信号。所以我试着用dtype = 'int16‘来捕捉我的声音，这个情节很有意义。但是我试着用dtype = 'int8‘在相同的声音水平上说话，我的情节是零线。

为什么会发生这种事

一种想法是，可能8位的量化器有一个更大的死区，所以对于相同的输入语音级别，量化器会降低0的值。当然，我为量化器的类型做了一个假设。我还没有看到量化器是否是均匀的死区。下面是我的代码和情节

import matplotlib.pyplot as plt
import numpy as np
import sounddevice as sd

Fs = 8000  # Sampling frequency
duration = 5  # Recording duration in seconds
voice = sd.rec(frames=duration * Fs, samplerate=Fs, channels=1, dtype='int16')  # Capture the voice
# frames indicate  indirectly the duration of record, dtype is 16 bits per sample.
sd.wait()  # close after recording finish
time = np.linspace(0, len(voice - 1) / Fs, len(voice - 1))  # split x axis in voice-1 points
print(voice)  # points have 1/Fs distance each other
plt.plot(time, voice)  # plot in seconds
plt.title("Voice Signal")
plt.xlabel("Time [seconds]")
plt.ylabel("Voice amplitude")
plt.show()