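I am very new to the text-to-speech (TTS) part of Microsoft Azure Cognitive Services. When my Azure TTS XML document contains a single voice element, I can successfully use the Azure voice service to convert the given text into an audio file. A working SSML example: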
<speak version="1.0" xml:lang="en-US">
<voice xml:lang="en-US" xml:gender="Male" name="en-US-Jessa24kRUS">
Hello, this is my sample text to convert into audio?
</voice>
</speak>
However, when I use multiple voice tags (switching voices by gender), the request fails with an error. The SSML is:
<speak version="1.0" xml:lang="en-US">
<voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> What’s your name? </voice>
<voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> My name is Cindy Smith. Do you know John Silver?</voice>
<voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> John and I are old friends. </voice>
<voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> John just joined our company as a salesperson. </voice>
<voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> That’s good news. John has been a salesperson for chemical products for many years. </voice>
<voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> I head he really likes his new job.</voice>
</speak>
The error is:
Response status code does not indicate success: 400 (SSML must contain a maximum of 5 voice elements. Actual number: 6).
It would help me a lot if someone could explain why I am limited to 5 voice tags, when no such limit is mentioned in the [documentation](https://learn.microsoft.com/bs-cyrl-ba/azure/cognitive-services/speech-service/speech-synthesis-markup).
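Posted on 2020-02-05 08:26:45
This is a known setting that exists because of latency. We are aware of it and are working on removing the limit. We hope to have the fix completed and deployed within this week, and possibly sooner if things go well.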
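Until the limit is lifted, one possible stopgap is to split the SSML into several documents with at most 5 voice elements each and synthesize them as separate requests. The following is only a minimal sketch under assumptions: `split_ssml` and `MAX_VOICES` are hypothetical names, the value 5 is taken from the error message above, and the SSML is assumed to have no default namespace and only voice elements directly under speak, as in the question.

```python
import copy
import xml.etree.ElementTree as ET

# Assumption: the limit of 5 comes from the 400 error message in the question.
MAX_VOICES = 5


def split_ssml(ssml: str, max_voices: int = MAX_VOICES) -> list[str]:
    """Split one SSML document into several, each holding at most
    max_voices <voice> children under its own <speak> root."""
    root = ET.fromstring(ssml)
    # Assumption: un-namespaced SSML, so the tag is literally "voice".
    voices = [child for child in root if child.tag == "voice"]

    chunks = []
    for start in range(0, len(voices), max_voices):
        # Reuse the original <speak> attributes (version, xml:lang).
        speak = ET.Element(root.tag, dict(root.attrib))
        speak.extend([copy.deepcopy(v) for v in voices[start:start + max_voices]])
        chunks.append(ET.tostring(speak, encoding="unicode"))
    return chunks
```

With the 6-voice document from the question this would yield two SSML strings (5 voices, then 1). Each string would then be sent as its own request and the resulting audio joined afterwards; that joining step is not shown here and depends on the output format.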
https://stackoverflow.com/questions/60021762