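Evaluation mode description
Evaluation requirements: supports multiple groups of Chinese characters. The maximum audio duration is 60 seconds.
Evaluation dimensions: returns word accuracy and word fluency; returns phoneme accuracy.
Evaluation features: real-time evaluation, multiple text groups, and specified pronunciation.
This mode is a superset of word mode. It additionally accepts branch-structured input and streams intermediate results.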
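Request parameters
Main request parameters:
Parameter | Type | Description |
ref_text | String | Text to be evaluated. Supports multiple groups of Chinese characters. Use the "|" character to separate the branches |
eval_mode | Integer | Evaluation mode. 7: word real-time evaluation mode |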
Request example
# The parameters below are the websocket connection URL expanded, e.g. soe.cloud.tencent.com/soe/api/1306***?eval_mode=0&voice_format=1&...
server_engine_type=16k_zh
eval_mode=7
ref_text="你 | 我 | 他"
score_coeff=1.000000
voice_format=1
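The branch syntax above can be assembled programmatically. The following is a minimal sketch that joins the branches with "|" and URL-encodes only the parameters shown in the example. The helper names `build_ref_text` and `build_query` are hypothetical, and the sketch assumes the quotes around `ref_text` in the example are for display only. Authentication, signing, and any other required parameters are not covered here.

```python
from urllib.parse import quote, urlencode


def build_ref_text(branches):
    """Join candidate branches with ' | ', as in the request example."""
    cleaned = [b.strip() for b in branches if b.strip()]
    if not cleaned:
        raise ValueError("at least one non-empty branch is required")
    return " | ".join(cleaned)


def build_query(branches):
    """Return a URL-encoded query string for the parameters in the example.

    Only the parameters from the request example are included; this is
    an illustration, not a complete request.
    """
    params = {
        "server_engine_type": "16k_zh",
        "eval_mode": 7,
        "ref_text": build_ref_text(branches),
        "score_coeff": 1.0,
        "voice_format": 1,
    }
    # quote_via=quote encodes spaces as %20 rather than '+'.
    return urlencode(params, quote_via=quote)


query = build_query(["你", "我", "他"])
print(query)
```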
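Response
Main response fields:
Parameter | Type | Description |
Words.PronAccuracy | Float | Word accuracy |
Words.PronFluency | Float | Word fluency |
Words.MatchTag | Integer | How the audio for the current word matches the text |
Words.PhoneInfos.PronAccuracy | Float | Phoneme accuracy |
Words.PhoneInfos.MatchTag | Integer | How the audio for the current phoneme matches the text |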
Response example
{
  "code": 0,
  "message": "a26fdf60-68d9-4176-8b0f-3d461f575daa_4",
  "voice_id": "a26fdf60-68d9-4176-8b0f-3d461f575daa",
  "result": {
    "SuggestedScore": 55.34214782714844,
    "PronAccuracy": 99.6158676147461,
    "PronFluency": 0.9618735313415527,
    "PronCompletion": 0.3333333432674408,
    "Words": [
      {
        "MemBeginTime": 360,
        "MemEndTime": 760,
        "PronAccuracy": 99.6158676147461,
        "PronFluency": 0.9618735313415527,
        "ReferenceWord": "",
        "Word": "我",
        "MatchTag": 0,
        "KeywordTag": 0,
        "PhoneInfos": [
          {
            "MemBeginTime": 360,
            "MemEndTime": 530,
            "PronAccuracy": 99.60150146484375,
            "DetectedStress": false,
            "Phone": "w",
            "ReferencePhone": "",
            "ReferenceLetter": "",
            "Stress": false,
            "MatchTag": 0
          },
          {
            "MemBeginTime": 530,
            "MemEndTime": 760,
            "PronAccuracy": 99.62305450439453,
            "DetectedStress": false,
            "Phone": "o3",
            "ReferencePhone": "",
            "ReferenceLetter": "",
            "Stress": false,
            "MatchTag": 0
          }
        ],
        "Tone": {
          "Valid": false,
          "RefTone": -1,
          "HypothesisTone": -1
        }
      }
    ],
    "SentenceId": -1,
    "RefTextId": -1,
    "KeyWordHits": null,
    "UnKeyWordHits": null
  },
  "final": 1
}
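To illustrate how the documented fields might be read from a response like the one above, here is a minimal sketch that extracts the word-level and phoneme-level scores. The function name `summarize` is hypothetical, the sample payload is trimmed to the fields it uses, and treating a non-zero `code` as an error is an assumption rather than something this section states.

```python
import json

# Trimmed copy of the response example, keeping only the fields read below.
SAMPLE = """
{"code": 0, "result": {"Words": [{"Word": "我",
  "PronAccuracy": 99.6158676147461, "PronFluency": 0.9618735313415527,
  "MatchTag": 0,
  "PhoneInfos": [
    {"Phone": "w", "PronAccuracy": 99.60150146484375, "MatchTag": 0},
    {"Phone": "o3", "PronAccuracy": 99.62305450439453, "MatchTag": 0}
  ]}]}, "final": 1}
"""


def summarize(response_text):
    """Return one dict per evaluated word with its phoneme scores."""
    payload = json.loads(response_text)
    # Assumption: a non-zero code signals a failed request.
    if payload.get("code") != 0:
        raise RuntimeError(f"evaluation failed: {payload.get('message')}")
    rows = []
    for word in payload["result"]["Words"]:
        phones = [(p["Phone"], p["PronAccuracy"]) for p in word["PhoneInfos"]]
        rows.append({
            "word": word["Word"],
            "accuracy": word["PronAccuracy"],
            "fluency": word["PronFluency"],
            "phones": phones,
        })
    return rows


for row in summarize(SAMPLE):
    print(row["word"], row["accuracy"], row["fluency"], row["phones"])
```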
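Specified pronunciation
Request parameters
Main request parameters:
Parameter | Type | Description |
ref_text | String | Text to be evaluated. Use the "|" character to separate the branches |
eval_mode | Integer | Evaluation mode. 7: word real-time evaluation mode |
text_mode | Integer | Input text mode. 0: plain text; 1: phoneme structure |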
Request example
# The parameters below are the websocket connection URL expanded, e.g. soe.cloud.tencent.com/soe/api/1306***?eval_mode=0&voice_format=1&...
server_engine_type=16k_zh
eval_mode=7
ref_text="你{::pron{ni2}} | 我 | 他"
score_coeff=1.000000
voice_format=1
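The pronunciation marker in the example above can be generated rather than typed by hand. This is a minimal sketch based only on the `{::pron{ni2}}` form shown in the example. The helpers `with_pron` and `build_ref_text` are hypothetical names, and the sketch assumes the marker takes a pinyin syllable followed by a tone number, as `ni2` suggests.

```python
def with_pron(char, pinyin):
    """Attach a pronunciation marker to a character, e.g. 你{::pron{ni2}}."""
    return char + "{::pron{" + pinyin + "}}"


def build_ref_text(items):
    """Build ref_text from plain strings and (char, pinyin) tuples.

    Tuples get a pronunciation marker; plain strings are used as-is.
    Branches are joined with ' | ' as in the request example.
    """
    parts = []
    for item in items:
        if isinstance(item, tuple):
            char, pinyin = item
            parts.append(with_pron(char, pinyin))
        else:
            parts.append(item)
    return " | ".join(parts)


print(build_ref_text([("你", "ni2"), "我", "他"]))
```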
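Response
Main response fields:
Parameter | Type | Description |
Words.PronAccuracy | Float | Word accuracy |
Words.PronFluency | Float | Word fluency |
Words.MatchTag | Integer | How the audio for the current word matches the text |
Words.PhoneInfos.PronAccuracy | Float | Phoneme accuracy |
Words.PhoneInfos.MatchTag | Integer | How the audio for the current phoneme matches the text |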
Response example
{
  "code": 0,
  "message": "ce39b1ab-5523-489a-ac10-4a91ad3ce69f_4",
  "voice_id": "ce39b1ab-5523-489a-ac10-4a91ad3ce69f",
  "result": {
    "SuggestedScore": 55.34214782714844,
    "PronAccuracy": 99.6158676147461,
    "PronFluency": 0.9618735313415527,
    "PronCompletion": 0.3333333432674408,
    "Words": [
      {
        "MemBeginTime": 360,
        "MemEndTime": 760,
        "PronAccuracy": 99.6158676147461,
        "PronFluency": 0.9618735313415527,
        "ReferenceWord": "",
        "Word": "我",
        "MatchTag": 0,
        "KeywordTag": 0,
        "PhoneInfos": [
          {
            "MemBeginTime": 360,
            "MemEndTime": 530,
            "PronAccuracy": 99.60150146484375,
            "DetectedStress": false,
            "Phone": "w",
            "ReferencePhone": "",
            "ReferenceLetter": "",
            "Stress": false,
            "MatchTag": 0
          },
          {
            "MemBeginTime": 530,
            "MemEndTime": 760,
            "PronAccuracy": 99.62305450439453,
            "DetectedStress": false,
            "Phone": "o3",
            "ReferencePhone": "",
            "ReferenceLetter": "",
            "Stress": false,
            "MatchTag": 0
          }
        ],
        "Tone": {
          "Valid": false,
          "RefTone": -1,
          "HypothesisTone": -1
        }
      }
    ],
    "SentenceId": -1,
    "RefTextId": -1,
    "KeyWordHits": null,
    "UnKeyWordHits": null
  },
  "final": 1
}
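Because this mode streams intermediate results, a client typically needs to tell intermediate messages from the final one. The sketch below shows one way to do that using the `final` field seen in the response example. It assumes that intermediate messages share the same schema and carry a `final` value other than 1, which this section does not state explicitly. The function name `collect_results` is hypothetical, and the messages are passed in as already-received JSON strings so that no websocket library is needed.

```python
import json


def collect_results(frames):
    """Split a sequence of JSON messages into intermediate and final results.

    Returns (intermediate_results, final_result). final_result is None if
    no message with "final": 1 was seen.
    """
    intermediate = []
    for raw in frames:
        msg = json.loads(raw)
        # Assumption: a non-zero code signals a failed request.
        if msg.get("code") != 0:
            raise RuntimeError(f"evaluation failed: {msg.get('message')}")
        result = msg.get("result")
        if msg.get("final") == 1:
            return intermediate, result
        if result is not None:
            intermediate.append(result)
    return intermediate, None


# Illustrative messages only; the scores here are made up for the demo.
frames = [
    '{"code": 0, "result": {"PronAccuracy": 80.0}, "final": 0}',
    '{"code": 0, "result": {"PronAccuracy": 99.6}, "final": 1}',
]
partial, final = collect_results(frames)
print(len(partial), final)
```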