Chrome 将SSML与Web Speech API结合使用的正确方法

ryevplcw 于 2022-12-06 发布在 Go

关注(0)|答案(4)|浏览(189)

Web Speech API规范指出：

*文本 * 属性

此属性指定此话语要合成与朗读得文本.它可以是纯文本或完整得，格式正确得SSML文档.对于不支持SSML或只支持某些标记得语音合成引擎，用户代理或语音引擎必须去掉它们不支持得标记并朗读文本.
它没有提供在SSML文档中使用text的示例。
我在Chrome 33中尝试了以下操作：

var msg = new SpeechSynthesisUtterance();
msg.text = '<?xml version="1.0"?>\r\n<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">ABCD</speak>';
speechSynthesis.speak(msg);

它不起作用--声音试图叙述XML标记。此代码有效吗？
我必须提供XMLDocument对象吗？
我试图了解Chrome是否违反了规范（应该作为bug报告），或者我的代码是否无效。

google-chrome

来源：https://stackoverflow.com/questions/21952736/the-right-way-to-use-ssml-with-web-speech-api

4条答案

按热度按时间

j8ag8udp1#

在Chrome 46中，当语言设置为en时，XML在Windows上被正确地解释为XML文档;但是，我没有看到任何证据表明这些标记实际上在做任何事情。我没有听到这个SSML的<emphasis>和非<emphasis>版本之间有什么区别：

var msg = new SpeechSynthesisUtterance();
msg.text = '<?xml version="1.0"?>\r\n<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US"><emphasis>Welcome</emphasis> to the Bird Seed Emporium.  Welcome to the Bird Seed Emporium.</speak>';
msg.lang = 'en';
speechSynthesis.speak(msg);

<phoneme>标签也被完全忽略了，这让我说IPA的尝试失败了。

var msg = new SpeechSynthesisUtterance();
msg.text='<?xml version="1.0" encoding="ISO-8859-1"?> <speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.w3.org/2001/10/synthesis http://www.w3.org/TR/speech-synthesis/synthesis.xsd" xml:lang="en-US"> Pavlova is a meringue-based dessert named after the Russian ballerina Anna Pavlova. It is a meringue cake with a crisp crust and soft, light inside, usually topped with fruit and, optionally, whipped cream.  The name is pronounced <phoneme alphabet="ipa" ph="p&aelig;v&#712;lo&#650;v&#601;">...</phoneme> or <phoneme alphabet="ipa" ph="p&#593;&#720;v&#712;lo&#650;v&#601;">...</phoneme>, unlike the name of the dancer, which was <phoneme alphabet="ipa" ph="&#712;p&#593;&#720;vl&#601;v&#601;">...</phoneme> </speak>';
msg.lang = 'en';
speechSynthesis.speak(msg);

尽管微软语音API * 确实 * 正确处理SSML，但这是一个事实。下面是一个适合在LinqPad中使用的C#代码片段：

var str = "Pavlova is a meringue-based dessert named after the Russian ballerina Anna Pavlova. It is a meringue cake with a crisp crust and soft, light inside, usually topped with fruit and, optionally, whipped cream.  The name is pronounced /pævˈloʊvə/ or /pɑːvˈloʊvə/, unlike the name of the dancer, which was /ˈpɑːvləvə/.";
var regex = new Regex("/([^/]+)/");
if (regex.IsMatch(str))
{
    str = regex.Replace(str, "<phoneme alphabet=\"ipa\" ph=\"$1\">word</phoneme>");
    str.Dump();
}   
SpeechSynthesizer synth = new SpeechSynthesizer();
PromptBuilder pb = new PromptBuilder();
pb.AppendSsmlMarkup(str);
synth.Speak(pb);

赞(0）回复(0）举报 2022-12-06

sbtkgmzw2#

Chromium当前存在与此问题相关的漏洞。

88072：扩展TTS API平台实现需要支持SSML
428902：speechSynthesis.speak（）无法删除无法识别的标签此错误已在Chrome中修复（截至2016年9月）。

赞(0）回复(0）举报 2022-12-06

slhcrj9b3#

我已经在Chrome 104.0.5112.101（Linux）上试过了。没有用。当我检查调试控制台时，我得到了这样的消息：

speechSynthesis.speak() without user activation is deprecated and will be removed

添加一个像The question of whether speechSynthesis is allowed to run without user interaction中提到的按钮确实对我有用。至少可以说出文本，而不是SSML格式的文本。

赞(0）回复(0）举报 2022-12-06

ix0qys7i4#

我已经测试过了，XML解析似乎在Windows中可以正常工作，但在MacOS中却不能正常工作。

赞(0）回复(0）举报 2022-12-06

我来回答

Chrome 将SSML与Web Speech API结合使用的正确方法

4条答案

相关问题

热门标签

最新问答