RECOGNIZE_SPEECH

c# - Google Speech API returns {"result":[]} when getting text from an audio file in C#

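I am trying to create a Windows application that takes an audio file I have and uses the Google Speech Recognition API to transcribe the speech in it to a text file. Here is what I did: 1) I went to https://groups.google.com/a/chromium.org/forum/?fromgroups#!forum/chromium-dev and became a member. 2) I went to my Google Developers Console and successfully generated an API key. 3) I found some code online and ran it: private void btnGoogle_Click(object sender, Even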

c# HWR_SpeechToText SpeechToText windows google-api google-speech-api

.net - What is the difference between "Windows.Media.SpeechSynthesis" and "System.Speech.Synthesis"?

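I am trying to determine which of these two APIs offers more features for text-to-speech in a professional application developed in C#. The operating system is not an issue here; the question is how the two namespaces compare in features, voice quality, and stability. Is anyone well-versed in both technologies and able to tell me how the two namespaces differ? In terms of features, is one a superset of the other? Edit: is the same speech synthesis engine behind both namespaces? My web application will do all of its text-to-speech work on the server side. Best answer: Windows.Media.SpeechSynthesis is part of the Windows Runtime and only supports Windows Store apps. It cannot, from your server app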

SpeechSynthesis Windows .net speech-synthesis

.net - Strange problem with the System.Speech speech synthesizer

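I am developing a program that includes speech synthesis. A few weeks ago I wrote an introductory program: using (SpeechSynthesizer s = new SpeechSynthesizer()) { s.SetOutputToWaveFile("file.wav"); s.Speak(textBox1.Text); } It worked fine. I crossed "research speech synthesis" off my task list and moved on to other parts of the project. Now I am writing the real program and trying to use the same basic code block. However, it now fails on the s.SetOutputToWaveFile call. It throws a PlatformNotSupportedException with the following message: "No voice installed on the system or none available with the current security set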

System Speech SpeechSynthesizer .net windows text-to-speech

c# - Recording WAV to IBM Watson Speech-To-Text

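I am trying to record audio and immediately send it to IBM Watson Speech-To-Text for transcription. I have tested Watson with a WAV file loaded from disk, and that worked. I have also tested recording from the microphone and storing the result to disk, which works well too. But when I record audio with NAudio WaveIn, the result from Watson is empty, as if there were no audio. Can anyone shed some light on this, or does anyone have some ideas? private async void StartHere() { var ws = new ClientWebSocket(); ws.Options.Credentials = new NetworkCredential("*****", "*****"); awai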

c# Speech-To-Text CancellationToken writer Encoding .net watson watson-conversation

c# - System.Speech.Recognition alternate matches and confidence values

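I am using the System.Speech.Recognition namespace to recognize spoken sentences. I am interested in the alternate sentences the recognizer provides and in their confidence scores. From the documentation of the [RecognitionResult.Alternates][1] property: Recognition Alternates are ordered by the values of their Confidence properties. The confidence value of a given phrase indicates the probability that the phrase matches the input. The phrase with the highest confidence val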

confidence c# .net speech-recognition

c# - System.Speech.Synthesis hangs with high CPU on 2012 R2

I have an ASP.NET MVC application with a controller action that takes a string as input and sends back a WAV file of the synthesized speech as the response. Here is a simplified example: public async Task Speak(string text) { Task task = Task.Run(() => { using (var synth = new System.Speech.Synthesis.SpeechSynthesizer()) using (var stream = new MemoryStream()) { synth.SetOutputToWaveStream(stream); synth.Speak(text); var bytes = stream.

c# Synthesis asp.net-mvc text-to-speech windows-server-2012-r2 speech-synthesis

javascript - SpeechSynthesis.speak (in the Web Speech API) always stops after a few seconds in Google Chrome

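When using the speak function of the Web Speech API in Chrome, the speech stops abruptly after a few seconds, in the middle of the text given to it, at a seemingly random point (without reaching the end). This only happens in Chrome (it works fine in Firefox) and was tested on two different computers/systems. See this jsfiddle to see/hear it: https://jsfiddle.net/fv9ochpq/ You can see that the SpeechSynthesis object's .speaking flag stays on (true) after the speech has stopped. I have not seen any documented limit on the text passed to an utterance. Is this a bug in Google Chrome? By the way, I have known about this since 2014, when I was trying, in a browser extension I made, to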

SpeechSynthesis javascript google-chrome text-to-speech speech-synthesis webspeech-api
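
The excerpt above is cut off before any answer, so the following is only a sketch of one commonly reported workaround, not a confirmed fix: split long text into short chunks and queue one utterance per chunk, relying on `speechSynthesis.speak` queueing utterances in order. The helper names `chunkText` and `speakInChunks` and the 150-character limit are assumptions made for illustration; they do not come from the original question.

```javascript
// Hypothetical helper: split text into chunks of at most maxLen characters,
// preferring to break at sentence boundaries (., !, ?).
function chunkText(text, maxLen = 150) {
  const sentences = text.match(/[^.!?]+[.!?]*\s*/g) || [];
  const chunks = [];
  let current = "";
  for (const sentence of sentences) {
    // Close the current chunk if adding this sentence would exceed the limit.
    if (current && (current + sentence).length > maxLen) {
      chunks.push(current.trim());
      current = "";
    }
    current += sentence;
    // A single sentence longer than maxLen is hard-split at maxLen.
    while (current.length > maxLen) {
      chunks.push(current.slice(0, maxLen).trim());
      current = current.slice(maxLen);
    }
  }
  chunks.push(current.trim());
  return chunks.filter((chunk) => chunk.length > 0);
}

// Hypothetical helper: queue one utterance per chunk. The synthesizer and the
// utterance constructor are passed in so the function has no browser globals.
function speakInChunks(synth, Utterance, text, maxLen = 150) {
  const chunks = chunkText(text, maxLen);
  for (const chunk of chunks) {
    synth.speak(new Utterance(chunk));
  }
  return chunks.length;
}
```

In a browser this would be called as `speakInChunks(window.speechSynthesis, SpeechSynthesisUtterance, longText)`. Passing the two browser objects as parameters is a design choice that keeps the chunking logic testable outside a browser.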

javascript - Grammar in the Google Web Speech API

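Can I improve the Google speech API's recognition by giving it a list of words (in my case, the user's requests are very predictable) so that recognition is more accurate? Best answer: The correct answer is: no, you can't. =( A similar question about javascript - Grammar in the Google Web Speech API can be found on Stack Overflow: https://stackoverflow.com/questions/7433801/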

javascript Google stackoverflow google-chrome speech-recognition speech-to-text

javascript - Angular 2 : Web Speech API - Voice recognition

After reading the documentation for webkitSpeechRecognition (speech recognition in JavaScript), I tried to implement it in Angular 2. But when I do this: const recognition = new webkitSpeechRecognition(); TypeScript reports this error: [ts] Cannot find name 'webkitSpeechRecognition'. any If I try to get webkitSpeechRecognition from window: if ('webkitSpeechRecognition' in window) { console.log("Enters inside the conditi

recognition javascript webkitSpeechRecognition angular voice-recognition typescript1.8 webspeech-api
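
The excerpt above ends before any answer, so this is only a sketch of one common approach: look the constructor up as a property of `window` at runtime instead of referencing the bare global name that the compiler does not know. It is written in plain JavaScript; in TypeScript the same lookup is often written with a cast such as `(window as any).webkitSpeechRecognition`. The function names `getSpeechRecognitionCtor` and `createRecognition` and the default `"en-US"` language are assumptions for illustration.

```javascript
// Hypothetical helper: return the speech recognition constructor from a
// window-like object, or null when the browser does not provide one.
function getSpeechRecognitionCtor(win) {
  if (!win) return null;
  return win.SpeechRecognition || win.webkitSpeechRecognition || null;
}

// Hypothetical helper: create a configured recognizer, or null if unsupported.
function createRecognition(win, lang = "en-US") {
  const Ctor = getSpeechRecognitionCtor(win);
  if (!Ctor) return null;
  const recognition = new Ctor();
  recognition.lang = lang;            // language to recognize
  recognition.interimResults = false; // deliver final results only
  return recognition;
}
```

In a browser, `createRecognition(window)` returns a recognizer whose `onresult` handler can then be set, or `null` where the API is missing, so the caller can fall back gracefully.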

php - Google Cloud Speech API returns empty results

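I am using the Google Cloud Speech API. When I run my script, the API is called and responds. The operation info returns data, but the results are empty. Here is my code (I removed the real file URL, file name, key URL, project name, and bucket name): function __construct() { $file_url = 'filepath.mp3'; $filename = 'filename.mp3'; /** Create google client **/ $client = new Google_Client(); $key = 'pathtogooglekey'; putenv($key); $client->useApplicationDefaultCredentials(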

Google speech operation php encoding google-cloud-speech
