Difference between revisions of "Automated Speech Recognition"

From Engineered Arts Wiki
Jump to navigation Jump to search
(Google Cloud Speech API)
Line 15: Line 15:
 
</pre>
 
</pre>
  
The ''Before You Begin'' section of the [https://cloud.google.com/speech-to-text/docs/quickstart-client-libraries|Quickstart: Using Client Libraries] tutorial covers this process.  (You do not need to follow the rest of the tutorial.)
+
The ''Before You Begin'' section of the [https://cloud.google.com/speech-to-text/docs/quickstart-client-libraries Quickstart: Using Client Libraries] tutorial covers this process.  (You do not need to follow the rest of the tutorial.)

Revision as of 10:24, 17 May 2018

Speech to text is provided by the Speech Recognition Tritium node. The actual recognition is done by a configurable backend, with Google's Cloud Speech API as the default (and currently only) option.

Google Cloud Speech API

To use the API the node must provide authentication credentials.

The process to create new credentials is as follows

  1. Create a Google Cloud API project
  2. Generate a private service account key JSON file
  3. Copy it onto the robot at...
/opt/tritium/nodes/speech_recognition/google_application_credentials.json

The Before You Begin section of the Quickstart: Using Client Libraries tutorial covers this process. (You do not need to follow the rest of the tutorial.)