Difference between revisions of "Automated Speech Recognition"
Jump to navigation
Jump to search
Line 1: | Line 1: | ||
− | Speech to text is provided by the [[Tritium Node - Speech Recognition|Speech Recognition]] Tritium node. The actual recognition is done by a configurable backend, with Google's Cloud Speech API as the default (and currently only) option. | + | Speech to text is provided by the [[Tritium Node - Speech Recognition|Speech Recognition]] Tritium node. The actual recognition is done by a configurable backend, with Google's Cloud Speech-to-Text API as the default (and currently only) option. |
− | ==Google Cloud Speech API== | + | ==Google Cloud Speech-to-Text API== |
To use the API the node must provide authentication credentials. | To use the API the node must provide authentication credentials. |
Revision as of 10:25, 17 May 2018
Speech to text is provided by the Speech Recognition Tritium node. The actual recognition is done by a configurable backend, with Google's Cloud Speech-to-Text API as the default (and currently only) option.
Google Cloud Speech-to-Text API
To use the API the node must provide authentication credentials.
The process to create new credentials is as follows
- Create a Google Cloud API project
- Generate a private service account key JSON file
- Copy it onto the robot at...
/opt/tritium/nodes/speech_recognition/google_application_credentials.json
The Before You Begin section of the Quickstart: Using Client Libraries tutorial covers this process. (You do not need to follow the rest of the tutorial.)