Automated Speech Recognition
Revision as of 10:08, 17 May 2018 by Rob (talk | contribs) (Created page with "Speech to text is provided by the Speech Recognition Tritium node. The actual recognition is done by a configurable backend, with Google...")
Speech to text is provided by the Speech Recognition Tritium node. The actual recognition is done by a configurable backend, with Google's Cloud Speech API as the default (and currently only) option.
Google Cloud Speech API
To use the API the node must provide authentication credentials.
The process is as follows
- Create a Google Cloud API project
- Generate a private service account key JSON file
- Copy it onto the robot at...
/opt/tritium/nodes/speech_recognition/google_application_credentials.json
The Before You Begin section of the Using Client Libraries tutorial covers this process. (You do not need to follow the rest of the tutorial.)