Automated Speech Recognition
Jump to navigation
Jump to search
Speech to text is provided by the Speech Recognition Tritium node. The actual recognition is done by a configurable backend, with Google's Cloud Speech API as the default (and currently only) option.
Google Cloud Speech API
To use the API the node must provide authentication credentials.
The process to create new credentials is as follows
- Create a Google Cloud API project
- Generate a private service account key JSON file
- Copy it onto the robot at...
/opt/tritium/nodes/speech_recognition/google_application_credentials.json
The Before You Begin section of the Using Client Libraries tutorial covers this process. (You do not need to follow the rest of the tutorial.)