Automated Speech Recognition

From Engineered Arts Wiki
Revision as of 10:08, 17 May 2018 by Rob (talk | contribs) (Created page with "Speech to text is provided by the Speech Recognition Tritium node. The actual recognition is done by a configurable backend, with Google...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Speech to text is provided by the Speech Recognition Tritium node. The actual recognition is done by a configurable backend, with Google's Cloud Speech API as the default (and currently only) option.

Google Cloud Speech API

To use the API the node must provide authentication credentials.

The process is as follows

  1. Create a Google Cloud API project
  2. Generate a private service account key JSON file
  3. Copy it onto the robot at...
/opt/tritium/nodes/speech_recognition/google_application_credentials.json

The Before You Begin section of the Using Client Libraries tutorial covers this process. (You do not need to follow the rest of the tutorial.)