Difference between revisions of "Automated Speech Recognition"

From Engineered Arts Wiki

Revision as of 10:26, 17 May 2018


Speech to text is provided by the Speech Recognition Tritium node. The actual recognition is done by a configurable backend, with Google's Cloud Speech-to-Text API as the default (and currently only) option.

Google Cloud Speech-to-Text API

To use the API, the node must provide authentication credentials.

The process to create new credentials is as follows:

  1. Create a Google Cloud API project
  2. Generate a private service account key JSON file
  3. Copy it onto the robot at /opt/tritium/nodes/speech_recognition/google_application_credentials.json

The Before You Begin section of the Quickstart: Using Client Libraries tutorial covers this process. (You do not need to follow the rest of the tutorial.)
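
After copying the key file, it can be useful to confirm that it is in place and looks like a service account key before starting the node. The following is a minimal sketch of such a check, not part of Tritium: the function name check_service_account_key is hypothetical, and the list of required fields is an assumption based on the usual layout of a Google service account key JSON file. It does not contact Google or verify that the key is still valid.

```python
import json
from pathlib import Path

# Location from step 3 of the credentials process.
CREDENTIALS_PATH = Path(
    "/opt/tritium/nodes/speech_recognition/google_application_credentials.json"
)

# Assumption: fields normally present in a service account key file.
REQUIRED_FIELDS = ("type", "project_id", "private_key", "client_email")


def check_service_account_key(path):
    """Return a list of problems found with the key file (empty if none)."""
    path = Path(path)
    if not path.is_file():
        return [f"missing file: {path}"]
    try:
        data = json.loads(path.read_text())
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc}"]
    if not isinstance(data, dict):
        return ["top-level JSON value is not an object"]
    problems = [
        f"missing field: {name}" for name in REQUIRED_FIELDS if name not in data
    ]
    # A user or API key file would carry a different "type" value.
    if "type" in data and data["type"] != "service_account":
        problems.append("'type' is not 'service_account'")
    return problems


if __name__ == "__main__":
    issues = check_service_account_key(CREDENTIALS_PATH)
    print("OK" if not issues else "\n".join(issues))
```

Run on the robot, the script prints OK when the file exists and contains the expected fields, and otherwise lists what is missing.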