Configure the Google Cloud Text to Speech Converter

Text-to-Speech AI is a Google Cloud service that converts text into natural-sounding speech through an API powered by Google’s AI technologies.

To configure Google Cloud Text-to-Speech AI as the text-to-speech converter used by the Real Voice plugin, proceed as follows:

Visit the Real Voice -> Options menu
Proceed to the Text-to-Speech tab
In the Text-to-speech Converter option, select Google Text-to-Speech API (Cloud service). This option is available in the General section.
Click Save Settings

To use this text-to-speech converter, you must also configure the credentials used to identify your API requests. You can configure the credentials with the following procedure:

Visit the Real Voice -> Options menu
Proceed to the Text-to-Speech tab
Enter your key in the Google Cloud Secret Access Key option available in the Google Cloud Text-to-speech section
Click Save Settings
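
Before saving the key, it can be useful to check that the service accepts it. The following is a minimal sketch, not part of the plugin, and it assumes that the value entered in the Google Cloud Secret Access Key option is a standard Google Cloud API key that can be passed in the `key` query parameter of the public REST API. The function names `build_voices_url` and `key_is_accepted` are hypothetical.

```python
import json
import urllib.error
import urllib.parse
import urllib.request

# Public REST endpoint that lists the voices offered by the service.
VOICES_ENDPOINT = "https://texttospeech.googleapis.com/v1/voices"


def build_voices_url(api_key, language_code=None):
    """Return the voice-list URL, authenticated with an API key.

    Assumption: the key is a Google Cloud API key, sent as the
    `key` query parameter.
    """
    if not api_key:
        raise ValueError("api_key must not be empty")
    params = {"key": api_key}
    if language_code:
        # Optional filter, a BCP-47 tag such as "en-US".
        params["languageCode"] = language_code
    return VOICES_ENDPOINT + "?" + urllib.parse.urlencode(params)


def key_is_accepted(api_key, timeout=10):
    """Return True if the service answers with a voice list.

    HTTP errors (for example a rejected key) return False.
    Network failures are not caught and raise urllib.error.URLError.
    """
    try:
        with urllib.request.urlopen(build_voices_url(api_key), timeout=timeout) as resp:
            return "voices" in json.load(resp)
    except urllib.error.HTTPError:
        return False
```

Calling `key_is_accepted("YOUR_KEY")` performs one network request. A `False` result suggests that the key is invalid or that the Text-to-Speech API is not enabled for the associated project.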

The plugin provides additional options to configure your use of the Google Cloud Text-to-Speech service. These options are available in the Google Cloud Text-to-speech section of the Text-to-Speech tab:

Audio Encoding – Select one of the audio encodings supported by the service.
Speaking Rate – Speaking rate/speed, in the range [0.25, 4.0]. 1.0 is the normal native speed supported by the specific voice. 2.0 is twice as fast, and 0.5 is half as fast.
Speaking Pitch – Speaking pitch, in the range [-20.0, 20.0]. 20 means increase 20 semitones from the original pitch. -20 means decrease 20 semitones from the original pitch.
Volume Gain db – Volume gain (in dB) relative to the normal native volume supported by the specific voice, in the range [-96.0, 16.0]. A value of 0.0 leaves the volume unchanged.
Sample Rate – The synthesis sample rate (in hertz) for this audio. Note that this value affects both the audio quality and the size of the generated audio files.
Effects Profile ID – Optionally select one or more audio profiles. Effects are applied on top of each other in the order they are given.
Language Code – The language of the voice as a BCP-47 language tag. Note that this value should match the language code of the selected voice name.
Voice Name – Enter the voice that will be used to speak the utterance.
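
To illustrate how these options relate to the underlying service, the sketch below builds the JSON body of a `text:synthesize` request for the Google Cloud Text-to-Speech REST API and checks the documented ranges first. It is an illustration under assumptions, not the plugin's own code: the function name `build_synthesize_request`, its parameter names, and the default values are hypothetical, and the mapping from plugin options to API fields is inferred from the option descriptions above.

```python
def build_synthesize_request(text, language_code, voice_name,
                             audio_encoding="MP3", speaking_rate=1.0,
                             pitch=0.0, volume_gain_db=0.0,
                             sample_rate_hertz=None, effects_profile_ids=()):
    """Build the JSON body for a text:synthesize request.

    Hypothetical helper: each argument corresponds to one of the
    options listed above.
    """
    # Validate the ranges given in the option descriptions.
    if not 0.25 <= speaking_rate <= 4.0:
        raise ValueError("speaking_rate must be in [0.25, 4.0]")
    if not -20.0 <= pitch <= 20.0:
        raise ValueError("pitch must be in [-20.0, 20.0]")
    if not -96.0 <= volume_gain_db <= 16.0:
        raise ValueError("volume_gain_db must be in [-96.0, 16.0]")

    audio_config = {
        "audioEncoding": audio_encoding,    # Audio Encoding
        "speakingRate": speaking_rate,      # Speaking Rate
        "pitch": pitch,                     # Speaking Pitch
        "volumeGainDb": volume_gain_db,     # Volume Gain db
    }
    if sample_rate_hertz is not None:
        audio_config["sampleRateHertz"] = sample_rate_hertz  # Sample Rate
    if effects_profile_ids:
        # Effects are applied in the order given.
        audio_config["effectsProfileId"] = list(effects_profile_ids)

    return {
        "input": {"text": text},
        "voice": {"languageCode": language_code, "name": voice_name},
        "audioConfig": audio_config,
    }
```

For example, `build_synthesize_request("Hello", "en-US", "en-US-Wavenet-D", speaking_rate=1.25)` returns a dictionary that can be serialized with `json.dumps` and sent to the service. The voice name here is only an example; the language code should match the language of whichever voice you select.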
