Replies: 1 comment 1 reply
-
|
That depends on the model. A new model was released a few days ago that seems to be able to do what you ask for: |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Is there a way to influence the pronunciation, pacing, and emotion in the TTS output?
For instance, in ElevenLabs, placing quotation marks around a word can create stronger emphasis. The only methods I’ve found to actively control pacing involve using punctuation marks (e.g., . , ; : ? !) or adding ellipses or dashes for pauses, see https://github.com/erew123/alltalk_tts?tab=readme-ov-file#-tricks-to-get-the-model-to-say-things-correctly
Any other adjustments appear to be ignored.
Beta Was this translation helpful? Give feedback.
All reactions