Reusable instructor voices
Keep one recognizable instructor style across lessons, quizzes, walkthroughs, and updates.
For learning products
Use TextToSpeechSkills to create consistent educational audio for modules, onboarding paths, product academies, and internal training. Templates keep the instructor voice steady while expression tags add emphasis where learners need it.
TextToSpeechSkills helps learning teams create narration for lessons, training modules, product academies, onboarding paths, and accessibility audio. A course can keep one instructor voice through reusable templates, while expression tags add emphasis and pacing where learners need it. Non-technical course builders can ask an LLM app to adapt lesson text for speech, validate the tags, and generate audio through MCP. Developers can use the same templates and job API when the workflow becomes part of a learning platform.
Easy LLM setup
Non-technical course builders can ask an LLM to adapt lesson text for audio, validate the expression tags, and generate speech through the MCP tool.
Read setup guide
Expression tags add audio direction without making source material difficult to edit.
Generate audio versions of learning content and store usage records for operations teams.
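To illustrate why tagged scripts stay easy to edit, here is a minimal Python sketch that separates expression tags from the words a reviewer reads. It assumes a tag is any bracketed phrase, such as `[quiet]`, that applies to the text following it until the next tag; that reading is inferred from the API playground sample, not from a documented grammar. The names `split_tagged_text` and `plain_text` are hypothetical helpers, not part of TextToSpeechSkills.

```python
import re
from typing import List, Optional, Tuple

# Assumption: a tag is a bracketed phrase like "[quiet]" and applies to the
# text that follows it, up to the next tag. This is not an official grammar.
TAG_PATTERN = re.compile(r"\[([^\[\]]+)\]")


def split_tagged_text(script: str) -> List[Tuple[Optional[str], str]]:
    """Split a script into (tag, text) segments; tag is None before the first tag."""
    segments: List[Tuple[Optional[str], str]] = []
    current_tag: Optional[str] = None
    position = 0
    for match in TAG_PATTERN.finditer(script):
        # Text between the previous tag and this one belongs to the previous tag.
        text = script[position:match.start()].strip()
        if text:
            segments.append((current_tag, text))
        current_tag = match.group(1).strip()
        position = match.end()
    # Whatever follows the last tag belongs to that tag.
    tail = script[position:].strip()
    if tail:
        segments.append((current_tag, tail))
    return segments


def plain_text(script: str) -> str:
    """Return the script with tags removed, for reviewers who only need the words."""
    return " ".join(text for _, text in split_tagged_text(script))
```

A course builder could run `plain_text` on a lesson script to proofread the wording, then review the `(tag, text)` pairs separately to check that emphasis lands where learners need it.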
Course creators, learning platforms, internal training teams, and product educators usually need a repeatable path for writing, review, generation, billing, and reuse. The most important jobs here are reusable instructor voices, readable lesson scripts, and audio that is accessible by default. Those are the moments where voice becomes part of real work instead of a one-off export.
Start with readable text, add expression tags when tone matters, choose an approved voice template, and create a speech job through the UI, API, or MCP. The same pattern works for e-learning text-to-speech, course narration TTS, and training audio APIs, which makes it easier for humans and LLM apps to share one process without exposing internal routing or credentials.
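The workflow above can be sketched as a small Python helper that assembles a speech job body. The field names (`text`, `voice_template`, `generation_mode`, `format`) and the sample values come from the API playground example on this page; the function name `build_speech_job` is hypothetical, and the sketch deliberately stops before any network call because the endpoint and authentication details are not described here.

```python
import json


def build_speech_job(
    text: str,
    voice_template: str,
    generation_mode: str = "instant",  # value taken from the playground sample
    audio_format: str = "mp3",         # value taken from the playground sample
) -> dict:
    """Assemble a speech job body shaped like the API playground example."""
    if not text.strip():
        raise ValueError("text must not be empty")
    if not voice_template:
        raise ValueError("voice_template is required")
    return {
        "text": text,
        "voice_template": voice_template,
        "generation_mode": generation_mode,
        "format": audio_format,
    }


if __name__ == "__main__":
    job = build_speech_job(
        "[quiet] Welcome to module one.",
        "vt_calm_narrator_v1",
    )
    # Print the JSON body; sending it is left to the UI, API client, or MCP tool.
    print(json.dumps(job, indent=2))
```

Keeping payload construction in one function means a human using a script and an LLM app using a tool can produce the same request shape.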
Decide which templates are approved, which expression tags are allowed, who can create workspace keys, and which usage limits are acceptable. Those choices keep automated voice generation useful without letting it sprawl from the first paid Test plan through Pro, Scale, and Business usage.
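One way a team might encode those decisions is a small policy object that is checked before a job is created. The sketch below is illustrative only: `VoicePolicy`, `policy_violations`, and the `max_characters` limit are hypothetical, and the character cap is a stand-in for whatever usage limit a workspace actually enforces (the product itself meters usage in credits).

```python
import re
from dataclasses import dataclass
from typing import FrozenSet, List

# Assumption: tags are bracketed phrases such as "[quiet]".
TAG_PATTERN = re.compile(r"\[([^\[\]]+)\]")


@dataclass(frozen=True)
class VoicePolicy:
    """Hypothetical team policy; not a TextToSpeechSkills data structure."""
    approved_templates: FrozenSet[str]
    allowed_tags: FrozenSet[str]
    max_characters: int  # stand-in for a real usage limit


def policy_violations(payload: dict, policy: VoicePolicy) -> List[str]:
    """Return human-readable problems; an empty list means the job may proceed."""
    problems: List[str] = []
    template = payload.get("voice_template")
    if template not in policy.approved_templates:
        problems.append(f"voice template not approved: {template}")
    text = payload.get("text", "")
    for tag in TAG_PATTERN.findall(text):
        if tag.strip() not in policy.allowed_tags:
            problems.append(f"expression tag not allowed: [{tag.strip()}]")
    if len(text) > policy.max_characters:
        problems.append(f"text exceeds {policy.max_characters} characters")
    return problems
```

Running a check like this in the review step gives course builders a clear message about what to change, rather than a failed or unexpected generation later.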
Common questions
These are the practical details that matter before a team adds speech generation to a real workflow.
Course creators, learning platforms, internal training teams, and product educators should use this page when they want generated speech that is easy to review, consistent across prompts, and simple to connect to LLM tools. The core workflow combines expression tags, voice templates, credit previews, and job-based generation.
Non-technical course builders can ask an LLM to adapt lesson text for audio, validate the expression tags, and generate speech through the MCP tool. The setup guide keeps the first path short while still giving developers a clean API when the workflow moves into a product backend.
Every paid plan uses credits. Teams can add credit packs when needed, and workspaces on Pro and higher add central billing for $2 per user per month.
API playground
{
  "text": "[quiet] hello. [loud and angry] how are you?",
  "voice_template": "vt_calm_narrator_v1",
  "generation_mode": "instant",
  "format": "mp3"
}
MCP install
pnpm --package texttospeechskills dlx tts-skills-mcp
pnpm --package texttospeechskills dlx tts-skills tags