Skip to content

Add Speech AI MCP Server (Pronunciation, TTS, STT)#3526

Open
fasuizu-br wants to merge 1 commit intoalibaba:mainfrom
fasuizu-br:add-mcp-speech-ai
Open

Add Speech AI MCP Server (Pronunciation, TTS, STT)#3526
fasuizu-br wants to merge 1 commit intoalibaba:mainfrom
fasuizu-br:add-mcp-speech-ai

Conversation

@fasuizu-br
Copy link

MCP Server: Speech AI

Add Speech AI MCP server providing pronunciation assessment, speech-to-text, and text-to-speech capabilities for AI agents.

Tools

Tool Description
assess_pronunciation Score English pronunciation at phoneme, word, and sentence levels (0-100)
transcribe_audio Transcribe audio to text with word-level timestamps
synthesize_speech Generate speech from text with 12 English voices (US + UK accents)
list_tts_voices List available TTS voices

Key Features

API Details

  • Base URL: https://apim-ai-apis.azure-api.net
  • Auth: Ocp-Apim-Subscription-Key header
  • Formats: WAV, MP3, OGG, FLAC, WebM
  • Pricing: $0.02/call (free tier available via Azure Marketplace)

Links

Files Added

  • plugins/wasm-go/mcp-servers/mcp-speech-ai/mcp-server.yaml - Server configuration with 4 tools
  • plugins/wasm-go/mcp-servers/mcp-speech-ai/README.md - English documentation
  • plugins/wasm-go/mcp-servers/mcp-speech-ai/README_ZH.md - Chinese documentation

Reference: Issue #2113 (How to publish your MCP Server)

@CLAassistant
Copy link

CLAassistant commented Feb 21, 2026

CLA assistant check
All committers have signed the CLA.

@fasuizu-br
Copy link
Author

recheck

@fasuizu-br
Copy link
Author

Hi team! Just a friendly check-in on this PR. The CLA has been signed and all checks are passing. This adds a Speech AI MCP server plugin (pronunciation assessment, TTS, STT) using Higress's declarative YAML format with request/response templates. Happy to address any feedback or make adjustments. Thank you for your time!

@fasuizu-br
Copy link
Author

recheck

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants