Gradium

Gradium TTS implementation for @micdrop/server.

This package provides high-quality real-time text-to-speech implementation using Gradium's WebSocket streaming API.

Installation

npm install @micdrop/gradium

Gradium TTS (Text-to-Speech)

Usage

import { GradiumTTS } from '@micdrop/gradium'
import { MicdropServer } from '@micdrop/server'

const tts = new GradiumTTS({
  apiKey: process.env.GRADIUM_API_KEY || '',
  voiceId: 'YTpq7expH9539ERJ', // Gradium voice ID
  modelName: 'default', // Optional: model name
  outputFormat: 'pcm_16000', // Optional: audio format
  region: 'eu', // Optional: 'eu' or 'us'
})

// Use with MicdropServer
new MicdropServer(socket, {
  tts,
  // ... other options
})

Options

Option	Type	Default	Description
`apiKey`	`string`	Required	Your Gradium API key
`voiceId`	`string`	Required	Gradium voice ID
`modelName`	`string`	`'default'`	Model name to use for speech synthesis
`outputFormat`	`GradiumOutputFormat`	`'pcm_16000'`	Audio output format
`region`	`'eu' \| 'us'`	`'eu'`	API region (EU or US endpoint)
`jsonConfig`	`GradiumJsonConfig`	Optional	Advanced voice configuration
`retryDelay`	`number`	`1000`	Delay in milliseconds between reconnection attempts
`maxRetry`	`number`	`3`	Maximum number of reconnection attempts before failing

Output Formats

Format	Description
`pcm`	PCM 48kHz, 16-bit signed integer mono
`pcm_16000`	PCM 16kHz, 16-bit signed integer mono
`pcm_24000`	PCM 24kHz, 16-bit signed integer mono
`pcm_8000`	PCM 8kHz, 16-bit signed integer mono
`wav`	WAV format
`opus`	Opus codec wrapped in an Ogg container
`ulaw_8000`	u-law 8kHz
`alaw_8000`	A-law 8kHz

Advanced Voice Configuration

The jsonConfig option allows you to fine-tune voice characteristics:

const tts = new GradiumTTS({
  apiKey: 'your-api-key',
  voiceId: 'your-voice-id',
  jsonConfig: {
    temp: 0.7, // Temperature: 0 to 1.4 (default: 0.7)
    cfg_coef: 2.0, // Voice similarity: 1.0 to 4.0 (default: 2.0)
    padding_bonus: 0, // Speed control: -4.0 to 4.0
  },
})

Parameter	Type	Range	Default	Description
`temp`	`number`	0 - 1.4	0.7	Controls randomness in speech generation
`cfg_coef`	`number`	1.0 - 4.0	2.0	Controls similarity to the original voice
`padding_bonus`	`number`	-4.0 - 4.0	0	Controls speech speed

Getting Started

Sign up for a Gradium account and get your API key
Choose a voice ID from the Gradium voice library
Install the package and configure with your credentials

import { GradiumTTS } from '@micdrop/gradium'

const tts = new GradiumTTS({
  apiKey: 'your-gradium-api-key',
  voiceId: 'your-voice-id',
  region: 'eu', // Use 'us' for US endpoint
  outputFormat: 'pcm_16000',
})

// Use with MicdropServer
new MicdropServer(socket, {
  tts,
  // ... other options
})

Installation​

Gradium TTS (Text-to-Speech)​

Usage​

Options​

Output Formats​

Advanced Voice Configuration​

Getting Started​