The Next Generation
of Speech Synthesis.
We're building a leaner TTS architecture that achieves competitive naturalness at a fraction of the training data and cost of industry leaders.
Hear It For Yourself
Our V0 model, trained on 45,000 hours — less than 1% of industry-standard datasets. Type anything and synthesize speech in real time.
Tontaube V0 is an early research model. Improved versions are coming to the app and API soon.
Early experiments showing significant efficiency gains over industry-standard approaches.
Build with Tontaube
A high-speed, cost-efficient voice generation API powered by our architecture. Clone any voice from a single audio file and generate long-form speech at 10× real-time speed.
- One-shot voice cloning
- 0.10× real-time factor — 1 min audio in ~6 s
- ~2% of ElevenLabs pricing
import tontaube
client = tontaube.Client(api_key="your_key")
# Clone any voice from a single audio file
voice_id = client.create_voice(
audio_file="sample.mp3"
)
# Generate speech — 0.10× real-time factor
audio = client.generate_speech(
text="The next generation of speech synthesis.",
voice_id=voice_id
) Tontaube for iOS & Android
Our audiobook and voice cloning app — already in the hands of thousands of listeners. Convert any document to audio, clone your voice, and stream from a public domain library.
- PDF, EPUB & document conversion
- Free voice cloning
- 30,000+ AI audiobooks
Interested in Investing?
We've proven the architecture at prototype scale and are seeking investment to scale compute, data, and the team.