neo.one Model

neo.one is our fast, efficient TTS model, designed for quick processing and resource optimization across a wide range of languages and use cases.

Key Features

  • Multi-language support (10+ languages)
  • Ultra-fast processing
  • Low resource requirements
  • Optimized for batch processing
  • Ideal for large-scale TTS applications

Use Cases

  1. Content Localization: Quickly translate and voice-over content for multiple languages.
  2. News Broadcasting: Generate audio versions of news articles in real-time.
  3. IVR Systems: Power interactive voice response systems with fast, efficient TTS.
  4. Bulk Audio Generation: Create large volumes of audio content for various applications.
  5. E-learning Platforms: Generate audio content for online courses efficiently.

candy.two Model

candy.two is our flagship model focused on high-quality, natural-sounding speech synthesis, making it ideal for applications requiring premium voice output.

Key Features

  • Highest fidelity voice output
  • Transformer-based architecture
  • Advanced emotion and emphasis control
  • Customizable speaking styles
  • Supports multiple voices and languages

Use Cases

  1. Virtual Assistants: Create lifelike AI assistants with natural-sounding voices.
  2. Audiobook Production: Automate the creation of audiobooks with high-quality narration.
  3. Podcast Creation: Produce professional-sounding podcast episodes from written scripts.
  4. Gaming: Develop dynamic, responsive NPC dialogues in video games.
  5. Accessibility Tools: Improve web accessibility with high-quality text-to-speech for visually impaired users.

Model Comparison

Featureneo.one

candy.two

Processing SpeedUltra-fastStandard
Voice QualityGood qualityHighest fidelity
Resource UsageOptimized (lower)Standard
Language Support10+ languages8+ languages
Emotion ControlBasicAdvanced
Batch ProcessingOptimizedSupported
ArchitectureNon-autoregressiveTransformer-based
Use CaseEfficient, large-scale TTSHigh-quality, premium TTS

Choosing the Right Model

  • Choose neo.one

    when:

    • Processing speed is a top priority
    • You’re generating large volumes of audio content
    • You need to optimize resource usage
    • Batch processing is required
  • Choose candy.two

    when:

    • You need the highest quality voice output
    • Advanced emotion and emphasis control is required
    • You’re working on applications requiring premium voice quality
    • Transformer-based architecture benefits are desired

For guidance on selecting the best model for your specific use case, please contact our support team at [email protected].