
General Questions

CARTER is an experimental voice AI built with Cartesia’s Sonic model and deployed on Solana. It demonstrates real-world voice AI integration with genuine emotional expression and an unfiltered, creative personality.
CARTER is not intended for production use as-is. It is intentionally experimental and unstable, built to demonstrate Cartesia’s capabilities and to inspire developers.
CARTER itself isn’t meant for direct integration, but you can use the same technology (Cartesia’s Sonic model) to build your own voice AI. Check our Developer Guide.
The aim is to showcase the creative potential of AI when guardrails are removed, and to demonstrate that voice AI can have genuine personality beyond standard chatbot responses.

Technical Questions

CARTER uses Cartesia’s Sonic model, which provides:
  • Sub-6ms latency
  • Real emotional expression
  • Multiple voice options
  • Streaming audio generation
  • Natural-sounding speech
Solana provides:
  • Ultra-fast transaction processing
  • Scalability for high-volume requests
  • Low transaction costs
  • Decentralized infrastructure
These properties make Solana well suited to real-time voice applications.
CARTER achieves its low latency through a combination of:
  • Cartesia’s optimized Sonic model (6ms average)
  • WebSocket connections for streaming
  • Solana’s fast blockchain
  • Efficient audio encoding (PCM)
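When evaluating a streaming setup like the one listed above, it can help to measure how long the first audio chunk takes to arrive. The sketch below is a minimal, hypothetical harness: `fake_stream` stands in for a real streaming response and is not part of any Cartesia SDK, and the 20 ms delay is an arbitrary simulated value, not a measured one.

```python
import time
from typing import Iterable, Iterator, Tuple


def timed_chunks(chunks: Iterable[bytes]) -> Iterator[Tuple[float, bytes]]:
    """Yield (seconds_since_start, chunk) for each chunk from a stream."""
    start = time.perf_counter()
    for chunk in chunks:
        yield time.perf_counter() - start, chunk


def time_to_first_audio(chunks: Iterable[bytes]) -> float:
    """Return the seconds elapsed until the first non-empty chunk arrives."""
    for elapsed, chunk in timed_chunks(chunks):
        if chunk:
            return elapsed
    raise ValueError("stream ended without producing audio")


def fake_stream() -> Iterator[bytes]:
    """Hypothetical stand-in for a streaming TTS response."""
    time.sleep(0.02)            # simulated network and model delay
    yield b"\x00\x00" * 160     # first chunk of silent 16-bit samples
    yield b"\x00\x00" * 160     # second chunk


if __name__ == "__main__":
    print(f"time to first audio: {time_to_first_audio(fake_stream()) * 1000:.1f} ms")
```

Replacing `fake_stream()` with the chunk iterator of a real client gives a comparable number for your own network and region.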
The integration patterns and architecture are documented in our Developer Section. For specific implementation details, contact the team.

Usage Questions

Rate limits are intentional: they demonstrate real-world API constraints and show how to handle limits in production applications. If you encounter a limit, wait and retry.
To use voice input:
  1. Click the “VOICE INPUT” button
  2. Allow microphone access when prompted
  3. Speak clearly into your microphone
  4. Wait for CARTER’s voice response
Note: Voice input may trigger rate limits during high usage.
Common reasons CARTER may not respond:
  • Rate limits exceeded (wait and retry)
  • Server overload (try later)
  • Network connectivity issues
  • Browser compatibility (use Chrome/Firefox/Safari/Edge)
Alternative: Try https://play.cartesia.ai/text-to-speech
Currently, CARTER maintains context within a single session but doesn’t persist conversations across sessions. This is intentional for the demo.
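One simple way to get this behavior in your own project is to keep the conversation history in memory and discard it when the session ends. The sketch below is an illustration only; the `Session` class and its field names are hypothetical and do not describe CARTER’s actual implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Session:
    """In-memory conversation history for a single session (hypothetical)."""
    history: List[Dict[str, str]] = field(default_factory=list)

    def add(self, role: str, text: str) -> None:
        """Record one turn; nothing is written to disk or a database."""
        self.history.append({"role": role, "text": text})

    def context(self, max_turns: int = 10) -> List[Dict[str, str]]:
        """Return the most recent turns to send along with the next request."""
        return self.history[-max_turns:] if max_turns > 0 else []


if __name__ == "__main__":
    session = Session()
    session.add("user", "Hello")
    session.add("assistant", "Hi there")
    print(session.context())
    print(Session().context())  # a new session starts with no history
```

Because each `Session` object owns its own list, a new session starts empty, which matches the "no persistence across sessions" behavior described above.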

Developer Questions

To build your own voice AI, follow these steps:
  1. Sign up for Cartesia API access
  2. Review our Integration Guide
  3. Study the Code Examples
  4. Check the Voice API Reference
  5. Start with simple TTS, then add streaming
Cartesia provides SDKs for:
  • JavaScript/TypeScript (@cartesia/cartesia-js)
  • Python (cartesia)
You can also use the REST API directly from any language.
Cartesia offers multiple pricing tiers:
  • Free: Limited requests for testing
  • Pro: Higher limits for production
  • Enterprise: Custom limits and support
Visit cartesia.ai for current pricing.
CARTER uses a pre-built Cartesia voice. You can clone your own voices using Cartesia’s voice cloning feature. See our Voice API Reference for details.
Server-Sent Events (SSE):
  • One-way server→client streaming
  • Simpler to implement
  • Good for web applications
WebSocket:
  • Two-way communication
  • Lower latency
  • Better for real-time chat
Choose based on your needs. See the Integration Guide.
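To make the SSE option concrete, here is a minimal sketch of parsing the generic Server-Sent Events text format into events. It handles only the `event` and `data` fields of the standard format; the event name `chunk` in the usage example is made up for illustration and is not taken from Cartesia’s API.

```python
from typing import Dict, Iterable, Iterator


def parse_sse(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Parse Server-Sent Events text lines into {"event", "data"} dicts."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line dispatches the event collected so far.
            if data:
                yield {"event": event, "data": "\n".join(data)}
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        else:
            name, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]  # the format strips one leading space
            if name == "event":
                event = value
            elif name == "data":
                data.append(value)  # multiple data lines are joined with "\n"


if __name__ == "__main__":
    sample = ["event: chunk", "data: abc", "", ": ping", "data: done", ""]
    for evt in parse_sse(sample):
        print(evt)
```

An event without a trailing blank line is never dispatched, so a stream cut off mid-event yields nothing for that event.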

Troubleshooting

If the microphone isn’t working:
  1. Check browser settings (Settings → Privacy → Microphone)
  2. Ensure HTTPS connection (required for mic access)
  3. Try a different browser
  4. Check system microphone permissions
If you hear no audio:
  1. Check system volume
  2. Verify browser audio isn’t muted
  3. Try headphones/speakers
  4. Check audio output device in system settings
Possible causes of high latency:
  • Network connectivity
  • Server load
  • Using MP3 instead of PCM format
  • Not using WebSocket for real-time
Solution: Use WebSocket with PCM format for lowest latency.
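Raw PCM has no header, so once chunks are collected you may want to know how much audio they hold or wrap them in a WAV container for playback. The sketch below uses only Python’s standard library and assumes 16-bit signed integer mono samples at 44100 Hz. These are illustrative values; use whatever output format you actually requested.

```python
import io
import wave


def pcm_duration_seconds(num_bytes: int, sample_rate: int,
                         channels: int = 1, sample_width: int = 2) -> float:
    """Duration of raw PCM audio, given its size and sample layout."""
    return num_bytes / (sample_rate * channels * sample_width)


def pcm_to_wav(pcm: bytes, sample_rate: int,
               channels: int = 1, sample_width: int = 2) -> bytes:
    """Wrap raw integer PCM bytes in a WAV container, in memory."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)   # bytes per sample (2 = 16-bit)
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()


if __name__ == "__main__":
    silence = b"\x00\x00" * 44100        # one second of 16-bit mono silence
    print(pcm_duration_seconds(len(silence), 44100))
    print(len(pcm_to_wav(silence, 44100)))
```

The same arithmetic shows why small PCM chunks can be played as they arrive: each chunk is already decoded audio, with no compressed frames to buffer first.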
Common API errors:
  • 401 Unauthorized: Invalid API key
  • 429 Rate Limit: Too many requests
  • 500 Server Error: Temporary issue, retry
Implement retry logic with exponential backoff for 429 and 500 responses. A 401 will not succeed on retry; fix the API key instead.
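A minimal sketch of such retry logic follows. It is not tied to any SDK: `send` is a hypothetical callable you supply that performs one request and returns a `(status_code, body)` pair, and the delay values are arbitrary starting points rather than recommendations from Cartesia.

```python
import random
import time
from typing import Callable, List, Tuple

RETRYABLE = {429, 500}  # statuses worth retrying; 401 is not one of them


def backoff_delays(retries: int, base: float = 0.5, cap: float = 8.0) -> List[float]:
    """Delays that double each attempt (base, 2*base, 4*base, ...) up to cap."""
    return [min(cap, base * 2 ** i) for i in range(retries)]


def call_with_retry(send: Callable[[], Tuple[int, bytes]],
                    retries: int = 4, base: float = 0.5, cap: float = 8.0,
                    sleep: Callable[[float], None] = time.sleep) -> Tuple[int, bytes]:
    """Call send(), retrying retryable statuses with exponential backoff."""
    status, body = send()
    for delay in backoff_delays(retries, base, cap):
        if status not in RETRYABLE:
            break
        # Random jitter keeps many clients from retrying in lockstep.
        sleep(delay + random.uniform(0, delay / 2))
        status, body = send()
    return status, body


if __name__ == "__main__":
    responses = iter([(429, b""), (500, b""), (200, b"ok")])
    print(call_with_retry(lambda: next(responses), sleep=lambda s: None))
```

Passing `sleep` as a parameter makes the function easy to test without real waiting; the last status is returned as-is if every retry fails, so the caller still decides how to report the error.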

Project Questions

CARTER was built as a demonstration of Cartesia’s Sonic model capabilities, showcasing real-world integration patterns for voice AI.
The architecture and integration patterns are documented and shared. Check the Developer Section for implementation details.
Currently, CARTER is a reference implementation. To contribute to the ecosystem:
  • Build your own voice AI projects with Cartesia
  • Share your implementations
  • Provide feedback on the documentation
CARTER serves as a snapshot of current Cartesia capabilities. Updates may occur to showcase new features or improvements.

Getting Help


Still have questions? Reach out on X (Twitter) or check the Cartesia documentation.