How is usage time calculated and billed?

Usage time starts when you call the stream() method and ends when that specific stream closes. The time is tracked in seconds and billed per minute.
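As a sketch of the arithmetic (assuming partial minutes round up — confirm against your plan's billing terms; `billableMinutes` is an illustrative helper, not an SDK function):

```javascript
// Illustrative only: convert tracked usage seconds to billed minutes.
// Assumption: partial minutes are rounded up — verify against your plan.
function billableMinutes(usageSeconds) {
  return Math.ceil(usageSeconds / 60);
}
```

Under this assumption, a stream open for 90 seconds would bill as 2 minutes.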

What causes latency and how can I optimize it?

Latency can come from several sources:

  • Connection setup time (usually 1-2 seconds, but can be up to 5 seconds)
  • LLM processing time
  • TTS generation
  • Face generation
  • Network conditions

To optimize latency:

  • Consider using our turnkey solution instead of a custom LLM
  • Use the streaming API for custom LLM implementations

How do I handle multilingual conversations?

Current language support:

  • Speech recognition currently struggles outside of English and will often translate non-English speech to English
  • TTS supports multiple languages but voice quality may vary
  • System prompts can be set to specific languages
  • Language handling is primarily controlled via the system prompt
  • Auto-language detection is planned for future releases
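For example, a persona can be pinned to a language through its system prompt. The `systemPrompt` field name below is an assumption about the persona configuration shape — check the SDK docs for the exact key:

```javascript
// Hypothetical persona configuration: the system prompt pins the
// persona to one language rather than relying on auto-detection.
const personaConfig = {
  personaId: 'chosen-persona-id',
  systemPrompt: 'You are a helpful assistant. Always reply in German.',
};
```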

Can I interrupt the persona while it’s speaking?

Yes, you can interrupt the persona in two ways:

  1. Send a new talk() command, which overrides the current speech
  2. When using streaming, user speech will automatically interrupt the current stream

Note: Currently there isn’t a way to completely silence the persona mid-speech, but sending a short punctuation mark (like “.” or “!”) through the talk command can achieve a similar effect.
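A minimal sketch of that workaround — `silencePersona` is an illustrative wrapper, not an SDK method; it relies only on the talk() command described above:

```javascript
// Workaround sketch: override the current speech with a single
// punctuation mark, which effectively silences the persona.
function silencePersona(client) {
  client.talk('.');
}
```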

How do I integrate my own LLM?

See our custom brains guide for more information.

What are the browser compatibility requirements?

The SDK should work well with any recent Chromium-based browser. It requires:

  • Modern browser with WebRTC support
  • Microphone permissions for audio input
  • Autoplay capabilities for video/audio
  • WebAssembly support
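These requirements can be probed up front with standard browser APIs. `checkBrowserSupport` is an illustrative helper, not part of the SDK:

```javascript
// Sketch: probe for the capabilities listed above using standard
// browser globals. Pass `window` (or `globalThis`) as the scope.
function checkBrowserSupport(scope) {
  return {
    webrtc: typeof scope.RTCPeerConnection === 'function',
    webassembly: typeof scope.WebAssembly !== 'undefined',
    microphone: Boolean(
      scope.navigator &&
      scope.navigator.mediaDevices &&
      scope.navigator.mediaDevices.getUserMedia
    ),
  };
}
```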

Safari/iOS notes:

  • Requires explicit user interaction for audio playback
  • May have additional security policy requirements
  • WebKit engine has specific autoplay restrictions

The SDK does not currently support Firefox.
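The Safari/iOS autoplay restriction is usually handled by starting playback inside an explicit user gesture. A minimal sketch, where the button and video elements are placeholders for your own UI:

```javascript
// Sketch: start media playback from a click handler so Safari/iOS
// treat it as an explicit user interaction rather than autoplay.
function wirePlaybackToGesture(button, video) {
  button.addEventListener('click', () => {
    video.play().catch((err) => {
      console.warn('Playback was blocked:', err);
    });
  });
}
```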

How do I monitor current usage?

Usage tracking options:

  • Available in Anam Lab
  • API endpoint for usage stats coming soon
  • Session logs available in the Anam Lab

What’s the difference between development and production setup?

Development:

const client = unsafe_createClientWithApiKey('your-api-key', {
  personaId: 'chosen-persona-id',
});

Production:

  1. Exchange your API key for a session token server-side
  2. Pass the session token to the client

const client = createClient('session-token', {
  personaId: 'chosen-persona-id',
});
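The token exchange in step 1 keeps your API key off the client. A sketch of the client side of that flow — `/api/anam-session-token` is a hypothetical endpoint on your own server, and the response shape is an assumption:

```javascript
// Sketch: fetch a short-lived session token from your own backend,
// which holds the API key and performs the exchange server-side.
async function fetchSessionToken(tokenUrl) {
  const response = await fetch(tokenUrl, { method: 'POST' });
  if (!response.ok) {
    throw new Error(`Token exchange failed: ${response.status}`);
  }
  const { sessionToken } = await response.json();
  return sessionToken;
}
```

The returned token is then passed to createClient as in the production snippet above.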

How do I handle connection issues?

Common issues and solutions:

  • For “403 Forbidden” errors, verify API key/session token
  • If video doesn’t appear, check element IDs match exactly
  • Connection timeouts may require retry logic
  • Session tokens expire and need refresh
  • Monitor CONNECTION_CLOSED events for network issues
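The retry logic mentioned above can be sketched as exponential backoff around whatever connect call you use. `connectFn` is a placeholder for your own connection routine, not an SDK API:

```javascript
// Sketch: retry an async connect function with exponential backoff.
// With the defaults, waits grow 1s, 2s, 4s, ... between attempts.
async function connectWithRetry(connectFn, maxAttempts = 3, baseDelayMs = 1000) {
  let lastError;
  for (let attempt = 0; attempt < maxAttempts; attempt += 1) {
    try {
      return await connectFn();
    } catch (err) {
      lastError = err;
      const delay = baseDelayMs * 2 ** attempt;
      await new Promise((resolve) => setTimeout(resolve, delay));
    }
  }
  throw lastError;
}
```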

What features are coming soon?

Near-term roadmap includes:

  • Improved server-to-server streaming support for custom LLMs
  • Improved usage dashboard and analytics
  • Ability to create avatars from a single image
  • Much more!