Voice Actions

Use virtual speaker and microphone hardware with models such as speech-to-text and text-to-speech to speak, listen, and respond to audio in an active Workstation.

📄️ Speak

Play voice audio into the virtual microphone via a text-to-speech model. You can provide exact copy for the agent to speak, or instructions for an LLM to generate a response.

📄️ Question

Speak a question into the virtual microphone, listening for a user response, and then optionally responding back.