Voice Actions
Use virtual speaker and microphone hardware with models such as speech-to-text and text-to-speech to speak, listen, and respond to audio in an active Workstation.
📄️ Speak
Play voice audio into the virtual microphone via a text-to-speech model. You can provide exact copy for the agent to speak, or instructions for an LLM to generate a response.
📄️ Question
Speak a question into the virtual microphone, listening for a user response, and then optionally responding back.