Changelog Update

Speech-to-Text for Audio & Video Files

🎯 What is it?

A new speech-to-text tool is now available in Dust, allowing your agents to transcribe spoken content from both audio and video sources. You can transcribe content from supported URLs or by uploading files directly, with support for most common formats (mp3, mp4, wav, m4a, mov, webm, aac, flac, and more).

💡 Why is it useful?

Audio and video content is everywhere—meeting recordings, interviews, webinars, voice notes—but the information locked inside it is hard to search, reuse, or act on. Transcription has been a recurring request, and with Dust's expanded processing capabilities, your agents can now turn spoken content into text that can be summarized, analyzed, and integrated into your workflows.

⚙️ How does it work?

Your agent can transcribe content in two ways: from a URL (within an approved list of domains) or from a file you upload directly. Once transcribed, the text becomes available for the agent to work with—summarizing, extracting key points, or feeding into other tasks.

✨ Concrete Use Cases

Here's how you could use it:

Meeting follow-ups: Upload a recording of a call and ask your agent to produce a clean summary with action items and decisions.
Content repurposing: Transcribe a webinar or podcast episode, then have your agent draft a blog post, social snippets, or an internal recap.
Interview analysis: Turn recorded user interviews into searchable text and extract recurring themes or quotes.

📈 Benefits for you

Significant time savings on manual transcription, easier reuse of audio/video content, and the ability to make spoken information searchable and actionable—all within your existing Dust workflows.

🚀 How to access it?

The feature is available to everyone now—no activation needed. Simply provide a supported URL or upload an audio/video file in your conversation, and ask your agent to transcribe it.