Skip to content

chore(providers): AssemblyAI docs #471

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion fern/docs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -440,7 +440,7 @@ navigation:
path: providers/transcriber/gladia.mdx
- page: Talkscriber
path: providers/transcriber/talkscriber.mdx
- page: Assembly AI
- page: AssemblyAI
path: providers/transcriber/assembly-ai.mdx

- section: Cloud storage
Expand Down
57 changes: 21 additions & 36 deletions fern/providers/transcriber/assembly-ai.mdx
Original file line number Diff line number Diff line change
@@ -1,54 +1,39 @@
---
title: AssemblyAI
subtitle: What is AssemblyAI?
slug: providers/transcriber/assembly-ai
---

## Universal-Streaming

**What is AssemblyAI?**
Universal-Streaming is AssemblyAI's purpose-built speech-to-text model that delivers ultra-fast, immutable transcripts in ~300ms with intelligent endpointing and superior accuracy for voice agents. It eliminates common pain points like misheard account numbers, awkward pauses, and premature cutoffs, enabling more natural and successful voice interactions.

AssemblyAI is a leading provider of AI-driven speech recognition and understanding technologies. Their advanced models enable accurate transcription and analysis of audio data, facilitating applications across various industries.

**The Evolution of AI Transcription:**

Speech recognition has evolved from basic systems to sophisticated AI models capable of understanding diverse languages and accents. AssemblyAI has been at the forefront of this evolution, developing models like Universal-2, trained on over 12.5 million hours of audio data, achieving best-in-class transcription accuracy across several industry-critical languages.
## How to use AssemblyAI as transcriber

**Overview of AssemblyAI's Offerings:**
This guide details how to setup AssemblyAI as a transcriber for your assistant.

AssemblyAI offers a comprehensive suite of AI-driven tools designed to meet diverse needs:
<Steps>
**Head to the "Assistants" tab in your Vapi dashboard.**

**Speech-To-Text**
<Frame>
<img src="../../static/images/providers/assemblyai/AssemblyAI-Step1.png" />
</Frame>

- Their core offering converts spoken language into written text with up to 95% accuracy and 30% less hallucinations than other leaders in the space.
**Click on your assistant and then the "Transcriber" tab.**

**Audio Intelligence**
<Frame>
<img src="../../static/images/providers/assemblyai/AssemblyAI-Step2.png" />
</Frame>


- Beyond transcription, AssemblyAI's fully featured speech understanding models can analyze audio to detect sentiment, identify topics, and perform speaker diarization, transforming words into meaningful ideas, insights, and opportunities.
**Select "assembly-ai" on the Provider dropdown.**

**Real-time Transcription**
<Frame>
<img src="../../static/images/providers/assemblyai/AssemblyAI-Step3.png" />
</Frame>
</Steps>

- AssemblyAI's real-time transcription feature enables sub-second latency conversion of speech to text, beneficial for live captioning, customer support, and interactive voice response systems, enhancing user experience and operational efficiency.
## Supported Languages

**Use Cases for AssemblyAI**

AssemblyAI's versatile technology serves multiple industries, enhancing operations and delivering valuable insights:

**Contact Centers**

- For contact centers, AssemblyAI provides real-time transcription and audio analysis to improve customer interactions. By transcribing calls and analyzing sentiment, businesses can identify trends, monitor agent performance, and enhance customer satisfaction.

**Media And Content Creation**

- In the media sector, AssemblyAI’s speech-to-text solutions are used to transcribe interviews, podcasts, and video content. This makes content searchable, accessible, and easier to manage, enhancing the efficiency of media production workflows.

**Innovation and Research:**

- AssemblyAI is committed to continuous innovation and research in the field of speech recognition and AI. Their team of experts is dedicated to enhancing the capabilities of their technology, exploring new applications, and pushing the boundaries of what speech AI can achieve.

**AI Safety and Ethics:**

- Ensuring the ethical use of AI is a core principle at AssemblyAI. They implement robust safeguards to prevent misuse of their technology and are actively engaged in promoting responsible AI development. Protecting user data and maintaining transparency in AI operations are central to their mission.

**Integrations and Compatibility**

- AssemblyAI offers a developer-friendly environment with RESTful API access, WebSocket support for real-time applications, SDKs for popular programming languages, detailed documentation and examples, ensuring seamless integration of speech recognition capabilities into existing systems.
Universal-Streaming currently supports English only.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.