Skip to content
GitHub

General 4xx customer issues

General 4xx customer issues

Introduction

When a customer reaches out with a 4xx error, it means that something is wrong with their API request. This playbook will touch on how to troubleshoot common 4xx errors. It is also worth noting that when a transcription completes with an error, we will return an error key in the JSON response for the GET request. We should educate customers on the error key in case they are not aware of it (see here for our error handling cookbook).

Procedures

Finding the Error Message in Datadog

If a customer is asking why their transcription failed, ask for the transcription id.

  1. Open Datadog and navigate to “Logs” on the sidebar.

  2. Enter @transcript_id:<transcript-id> in the search bar.

  3. Once the logs have finished loading, you can ctrl/cmd + F “Client failed transcript” to find the specific error message that was received.

Common 4xx Messages

Download Error: Forbidden URL

A forbidden URL error occurs when AssemblyAI cannot access the provided URL. A common reason for this error is that the permissions settings of an S3 bucket for the provided URL are blocking us from accessing the audio file.

To resolve this issue, the developer can use pre-signed URLs to allow AssemblyAI temporary access to the files for processing.

Download Error: 404 Client Error: Not Found for url

This error occurs when a URL used in the audio_url is not found. To resolve this, ask the customer to check for any typos and make sure that the link does not point to a webpage that does not exist, has been moved, or has a dead or broken link.

File does not contain audio, appears to be text/html

The URL must point to a downloadable file. Customers can use an S3 bucket, cloud storage or our upload endpoint to generate a valid URL.

If you use a Youtube video as the URL in the audio_url parameter, you will get the above error since Youtube links do point to a downloadable video. You may get customers asking if we accept specific links like Youtube, Github etc. Here are some solutions when customers inquire about these types of links:

How to Get YouTube Video Transcripts with yt-dlp

How to Get Transcripts Using Google Drive Links

Transcribing Files Hosted on GitHub with AssemblyAI

Transcribing from an S3 Bucket

Resources

Tool for reviewing error logs: Datadog

AWS Docs for pre-signed URLs

Cookbooks

How to Get YouTube Video Transcripts with yt-dlp

How to Get Transcripts Using Google Drive Links

Transcribing Files Hosted on GitHub with AssemblyAI

Transcribing from an S3 Bucket

Guide to Common Errors for New AssemblyAI Users

Appendices

Transcript ID: the UUID the developer used to process the audio file.