Audio Converter

This feature converts audio files into text.

It uses OpenAI’s open source Whisper speech recognition model, accessed through the OpenAI API.

OpenAI’s Whisper API costs $0.006 per minute of audio.

At this rate, a 10-minute audio file costs approximately $0.06.
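
The arithmetic above can be sketched in a few lines of Python. This is only an illustration: the function name `estimate_cost_usd` is hypothetical, and the sketch assumes cost scales linearly with audio length, so the amount actually billed may differ slightly.

```python
# Rate quoted in this document: $0.006 per minute of audio.
WHISPER_USD_PER_MINUTE = 0.006


def estimate_cost_usd(duration_seconds: float) -> float:
    """Estimate the Whisper API cost for audio of the given length.

    Assumption: cost is strictly proportional to duration.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    minutes = duration_seconds / 60
    return round(minutes * WHISPER_USD_PER_MINUTE, 4)


# A 10-minute file, as in the example above.
print(estimate_cost_usd(10 * 60))  # prints 0.06
```

The same function gives a quick estimate for longer recordings, for example `estimate_cost_usd(90 * 60)` for a 90-minute file.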

Using the Audio Converter

To use this feature, follow these steps:

  • Go to the Audio Converter page from the plugin menu.
  • Choose one of the two available options: Transcription or Translation.
    • Transcription: Converts the audio into text in the language that is spoken. It currently supports 57 languages: Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.
    • Translation: Converts the audio into text and translates that text into English. The audio can be in any supported language, but English is the only output language.
  • Upload your audio file. Supported file types are mp3, mp4, mpeg, mpga, m4a, wav, and webm. The file size limit is 25 MB.
    • Upload File: Click the “Choose File” button and select the file on your computer that you want to upload.
  • Click the Start button.
  • Wait for the file to be converted.
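
Before uploading, it can help to check a file against the type and size limits listed in the steps above. The following is a minimal sketch rather than the plugin's own validation: the function name `check_audio_file` is hypothetical, and it assumes that 25 MB means 25 × 1024 × 1024 bytes.

```python
from pathlib import Path

# File types listed in this document.
SUPPORTED_EXTENSIONS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm"}

# Assumption: 1 MB = 1024 * 1024 bytes. The document only states "25 MB".
MAX_FILE_BYTES = 25 * 1024 * 1024


def check_audio_file(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    # Compare the extension case-insensitively and without the leading dot.
    extension = Path(filename).suffix.lower().lstrip(".")
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported file type: {extension or 'none'}")
    if size_bytes > MAX_FILE_BYTES:
        problems.append("file is larger than 25 MB")
    return problems


print(check_audio_file("interview.mp3", 4_000_000))
print(check_audio_file("interview.ogg", 30 * 1024 * 1024))
```

The check looks only at the file name and size, so a file with a supported extension but different contents would still pass it.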

The following additional options let you customize the output:

  • Model: Selects the model used for the conversion. Currently the only available model is “whisper-1”.
  • Prompt: An optional text to guide the model’s style or continue a previous audio segment. The prompt should match the audio language.
  • Output Format: Selects the format of the converted result. Available options are post, page, text, and JSON. If you select post or page, additional options become available, such as title, category, author, and post status.
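
For readers curious how these options might correspond to a request to OpenAI's audio endpoints, here is a rough sketch. It is not the plugin's actual code, which this page does not describe. The helper `build_request` is hypothetical, it only assembles the request without sending it, and it covers just the text and JSON formats, on the assumption that post and page are created by the plugin from the returned text.

```python
API_BASE = "https://api.openai.com/v1/audio"

# Assumed mapping from the two options on this page to OpenAI endpoint names.
ENDPOINTS = {"transcription": "transcriptions", "translation": "translations"}

# Only the formats that plausibly pass straight through to the API.
API_FORMATS = {"text", "json"}


def build_request(mode: str, prompt: str = "", output_format: str = "json") -> dict:
    """Assemble the URL and form fields for a request (hypothetical helper)."""
    if mode not in ENDPOINTS:
        raise ValueError(f"unknown mode: {mode}")
    if output_format not in API_FORMATS:
        raise ValueError(f"unsupported output format: {output_format}")
    fields = {"model": "whisper-1", "response_format": output_format}
    if prompt:
        # The prompt is optional and should match the audio language.
        fields["prompt"] = prompt
    return {"url": f"{API_BASE}/{ENDPOINTS[mode]}", "fields": fields}


request = build_request("translation", output_format="text")
print(request["url"])
print(request["fields"])
```

A real request would also need the audio file attached as multipart form data and an API key in the Authorization header, both of which are left out of this sketch.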
