Category: Audio AI
-
OpenAI Whisper API: Hands-on Guide
This guide will walk you through on how to get started with making calls to OpenAI Whisper API. Whisper is an AI model from OpenAI that can be used to convert speech to text. Get your OpenAI API Key First, you need to have an OpenAI Account. If you don’t have one, here is a…
-
7 Best Open-Source Text-to-Speech Tools
In this guide, I will cover the best Open-Source Text-to-Speech or TTS tech that you can run yourself free of cost. This post will cover various TTS technologies at a high level. I will post individual guides for each of them in the next few days and link them here. Let’s dive in. Mozilla TTS…
-
A beginner’s guide to generative AI and its applications
Generative AI is a type of artificial intelligence that involves the generation of new data or content based on a set of inputs. This can include creating new images, videos, text, or other forms of data. Generative AI uses machine learning algorithms, such as deep learning, to analyze and learn from existing data in order…
-
Easily Transcribe Youtube Videos with OpenAI Whisper
In this tutorial, we will see how to easily transcribe Youtube Videos with OpenAI Whisper. If you are new to OpenAI Whisper, check out this getting started guide. Transcribe Youtube Videos with OpenAI Whisper Requirements A Mac, Linux, or Windows Computer Python 3.x or higher installed A little knowledge of command line and how to…
-
OpenAI Whisper: Best guide to getting started with speech to text AI
OpenAI Whisper is a new Automatic Speech Recognization AI system. Whisper is created by OpenAI, the company behind GPT-3, Codex, DALL-E, etc. OpenAI Whisper can do automatic speech recognization and convert speech to text at high quality as well as can do very efficient non-English speech to English text with translation at very efficiently. Whisper’s…