I am constantly scanning the AI space for new developments and newcomers in the arena. One of the most practical and stable tools I recently discovered is Pictory. It creates videos from a simple text script, and it can even generate videos from a URL from any source (like a blog or article online) and […]
Tag: ai
Microsoft + Gutenberg collab: Thousands of free audiobooks
Project Gutenberg and Microsoft recently worked together to create thousands of free and open audiobooks using a new neural text-to-speech technology and Project Gutenberg’s public domain collection of e-books. Microsoft development team: “Our system allows users to customize an audiobook’s speaking speed and style, emotional intonation, and can even match a desired voice using a […]
Impressive Voice->Text AI tool
I came across the newest entrant among voice-to-text AI tools, called AudioNotes. It converts your ‘random’/unstructured voice notes and unstructured text notes into structured text summaries using AI. Essentially, it’s a note-taking app that analyzes the voice notes and can generate many variations depending on the need. I decided to give it […]
Custom Useful, Fun AI Agents
I have created some custom AI agents for specific, contextual queries that hallucinate less and give you focused answers: for real-life uses or for entertainment. You can start using these tuned and customized agents without distractions. The following two agents are based on the OpenAI platform. They’re free to use so far, so no need to […]
Generative Voice AI – resemble.ai
I don’t like the ending tone in some of the words, as I don’t use that tone; it seems a bit annoying. I also noticed (it may or may not be apparent to you from this quick sample) that the AI model seems to have a more British tonality than American for certain words and expressions. […]
Summarization & Detecting Topics by Deepgram Whisper AI
This is the last of a 3-part series on Deepgram’s Audio->Text Transcriber using their latest AI engine called Whisper. Be sure to read them in this order, if you haven’t already, to follow along best: To complete essentially all the features I care to implement, today I’m going to add the last 2 features in my […]
AI Transcription with Diarization
This is the last of a 3-part series on Deepgram’s Audio->Text Transcriber using their latest AI engine called Whisper. Be sure to read them in this order, if you haven’t already, to follow along best: This is a continuation of the post about Deepgram’s AI technology used for transcribing real-time or pre-recorded audio in virtually any […]
Powerful auto-transcription using AI (OpenAI’s Whisper)
Today, I’ll show you how to tap into the “world’s most powerful speech-to-text API” from our own applications. We’ll be using Deepgram, which is based on OpenAI’s Whisper AI STT technology. Deepgram claims to have trained their AI model with 10,000+ years’ worth of audio data. (For more of my posts about Text to Speech and vice […]
One of the newest TTS on the block
Today, I’ll share some information on one of the latest transformer-based TTS (text-to-speech) or text-to-audio models that’s generating some buzz. Yes, AI. While there are several really great models out there from IBM, Microsoft, and Google, for example, this one’s a little different. Let me introduce Bark. By the way, to see older posts on TTS related […]
Write your own program to use ChatGPT
With OpenAI’s ChatGPT being open to the public, it is now easy to harness the incredible power of an LLM like ChatGPT using any programming language of your choice. The good folks at OpenAI have good API documentation and hand out free API keys to anyone who is interested. The free account allows you to use […]