ai – Musings

A very effective, new AI-driven Video Creator

September 19, 2023September 19, 2023Leave a Comment

I am constantly scanning the AI space for new developments and newcomers in the arena. One of the most practical and stable tool I recently discovered is Pictory. It does video creation with a simple text script input, it generates videos even from a URL from any source (like a blog or article online) and […]

STEM

Microsoft + Gutenberg colab: Thousands of free audiobooks

September 12, 2023Leave a Comment

Project Gutenberg and Microsoft recently worked together to create thousands of free and open audiobooks using a new neural text-to-speech technology and Project Gutenberg’s public domain collection of e-books. Microsoft development team: “Our system allows users to customize an audiobook’s speaking speed and style, emotional intonation, and can even match a desired voice using a […]

Analytics Coding STEM

Impressive Voice->Text AI tool

September 12, 2023Leave a Comment

I came across this newest entrant to voice to text AI generation tool called AudioNotes. It converts your ‘random’/unstructured voice notes and unstructured text notes into structured text summaries using AI. Essentially, it’s a note-taking app that analyzes the voice notes and can generate many variations depending on the need. I decided to give it […]

Break Education Life STEM

Custom Useful, Fun AI Agents

May 19, 2023March 17, 2024Leave a Comment

I have created some custom AI agents for specific, contextual queries that hallucinate less and give you focused answers: for real-life uses or for entertainment. You can start using these tuned and customized agents without distractions. The following two agents are based on OpenAI platform. They’re free to use so far, so no need to […]

STEM

Generative Voice AI – resemble.ai

May 18, 2023May 18, 2023Leave a Comment

I don’t like the ending tone in some of the words as I don’t use that tone; it seems a bit annoying; also I noticed (may or may not be apparent to you from this quick sample), but the AI model seems to have a more British tonality than American for certain words and expressions. […]

Coding STEM

Summarization & Detecting Topics by Deepgram Whisper AI

May 13, 2023May 13, 2023Leave a Comment

This is the last of 3-part series on Datagram’s Audio->Text Transcriber using their latest AI engine called Whisper. Be sure to read them in this order, if you haven’t already, to follow along best: To complete essentially all the features I care to implement, today I’m going to add the last 2 features in my […]

Coding STEM

AI Transcription with Diarization

May 12, 2023May 13, 2023Leave a Comment

This is the last of 3-part series on Datagram’s Audio->Text Transcriber using their latest AI engine called Whisper. Be sure to read them in this order, if you haven’t already, to follow along best: This is a continuation of the post about Deepgram’s AI technology used for transcribing real-time or pre-recorded audio in virtually any […]

Coding STEM

Powerful auto-transcription using AI (openAI’s Whisper)

May 11, 2023May 13, 2023Leave a Comment

Today, I’ll show you how to tap into the “world’s most powerful speech-to-text API” from our own applications. We’ll be using Deepgram, which is based on OpenAI’s Whisper AI SST technology. Deepgram claims to have trained their AI model with10,000+ years worth of audio data. (For more of my posts about Text to Speech and vice […]

Coding STEM

One of the newest TTS on the block

April 24, 2023April 24, 2023Leave a Comment

Today, I’ll share some information on one of the latest transformer-based TTS (text-to-speech) or text-to-audio that’s generating some buzz. Yes, AI. While there are several really great models out there from IBM, Microsoft, Google for example, this one’s a little different. Let me introduce Bark. By the way, to see older posts on TTS related […]

STEM

Write your own program to use ChatGPT

April 4, 2023April 4, 2023Leave a Comment

With OPENAI’s ChatGPT being open to the public, it is now easy and possible to harness the incredible power of LLM like ChatGPT using any language of your choice. The good folks at OpenAI has good API documentations and handout free API keys to anyone who is interested. The free account allows you to use […]