Search Results

Found 399 repositories (showing 30)

gTTS

pndurette

💛71

Python library and CLI tool to interface with Google Translate's text-to-speech API

2.6k

383

MIT

Python

Updated 12 hours ago

cli, gtts, pypi, +6

vid2cleantxt

pszemraj

🧡61

Python API & command-line tool to easily transcribe speech-based video files into clean text

221

Apache-2.0

Jupyter Notebook

Updated 3 weeks ago

audio, audio-processing, keyword, +17

gemini-2-podcast

agituts

🧡55

A Python-based tool that generates engaging podcast conversations using Google's Gemini 2.0 Flash Experimental model for script generation and text-to-speech conversion.

115

Python

Updated 2 weeks ago

gemini-2, gemini-2-0-flash-exp, podcast

PyBot-A-ChatBot-For-Answering-Python-Queries-Using-NLP

abhishek305

🧡65

Pybot aims to change how learners study the Python programming language by making it more interactive. The chatbot tries to answer almost any Python-related issue or query the user raises. We use NLP to improve the chatbot's efficiency, and we plan to add a voice feature for more interactivity. With NLP, developers can organize and structure knowledge to perform tasks such as automatic summarization, translation, named entity recognition, relationship extraction, sentiment analysis, speech recognition, and topic segmentation. NLTK has been called “a wonderful tool for teaching, and working in, computational linguistics using Python,” and “an amazing library to play with natural language.”
The main issue with text data is that it is all in text format (strings), whereas machine learning algorithms need some sort of numerical feature vector to perform their task. So before starting any NLP project, the text has to be pre-processed. Case normalization converts the entire text to uppercase or lowercase, so that the algorithm does not treat the same word in different cases as different words. Tokenization is the process of converting normal text strings into a list of tokens, i.e. the words we actually want; a sentence tokenizer finds the list of sentences and a word tokenizer finds the list of words in a string. Noise removal strips everything that is not a standard number or letter. Stop-word removal drops extremely common words that appear to be of little value in selecting documents that match a user need; these words are excluded from the vocabulary entirely.
Stemming is the process of reducing inflected (or sometimes derived) words to their stem, base, or root form, generally a written word form. For example, stemming the words “Stems”, “Stemming”, “Stemmed”, and “Stemtization” would yield the single word “stem”. A slight variant of stemming is lemmatization. The major difference is that stemming can often create non-existent words, whereas lemmas are actual words: a root stem is not necessarily something you can look up in a dictionary, but a lemma is. For example, “run” is the base form of words like “running” or “ran”, and the words “better” and “good” share the same lemma, so they are considered the same.
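
The pre-processing steps described above can be sketched in a few lines of Python. This is a minimal, standard-library illustration and not the repository's code: the description names NLTK, but the stop-word list, the suffix-stripping `toy_stem` function, and the `LEMMAS` lookup table below are all hypothetical stand-ins chosen to keep the example self-contained.

```python
import re

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"the", "a", "an", "is", "are", "and", "of", "to", "in"}

# Hand-written lemma lookup (an assumption; real lemmatizers use a dictionary).
LEMMAS = {"running": "run", "ran": "run", "better": "good"}


def toy_stem(word):
    """Strip one common suffix; a crude stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "s"):
        if (word.endswith(suffix) and not word.endswith("ss")
                and len(word) - len(suffix) >= 3):
            stem = word[: -len(suffix)]
            # Collapse a doubled final consonant: "stemm" becomes "stem".
            if len(stem) >= 2 and stem[-1] == stem[-2] and stem[-1] not in "aeiou":
                stem = stem[:-1]
            return stem
    return word


def lemmatize(word):
    """Return the dictionary form if known, else the lowercased word."""
    return LEMMAS.get(word.lower(), word.lower())


def preprocess(text):
    """Lowercase, remove noise, tokenize, drop stop words, then stem."""
    text = text.lower()                          # case normalization
    text = re.sub(r"[^a-z0-9\s]", " ", text)     # noise removal
    tokens = text.split()                        # whitespace tokenization
    tokens = [t for t in tokens if t not in STOP_WORDS]  # stop-word removal
    return [toy_stem(t) for t in tokens]         # stemming


if __name__ == "__main__":
    print(preprocess("Stems, Stemming and Stemmed!"))  # ['stem', 'stem', 'stem']
    print(lemmatize("better"))                         # good
```

The stemmer works purely on spelling, which is why it can produce non-words, while the lemma table maps “better” to “good” even though the two share no letters.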

Python

Updated 1 day ago

nlp, nltk-library, numpy, +4

gemini-2-tts

agituts

❤️45

AI-Powered Podcast Generator: A Python-based tool that converts text scripts into realistic audio podcasts using Google's Generative AI API. This project leverages advanced text-to-speech technology to create dynamic, multi-speaker conversations with customizable voices.

Python

Updated 1 month ago

gemini-2-0-flash-exp, generative-language-api, google-ai, +1

VoiceTyper-Pro

perrypixel

❤️40

VoiceTyper-Pro is an advanced speech-to-text dictation tool built with Python and powered by the Deepgram API. Alternative to Mac Whisper, Voice Access, and other voice typing tools.

MIT

Python

Updated 3 months ago

deepgram, mac, voice-recognition, +2

text-to-speech

Atamyrat2005

❤️45

There are several APIs available to convert text to speech in Python. One such API is the Google Text to Speech API, commonly known as the gTTS API. gTTS is a very easy-to-use tool that converts entered text into audio, which can be saved as an MP3 file.

Python

Updated 2 months ago

gtts, gtts-api, python-3, +2

Text-to-Voice__Voice-to-Text_2nd-Semester

arunsanjeevms

❤️45

🔊🗣️ Python enthusiast | Built text-to-voice & voice-to-text tool! Seamlessly convert text to speech & speech to text. Enhance accessibility & communication! 📈

Python

Updated 2 months ago

InterviewWarmupLocal

amar-chheda

🧡50

AI-powered local interview prep tool. Practice answering custom questions with speech recognition and get AI feedback based on your resume and job description. Features: local execution, speech-to-text, custom questions, AI analysis. Built with Python, Streamlit, Ollama, and LlamaIndex.

Apache-2.0

Python

Updated 2 months ago

Albanian-ASR

florijanqosja

❤️35

This project is an AI-based transcription tool for the Albanian language. The tool is designed to automatically transcribe Albanian speech to text using Python.

MIT

Jupyter Notebook

Updated 3 months ago

ai, albanian, albanian-language, +8

Social-Media-Video-Transcriber

TheSethRose

❤️45

A powerful Python tool for downloading and transcribing videos from multiple platforms (TikTok, YouTube, Facebook, Instagram) using Parakeet-MLX for high-quality speech-to-text conversion.

Python

Updated 2 months ago

-IMAGE-TO-SPEECH-CONVERTOR-

LeadingIndiaAI

❤️35

The aim of the project was to convert an image to speech. An image is processed and segmented to identify the text in it. The characters are then combined into words and saved as a text file, and this text file is converted to speech. We use two tools to complete the image-to-text-to-speech conversion: OCR (Optical Character Recognition) and TTS (Text to Speech) engines. OCR optically recognizes the characters in an image, and TTS converts the text file to speech. The audio output can be played at runtime using the Python library Pygame.

Python

Updated 10 months ago

AI-Automated-Short-Video-Generator-Editor-Uploader-For-Views

YoussefBechara

🧡65

Python automation tool that creates, edits, and uploads TikTok videos using AI. Generates scripts, text-to-speech, captions, and visual content. Fully automated pipeline from idea to published video.

Python

Updated 2 days ago

ai, ai-automation, faceless, +10

analysis-pipelines

CRANE-toolbox

❤️45

Project CRANE (Crisis Racism and Narrative Evaluation) aims to support researchers and anti-racist organisations that wish to use state-of-the-art text analysis algorithms to study how specific events impact online hate speech and racist narratives. CRANE Toolbox is a Python package: once installed, the tools in CRANE are available as functions that users can use in their Python programs or directly through their terminal. CRANE targets users with basic programming but no machine learning skills.

AGPL-3.0

Python

Updated 1 week ago

covid19, data-science, data-visualization, +10

SpeedReader

DeadlyApps

❤️40

Python-based tool that uses text-to-speech to read books or study material quickly.

MIT

Python

Updated 1 year ago

utter

rsgalloway

❤️40

Text to speech python library and command line tool, with optional language translation

NOASSERTION

Python

Updated 5 years ago

edge-tts-go

difyz9

🧡50

`edge-tts-go` is a Go library and command-line tool that allows you to use Microsoft Edge's online text-to-speech service without needing Windows or the Edge browser. This is a Go port of the Python [edge-tts](https://github.com/rany2/edge-tts) package.

LGPL-3.0

Updated 1 week ago

PDFToAudio

ExceptedPrism3

❤️40

"PDF To Audio" is a Python tool that transforms PDF documents into audio files using OCR and Text-to-Speech technology. Ideal for accessibility and auditory learning, it supports multiple languages, parallel processing, and smart rate limit handling.

MIT

Python

Updated 5 months ago

pdf, pdf-converter, pdf-to-audio, +5

KaldiLongAligner

zelaki

❤️35

Speech to Text Alignment tool implemented with Python and Kaldi

Python

Updated 1 year ago

forced-alignment

CrispTTS

CrispStrobe

❤️45

(wip) python command-line Text-to-Speech (TTS) tool esp. for German, leveraging numerous endpoints like orpheus, piper, outetts, kokoro, csm, edge, coqui, kartoffelbox, etc

EUPL-1.2

Python

Updated 2 months ago

chatterbox, chatterbox-tts, coqui-tts, +9

TextToSpeechPython

thomasthaddeus

❤️30

This repository contains a Python-based text-to-speech (TTS) program that converts written text into spoken words. Leveraging state-of-the-art TTS libraries, this program aims to provide clear and natural-sounding audio output for a variety of applications, from audiobook generation to accessibility tools.

MIT

Python

Updated 1 year ago

python, python3, ssml, +3

Tabib-HealthCare-Chatbot

aakif123

❤️35

Purpose: Major Project. Team Size: 4. Duration: 10 Months [Oct. 1, 2021 - June 31, 2022]. Key Skills: Rasa AI, Python, NLP, Flask, HTML, CSS, JavaScript. It is a web-based chatbot to automate healthcare management with audio assistance. Users can get immediate medication for their symptoms and book appointments via an audio feature. In addition to text assistance, this chatbot has an audio assistance feature, which eliminates the restrictions that visually impaired patients face with currently available text-enabled healthcare chatbots. This voice-enabled chatbot was designed and developed using the Rasa interface for the backend, and the Web Speech API and the Talkify API for voice input and output, respectively. Title: 'TaBiB: Chatbot for Healthcare Automation with Audio Assistance using Artificial Intelligence'. The project was presented and approved at the 6th National Conference of Science and Engineering (NCSEM), 2022. Tools used: Rasa AI, Python, NLP, Flask, Web Speech API, Talkify API, HTML, CSS in Atom Editor.

Python

Updated 2 months ago

DDoS

iDarrylPiul

❤️40

# Anonymous DDoS Tool

This DDoS tool was written by Muneeb Khurram. The script may be used for educational purposes only; see License. A GUI has now been added, with only two dependencies: Pyfiglet and GoLang.

## Installation

#### Dependencies

a) pyttsx3 (text-to-speech), just to make it cooler. Not necessary for the GUI.

```
pip3 install pyttsx3
```

b) pyfiglet (because it's necessary to be cooler). Install Figlet in Kali Linux, as some results show that pyfiglet doesn't show anything.

```
sudo apt-get install figlet
```

```
pip3 install pyfiglet
```

c) colorama (another step towards CLI beauty)

```
pip3 install colorama
```

d) os (already in Python 3; part of the standard library, so nothing needs to be installed)

e) socket (for an upcoming release; also part of the standard library, so nothing needs to be installed)

f) wheel (to make a wheel of colorama)

```
pip3 install wheel
```

g) GoLang (download for your OS from Golang.org/dl/)

### MacOS

> https://medium.com/golang-learn/quick-go-setup-guide-on-mac-os-x-956b327222b8

### Windows

> https://www.geeksforgeeks.org/how-to-install-go-on-windows/

### Linux

> https://tecadmin.net/install-go-on-ubuntu/

```
sudo apt-get install golang
```

or

```
sudo apt-get install golang-go
```

### Kali Linux

Kali has GoLang pre-installed. Check by typing:

```
go
```

If not, follow the steps shown above for Linux/Ubuntu.

## Other

Use Install_Dependancies.py to install all of these except GoLang (it has to be downloaded manually).

```
python3 Install_Dependancies.py
```

## Usage

```
python3 Python-Script.py
```

## GUI Requirements

> Install Pyfiglet and GoLang as above and you are ready to go.

Use this if you cannot satisfy one of the above dependencies. Very easy to use. Recommended for beginners using Windows. If you use this on a Linux distro and get "tkinter not found", install tkinter.

```
python3 DDoS_GUI.py
```

## Tested OS/ENV

- Linux Stable Release 2020
- Kali Linux 2019.3
- Google Colab
- Windows 10
- MacOS X 10.10.5 onwards (OS X Yosemite)

> This should run on all environments, even on the oldest OSes, provided the above dependencies are satisfied.

## Images

Showing some interfaces and their interactive environments.

![AnonymousDDoS_GUI](https://munublogs.files.wordpress.com/2020/05/screen-shot-2020-05-13-at-8.11.04-pm.png)
![Terminal_Photos](https://munublogs.files.wordpress.com/2020/05/screen-shot-2020-05-13-at-8.09.41-pm.png)
![Terminal_Photos2](https://munublogs.files.wordpress.com/2020/05/screen-shot-2020-05-13-at-8.10.30-pm.png)

## License

All copyrights reserved to Muneeb Khurram; the HULK-DoS tool's copyrights belong to its authors as well. For all the script kiddies out there, welcome to DDoS heaven. See License before using.

NOASSERTION

Python

Updated 8 months ago

transcript-ai

ombharatiya

💛70

AI-powered audio transcription tool using OpenAI Whisper. Convert speech to text with support for 8+ formats, batch processing, and multilingual transcription. Python CLI tool for developers.

MIT

HTML

Updated 4 days ago

ai, ai-voice-generator, audio-processing, +6

VideoNews

akumanomi1988

❤️45

This repository features a Python script that automates the creation of videos from news articles. It scrapes and processes news content, converts text into speech, and assembles the final video using editing tools. The project aims to simplify and accelerate the production of video content from written news

Python

Updated 1 month ago

realtime-voice-and-vision-assistant

varadganjoo

❤️25

A Python-based voice assistant for Windows that performs system control, web searches, weather updates, and more. Integrates Text-to-Speech, real-time transcription, and various tools for seamless voice interaction. Includes a simple GUI for microphone control and supports OpenWeather, DuckDuckGo, and Groq Vision AI for enhanced functionality.

GPL-2.0

Python

Updated 1 year ago

gptspeak

gptscript-ai

❤️25

Text-to-speech CLI tool and Python library using OpenAI's TTS API

Apache-2.0

Python

Updated 6 months ago

video2text

stellarloop

❤️40

Python API & command-line tool to easily transcribe speech-based video files into clean text

Apache-2.0

Jupyter Notebook

Updated 6 months ago

audio, boundary-detection, nlp, +4

postgrad-ai-nlp2-voice-ui

tjas

❤️40

A Voice User Interface tool for Text-to-Speech and Speech-to-Text, built with Python and Django Framework, to solve the proposed exercise in "Cognitive Computing 2: Voice User Interface" discipline.

MIT

JavaScript

Updated 4 months ago

ai, artificial-intelligence, django, +13

real-time-translation-tool

Sai010607

🧡50

🎙️ Real-Time Translator – A real-time speech-to-speech translation tool using Python, Whisper AI, gTTS, and Streamlit. Supports audio, video, and text translation across multiple languages with live playback and subtitle support.

MIT

Python

Updated 2 months ago
