Skip to content

nurlanjalil/AI-Audio-Assistant

Repository files navigation

AI Audio Assistant 🎙️

Convert your audio content into text with AI-powered transcription.

Features ✨

  • Audio file upload (MP3, WAV, M4A)
  • Live voice recording (up to 5 minutes)
  • Automatic speech-to-text transcription with AI-powered error correction
  • Audio summarization with AI
  • Simple and intuitive interface
  • Drag-and-drop support
  • Azerbaijani speech recognition

Try It Out 🚀

Visit: AI Audio Assistant

How to Use 📝

  1. Choose your preferred method:
    • Upload an audio file (up to 5 minutes)
    • Record your voice directly in the browser
  2. For file upload:
    • Drag and drop your file into the upload area
    • Or click "Select File" to select from your device
  3. For voice recording:
    • Click "⏺ Start Recording" to start recording
    • Click "⏹ Stop" when finished
  4. Click "Convert to Text" to process your audio
  5. View your transcript and summary

Supported Formats 📁

  • MP3
  • WAV
  • M4A

Technologies Used 🛠️

  • OpenAI Whisper for high-accuracy transcription
  • GPT-4o for advanced transcription correction and summarization
  • FastAPI backend
  • Vanilla JavaScript frontend

License 📄

This project is licensed under the GNU General Public License v3.0 (GPL-3.0).

You may freely use, modify, and distribute the code, but any derivative work must also be distributed under the same GPLv3 license.

About

An AI-powered tool for Azerbaijani speech-to-text conversion and automatic summarization. Upload audio files or record live for quick and accurate transcription.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors