Skip to content

wooii/interpreter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Interpreter: Real-Time Meeting Assistant

A real-time audio recording, transcription, and interpretation tool for meetings and live conversations. This project is designed for scenarios where speed and accuracy are critical, such as live interpretation or meeting environments.

Key Features

  • Real-time audio recording and transcription with continuous listening
  • Speaker identification: print out the words received and indicate who is speaking
  • Option to use either advanced speech recognition APIs (OpenAI Whisper or Google Speech Recognition) or local models for transcription
  • Highlight keywords and uncommon words in the transcript
  • Translate highlighted/uncommon words for better understanding
  • Designed for live interpretation, meeting assistance, and second language conversations

Requirements

  • Python 3.8+
  • openai-whisper
  • Other dependencies listed in pyproject.toml

Installation

  1. Open Terminal and navigate to the directory containing pyproject.toml:
    cd /Users/chen/Library/CloudStorage/Dropbox/Code/py/ai/interpreter
  2. Install in development mode:
    pip install -e .

Usage

  • To run the real-time interpreter:
    python interpreter/main.py
  • To compare different speech recognition models:
    python interpreter/model_comparison.py

Roadmap

  • Implement real-time, continuous audio recording and streaming
  • Integrate Whisper API/local model, and evaluate other advanced models
  • Add speaker identification and display who is talking
  • Highlight and translate keywords/uncommon words in real time
  • Build user interface (optional)
  • Add Whisper model finetuning (future)

License

MIT

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages