This project fetches transcripts of long-form online videos, including those over 60 hours long. It extracts the audio, splits it into chunks, and transcribes each chunk so that lengthy videos are handled efficiently.
- Download and process audio from video links.
- Split audio into manageable chunks for transcription.
- Transcribe audio chunks using advanced speech recognition libraries.
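The chunking step listed above can be sketched as follows. This is a minimal illustration, not the project's actual code: the function name `chunk_boundaries` is hypothetical, and it assumes chunk sizes are expressed in whole seconds.

```python
def chunk_boundaries(total_seconds: int, chunk_seconds: int = 300) -> list[tuple[int, int]]:
    """Return (start, end) offsets in seconds that cover the whole recording.

    Hypothetical helper: the name and the seconds-based unit are assumptions.
    """
    if total_seconds < 0:
        raise ValueError("total_seconds must be non-negative")
    if chunk_seconds <= 0:
        raise ValueError("chunk_seconds must be positive")

    boundaries = []
    start = 0
    while start < total_seconds:
        # The final chunk may be shorter than chunk_seconds.
        end = min(start + chunk_seconds, total_seconds)
        boundaries.append((start, end))
        start = end
    return boundaries


# A 60-hour recording split into 300-second chunks.
chunks = chunk_boundaries(60 * 3600, 300)
print(len(chunks))  # → 720
```

Each pair of offsets could then be handed to whatever audio library performs the actual slicing, so that only one chunk needs to be transcribed at a time.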
Clone the repository:
git clone https://github.com/Jitendra-GRL/transcript-tool.git
Navigate to the project directory:
cd transcript-tool
Install the required dependencies:
pip install -r requirements.txt
To fetch the transcript of a video, run the main script:
python src/main.py <Video-URL>
Replace <Video-URL> with the URL of the video you want to transcribe. Optional flags such as --chunk-size and --model-size can be passed after the URL, for example:
python src/main.py "https://example.com/video" --chunk-size 300 --model-size large-v3
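For readers curious how a command line like the one above might be parsed, here is a minimal sketch using Python's standard argparse module. It is not taken from src/main.py: the function name `build_parser`, the default values, and the help texts are all assumptions made for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the URL argument and the two flags shown above.

    Hypothetical sketch: defaults and help texts are assumptions.
    """
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Fetch the transcript of a long-form online video.",
    )
    parser.add_argument("video_url", metavar="Video-URL",
                        help="URL of the video to transcribe")
    # Assumed default; the real default (and unit) is not documented here.
    parser.add_argument("--chunk-size", type=int, default=300,
                        help="size of each audio chunk")
    # Assumed default; the accepted model names depend on the project.
    parser.add_argument("--model-size", default="large-v3",
                        help="speech recognition model size")
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(
    ["https://example.com/video", "--chunk-size", "300", "--model-size", "large-v3"]
)
print(args.video_url, args.chunk_size, args.model_size)
```

argparse converts the dashes in flag names to underscores, so --chunk-size is read back as `args.chunk_size`.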
Configuration settings, such as file paths, can be found in src/config/settings.py (if applicable).
This project is licensed under the MIT License - see the LICENSE file for details.