This project is an AI-based transcription tool for the Albanian language. It includes a web interface for labeling and validating speech data, and an API for processing audio.
- Automatic speech recognition for Albanian language.
- User-friendly interface to label and validate speech data.
- Dataset management tools.
api/: FastAPI backend service.web/: React frontend application.scripts/: Utility scripts for data processing and automation.notebooks/: Jupyter notebooks for model training and experiments.
- Docker and Docker Compose
-
Clone the repository:
git clone https://github.com/florijanqosja/Albanian-ASR.git cd Albanian-ASR -
Set up environment variables:
cp .env.example .env
-
Run the application:
docker-compose up --build -d
-
Access the services:
- Web Interface: http://localhost:3000
- API Documentation: http://localhost:8000/docs
Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.
This project is licensed under the MIT License - see the LICENSE file for details.
