Skip to content

florijanqosja/Albanian-ASR

Repository files navigation

Albanian Transcriber using Machine Learning | DibraSpeaks

Logo

This project is an AI-based transcription tool for the Albanian language. It includes a web interface for labeling and validating speech data, and an API for processing audio.

Features

  • Automatic speech recognition for Albanian language.
  • User-friendly interface to label and validate speech data.
  • Dataset management tools.

Project Structure

  • api/: FastAPI backend service.
  • web/: React frontend application.
  • scripts/: Utility scripts for data processing and automation.
  • notebooks/: Jupyter notebooks for model training and experiments.

Getting Started

Prerequisites

  • Docker and Docker Compose

Installation

  1. Clone the repository:

    git clone https://github.com/florijanqosja/Albanian-ASR.git
    cd Albanian-ASR
  2. Set up environment variables:

    cp .env.example .env
  3. Run the application:

    docker-compose up --build -d
  4. Access the services:

Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.

License

This project is licensed under the MIT License - see the LICENSE file for details.

About

This project is an AI-based transcription tool for the Albanian language. The tool is designed to automatically transcribe Albanian speech to text using Python.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Sponsor this project

Packages

 
 
 

Contributors