Bisaya-Tagalog Morphological Analyzer

A finite-state morphological analyzer that performs morphological decomposition and code-switching detection on mixed Bisaya-Tagalog text using a python-based Non-Deterministic Finite Automaton (NFA).

Project Structure

data_v2/: Lexicon data files (JSON) - Prefix, Infix, Suffix, Circumfix tables, and Root Lexicons.
src/python/: Core logic including FSM implementation (bindings.py) and Web Server (server.py).
web/: Frontend assets (HTML, CSS, JS).
docs/: Formal documentation and academic proposal.

Prerequisites

Python 3.x

Build and Run

Install Python dependencies:
```
pip install -r requirements.txt
```
Run the Web Server:
```
python src/python/server.py
```

Usage

Access the web interface at http://localhost:8000. Enter mixed Bisaya-Tagalog text to analyze morphology and detect code-switching.

System Architecture

The analyzer helps linguistic research by decomposing words using a Finite-State Morphotactic approach. The system consists of four interacting components controlled by a global automaton:

PrefixFSM: Lexical automaton for prefix tokens.
InfixFSM: Handling ε-transitions for inserting morphemes.
RootLexicon: Stem lexicon lookup structure for validation.
SuffixCircumfixFSM: Lexical automaton for suffixes and circumfix constraints.

Finite-State Morphotactic Output Format

The system outputs a formal morphotactic parse string where morphemes are separated with + and annotated with tags derived from the FSM states.

Examples:

magsulat -> mag[PFX] + sulat[ROOT]
sinulat -> in[INFX] + sulat[ROOT]
kasulatan -> ka[CIRCUMFIX_PREFIX] + sulat[ROOT] + an[CIRCUMFIX_SUFFIX]

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data_v2		data_v2
docs		docs
extension		extension
scripts		scripts
src/python		src/python
web		web
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt
test_loader.py		test_loader.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bisaya-Tagalog Morphological Analyzer

Project Structure

Prerequisites

Build and Run

Usage

System Architecture

Finite-State Morphotactic Output Format

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Bisaya-Tagalog Morphological Analyzer

Project Structure

Prerequisites

Build and Run

Usage

System Architecture

Finite-State Morphotactic Output Format

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages