learner-corpus

Here are 16 public repositories matching this topic...

ELI-Data-Mining-Group / PELIC-dataset

The University of Pittsburgh English Language Institute Corpus (PELIC) dataset

corpus esl lexical-analysis longitudinal-data concordancer tesol second-language-acquisition learner-corpus intensive-english-program english-for-academic-purposes second-language-writing

Updated Mar 31, 2023
HTML

anaistack / cefr-asag-corpus

A corpus of short answers written by learners of English and graded with CEFR levels

natural-language-processing english automated-essay-scoring efl learner-corpus cefr essay-grading automated-essay-grading automated-short-answer-grading automated-short-answer-scoring essay-scoring educational-nlp bea-workshop

Updated Dec 17, 2021

upunaprosk / grammar-checker

Essay Grammar Checker trained on REALEC Corpus using SpaCy

spacy grammar-checker grammar-errors learner-corpus gec spacy-pipeline automatic-annotation l1-interference

Updated May 26, 2025
Jupyter Notebook

ELI-Data-Mining-Group / PELIC-spelling

Information and code about applying spelling correction to the PELIC dataset

spelling-correction learner-corpus pelic intensive-english-program

Updated Feb 17, 2021
Jupyter Notebook

elmiram / russian_learner_corpus

Russian Learner Corpus, a platform for corpus search and annotation

django annotation corpus-linguistics russian-corpus learner-corpus

Updated Oct 25, 2018
JavaScript

upunaprosk / corpora-manipulation

Tool for converting error corpora to parallel datasets

corpus-linguistics english-learning learner-corpus gec parallel-data-processing learner-errors

Updated Jun 7, 2022
Python

jirkle / ErrCorp

wiki tool wikipedia corpora error-corpora learner-corpus nlp-tool

Updated May 26, 2017
Python

Aniezka / inspector-lab

The implementation of the Inspector tool.

python corpus-linguistics learner-corpus linguistic-complexity

Updated Apr 26, 2021
Jupyter Notebook

Aniezka / REALEC

Statistics on some error categories from the REALEC corpus.

nlp english corpus-linguistics learner-corpus learner-errors

Updated Sep 15, 2019
Python

Aniezka / syntactic-complexity

Code for the thesis "A Corpus-Based Case Analysis on Syntactic Complexity in Russian ESL Learners’ Writing".

python corpus-linguistics learner-corpus linguistic-complexity

Updated Feb 4, 2022
Jupyter Notebook

Aniezka / CAF

Supplementary material for "Correlations between accuracy, complexity, and task type: Learner corpus research"

r corpus-linguistics learner-corpus linguistic-complexity

Updated May 12, 2023
R

Aniezka / Course_work

Coursework on "Clustering of English texts on the basis of automated extraction of key properties"

machine-learning text-mining data-mining clustering learner-corpus

Updated May 30, 2020
Jupyter Notebook

tlu-dt-nlp / Estonian-CEFR-Assessment

Dataset of Estonian L2 writings and source code used to train and test machine learning models for CEFR-based classification.

machine-learning text-classification language-learning learner-corpus language-assessment cefr-prediction

Updated Feb 12, 2026
Python

atts-dum / daily-english-journal

language-learning linguistics open-data aging english-learning sociolinguistics learner-corpus deepl digital-legacy daily-journal ai-assisted human-ai-interaction applied-linguistics autoethnography generative-ai ai-assisted-writing japanese-learner

Updated Feb 14, 2026

upunaprosk / writing-assistant

Writing assistant

learner-corpus learner-errors writing-assistant l1-interference

Updated Jun 9, 2022
Python

lgaliero / LearnTextNorm-De

Text Normalization on Learner Texts (South Tyrolean German as an L2)

german-language learner-corpus text-normalization german-nlp llm-inference

Updated Feb 11, 2026
Python
