Making-Adversarial-perterbation-for-Text-classification

Dataset

The dataset consists of 2,225 documents from the BBC News website, corresponding to stories in five topical areas published in 2004-2005.
Class Labels: 5 (business, entertainment, politics, sport, tech)

Classification model

Multinomial Naive Bayes
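
To illustrate how a Multinomial Naive Bayes classifier scores a document, here is a minimal pure-Python sketch with add-alpha smoothing. The class name `TinyMultinomialNB`, the toy documents, and the smoothing value are assumptions made for illustration; the repository's own training script may well rely on a library implementation instead.

```python
import math
from collections import Counter, defaultdict


class TinyMultinomialNB:
    """Hypothetical minimal Multinomial Naive Bayes over token lists."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # add-alpha (Laplace) smoothing; assumed value

    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.vocab = {tok for doc in docs for tok in doc}
        self.class_counts = Counter(labels)
        self.token_counts = defaultdict(Counter)
        for doc, label in zip(docs, labels):
            self.token_counts[label].update(doc)
        self.total_docs = len(docs)
        return self

    def _log_score(self, doc, label):
        # log P(class) + sum over tokens of log P(token | class)
        score = math.log(self.class_counts[label] / self.total_docs)
        counts = self.token_counts[label]
        denom = sum(counts.values()) + self.alpha * len(self.vocab)
        for tok in doc:
            if tok in self.vocab:  # tokens never seen in training are ignored
                score += math.log((counts[tok] + self.alpha) / denom)
        return score

    def predict(self, doc):
        return max(self.classes, key=lambda c: self._log_score(doc, c))


# Toy training data (made up for this sketch, not taken from the BBC corpus)
train_docs = [
    ["market", "share", "profit"],
    ["match", "goal", "team"],
    ["profit", "bank", "market"],
    ["team", "coach", "match"],
]
train_labels = ["business", "sport", "business", "sport"]

model = TinyMultinomialNB().fit(train_docs, train_labels)
prediction = model.predict(["bank", "profit", "goal"])
```

Because the score is a sum of per-token log-probabilities, altering or removing a handful of high-weight tokens can be enough to change the predicted class.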

Preprocessing

Convert all letters to lowercase
Remove punctuation
Tokenize words
Remove stopwords
Lemmatize
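
The steps above can be sketched with the standard library alone. This is a rough illustration, not the code in preprocess.py: the stopword list is a tiny made-up sample, tokenization is a plain whitespace split, and `naive_lemmatize` is a hypothetical stand-in that only strips a plural "s" (a real pipeline would use a proper lemmatizer from an NLP library).

```python
import string

# Tiny illustrative stopword list (assumption); real pipelines use fuller lists.
STOPWORDS = {"a", "an", "the", "is", "are", "was", "were", "in", "on", "of", "and", "to"}


def naive_lemmatize(token):
    """Crude stand-in for a lemmatizer: strips a trailing plural 's' only."""
    if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
        return token[:-1]
    return token


def preprocess(text):
    # 1. Convert all letters to lowercase
    text = text.lower()
    # 2. Remove punctuation characters
    text = text.translate(str.maketrans("", "", string.punctuation))
    # 3. Tokenize on whitespace
    tokens = text.split()
    # 4. Remove stopwords
    tokens = [t for t in tokens if t not in STOPWORDS]
    # 5. Lemmatize (stand-in implementation)
    return [naive_lemmatize(t) for t in tokens]


tokens = preprocess("The markets were up, and the banks reported profits!")
# tokens == ["market", "up", "bank", "reported", "profit"]
```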

Make perturbation

Randomly swap characters in each significant word collected from the training corpus (other words are perturbed with a fixed probability).
Replace some characters with similar ones.
Drop some frequent words.
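
A minimal sketch of the three perturbations follows. Everything named here is hypothetical: the `SIMILAR` look-alike map, the choice to swap adjacent characters, the function names, and the probability defaults are assumptions for illustration, since this README does not show the actual substitutions or probabilities used by the project.

```python
import random

# Hypothetical look-alike map; the project's real substitutions are not shown here.
SIMILAR = {"a": "@", "e": "3", "i": "1", "o": "0", "s": "$"}


def swap_adjacent(word, rng):
    """Swap one randomly chosen pair of adjacent characters."""
    if len(word) < 2:
        return word
    i = rng.randrange(len(word) - 1)
    chars = list(word)
    chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)


def substitute_similar(word, rng, prob):
    """Replace each mappable character with its look-alike with probability prob."""
    return "".join(
        SIMILAR[c] if c in SIMILAR and rng.random() < prob else c for c in word
    )


def perturb(tokens, significant, frequent, rng,
            swap_prob=0.3, sub_prob=0.3, drop_prob=0.5):
    out = []
    for tok in tokens:
        # Drop some frequent words
        if tok in frequent and rng.random() < drop_prob:
            continue
        # Always swap in significant words; swap in others with a fixed probability
        if tok in significant or rng.random() < swap_prob:
            tok = swap_adjacent(tok, rng)
        # Replace some characters with similar ones
        tok = substitute_similar(tok, rng, sub_prob)
        out.append(tok)
    return out


rng = random.Random(0)  # seeded so a run is repeatable
perturbed = perturb(
    ["said", "market", "profit", "rose"],
    significant={"market", "profit"},
    frequent={"said"},
    rng=rng,
)
```

Passing a seeded `random.Random` instance keeps the perturbation reproducible, which makes it easier to compare accuracy before and after on the same inputs.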

Run

python3 controller.py

Result

Input	Accuracy	F1
News report	0.964	0.963
News report with perturbation	0.424	0.396
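
For reference, the two metrics in the table can be computed as in the sketch below. The README does not say how F1 is averaged across the five classes, so the macro average used here is an assumption, and the labels are toy values rather than the project's predictions.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (macro averaging is assumed)."""
    scores = []
    for label in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never appears
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy labels for illustration only
y_true = ["sport", "sport", "tech", "tech"]
y_pred = ["sport", "tech", "tech", "tech"]
acc = accuracy(y_true, y_pred)
f1 = macro_f1(y_true, y_pred)
```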

