GitHub

PS3 code based for ECML-PKDD paper

requirements :
python = 3.6
scikit-learn
nltk
pandas
numpy

For simplicity we used jupyter notebook and use Founta dataset as it represent the most skewed dataset.
we run the experiment 10 times and take the average for the report in the paper.

For the experiment such as in Figure 3 (in the paper), please modify the dataset to have similar ratio as in the paper
.

If you use our implementation please cite the paper as
@inproceedings{Fajri2020PS3,
title={PS3:Partition-based Skew-Specialized Sampling for Batch Mode Active Learning in Imbalanced Text Data},
author={Ricky Maulana Fajri and Samaneh Khoshrou and Robert Peharz and Mykola Pechenizkiy},
booktitle={ECML-PKDD},
year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
result		result
training		training
PS3_final.ipynb		PS3_final.ipynb
README.md		README.md
dataPreparation.py		dataPreparation.py
final_dataset.csv		final_dataset.csv
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

rmfajri/PS3

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages