EquationLearning

We provide a dataset of equations represented as PNG images and LaTeX code. This dataset is useful for learning to detect similarities between equations. Check it out HERE!

Usage

Building the dataset(s)

You have to build the respective datasets before you can train or evaluate the equation-encoder.

To build the training dataset, run:

python formula_data.py train path/to/weak_data_train

To build the evaluation dataset (Gold-Label Evaluation Data), run:

python formula_data.py eval path/to/eval2

To build the evaluation dataset (Hold-Out Data), run:

python formula_data.py test path/to/weak_data_test
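
The three build steps above can also be scripted. The following is a minimal sketch, assuming it is run from the directory that contains formula_data.py and that all three data folders sit under one common root. The helper names build_commands and build_all and the data_root argument are hypothetical. Only the script name, the split keywords and the example folder names come from the commands above.

```python
import subprocess
import sys
from pathlib import Path

# (split keyword, data folder) pairs taken from the commands above.
SPLITS = [
    ("train", "weak_data_train"),
    ("eval", "eval2"),
    ("test", "weak_data_test"),
]


def build_commands(data_root):
    """Return one argv list per dataset split (hypothetical helper)."""
    root = Path(data_root)
    return [
        [sys.executable, "formula_data.py", split, str(root / folder)]
        for split, folder in SPLITS
    ]


def build_all(data_root):
    """Run the three build commands in order, stopping at the first failure."""
    for cmd in build_commands(data_root):
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # "path/to" is a placeholder, as in the commands above.
    for cmd in build_commands("path/to"):
        print(" ".join(cmd[1:]))
```

Calling build_all("path/to") would run the same three commands one after another. Run as a script, the sketch only prints them.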

Pretraining

To pretrain the equation-encoder, run:

python pretrain_experiment.py with dataset=task data_source=path/to/weak_data_train

Replace task with either abstract or symbols, depending on which pretraining task you want to run.
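
Since only two task names are accepted, a small wrapper can catch a typo before a long pretraining run starts. This is a sketch only: the function pretrain_command is a hypothetical helper, not part of the repository, and it simply assembles the command line shown above.

```python
import sys

# The two pretraining tasks named above.
PRETRAIN_TASKS = ("abstract", "symbols")


def pretrain_command(task, data_source):
    """Build the pretraining argv list, rejecting unknown task names."""
    if task not in PRETRAIN_TASKS:
        raise ValueError(f"task must be one of {PRETRAIN_TASKS}, got {task!r}")
    return [
        sys.executable,
        "pretrain_experiment.py",
        "with",
        f"dataset={task}",
        f"data_source={data_source}",
    ]
```

The returned list can be passed to subprocess.run(..., check=True), for example with pretrain_command("symbols", "path/to/weak_data_train").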

Training

To train the equation-encoder, run:

python equen_experiment.py

To start from pretrained weights, run something like:

python equen_experiment.py with pretrained_weights=path/to/weights

path/to/weights should be something like equen_runs/x, where x is the number of the respective training routine.
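
If you do not remember the number of the most recent run, it can be looked up programmatically. The sketch below assumes that every run is stored in a subdirectory of equen_runs whose name is a plain integer. That layout is inferred from the equen_runs/x pattern above and is not confirmed here. The function latest_run is a hypothetical helper.

```python
from pathlib import Path


def latest_run(runs_dir="equen_runs"):
    """Return the highest-numbered subdirectory of runs_dir, or None."""
    root = Path(runs_dir)
    if not root.is_dir():
        return None
    # Keep only directories with purely numeric names (assumed run folders).
    numbered = [p for p in root.iterdir() if p.is_dir() and p.name.isdecimal()]
    return max(numbered, key=lambda p: int(p.name), default=None)
```

The names are compared as integers, so run 10 ranks above run 9. Non-numeric folders are ignored.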

Evaluation

To evaluate the trained weights from all epochs of a training routine, run:

python evaluation.py with run=path/to/run

path/to/run should be something like equen_runs/x, where x is the number of the respective training routine.

To evaluate on the Hold-Out data instead of the Gold-Label data, run:

python evaluation.py with run=path/to/run dataset=test
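
To evaluate one run on both datasets, the two invocations above can be generated together. This is a minimal sketch: evaluation_commands is a hypothetical helper and the dictionary keys gold_label and hold_out are labels chosen for this example. Only the command-line arguments come from the commands above.

```python
import sys


def evaluation_commands(run):
    """Return the argv lists for the Gold-Label and Hold-Out evaluations of a run."""
    base = [sys.executable, "evaluation.py", "with", f"run={run}"]
    return {
        # Default invocation, which evaluates on the Gold-Label data.
        "gold_label": base,
        # Same invocation with dataset=test for the Hold-Out data.
        "hold_out": base + ["dataset=test"],
    }
```

Each list can be passed to subprocess.run(..., check=True) in turn.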
