Clone this repo and navigate into its home directory. Also make sure Bun is installed.
From within the ./scraper
directory run:
python filter_words.py
bun scraper.ts
From within the home director of the repo
python convert_data_to_paper_data.py
Set the hyperparameters in the file ./train_model/mix/train_pair_bert.py
. Then run:
python -m train_model.mix.train_pair_bert
Code found in ./train_model
is an adaptation of BertForRD.