update README.md #36

Open
wants to merge 54 commits into
base: master
54 commits
01fcbf9
update README.md
smiyawaki0820 May 21, 2020
1a19d32
create parallel corpus from CoNLL
smiyawaki0820 May 23, 2020
49f6dbe
update preprocess/
smiyawaki0820 May 23, 2020
6295215
update preprocess/
smiyawaki0820 May 23, 2020
ced51af
update README.md
smiyawaki0820 May 23, 2020
3c708ff
update preprocess/
smiyawaki0820 May 23, 2020
39d680d
update README.md
smiyawaki0820 May 23, 2020
b8ef8fd
add srl_setting.sh
smiyawaki0820 May 23, 2020
85b9b6a
add train/
smiyawaki0820 May 23, 2020
419ca0c
update config.sh
smiyawaki0820 May 23, 2020
d42440e
add run/train .sh
smiyawaki0820 May 23, 2020
1f854d7
update README.md
smiyawaki0820 May 23, 2020
d947521
update run.sh
smiyawaki0820 May 23, 2020
bdb01a3
update preprocess/
smiyawaki0820 May 26, 2020
3030ffe
comment preprocess.py
smiyawaki0820 May 26, 2020
ddebf89
update preprocess/
smiyawaki0820 May 26, 2020
bd073bf
update preprocess/
smiyawaki0820 May 26, 2020
d6d00dd
update preprocess/
smiyawaki0820 May 26, 2020
4bbe54a
update config.sh
smiyawaki0820 May 26, 2020
7c7ff04
comment train.py
smiyawaki0820 May 26, 2020
78d1a38
update run.sh
smiyawaki0820 May 26, 2020
4b9d951
modified _
smiyawaki0820 May 26, 2020
475b6cd
add train
smiyawaki0820 May 26, 2020
9c3cfb1
add comment
smiyawaki0820 May 26, 2020
42130b8
rename train/ to sh/
smiyawaki0820 May 26, 2020
9383b6d
add sh/evaluate.sh
smiyawaki0820 May 26, 2020
ae64265
add sh/srl_eval.sh
smiyawaki0820 May 26, 2020
edb4452
update sh/srl_eval.sh
smiyawaki0820 May 26, 2020
6869ad3
add eval module
smiyawaki0820 May 26, 2020
4da228a
add logging into generate.py
smiyawaki0820 May 27, 2020
1b8f92b
update sh
smiyawaki0820 May 27, 2020
294e6dc
add comment
smiyawaki0820 May 27, 2020
56c469d
overwrite dicts/dict.src.txt
smiyawaki0820 May 28, 2020
7e260dc
update preprocess/
smiyawaki0820 May 28, 2020
c3e1050
update preprocess/
smiyawaki0820 May 28, 2020
1c6bbd5
add creating small
smiyawaki0820 May 28, 2020
2e5fac9
update run.sh
smiyawaki0820 May 31, 2020
f6f9c17
comment fairseq/trainer.py
smiyawaki0820 May 31, 2020
189aadc
add sh/create_datasets
smiyawaki0820 May 31, 2020
d760fcf
update config.sh
smiyawaki0820 May 31, 2020
fbfa70d
update sh/create_datasets
smiyawaki0820 May 31, 2020
bbcbd79
comment fairseq/criterions/cross_entropy.py
smiyawaki0820 May 31, 2020
1dd8473
update fairseq/
smiyawaki0820 May 31, 2020
10f30b5
add sh/src/JSON_to_CoNLL.py
smiyawaki0820 Jun 1, 2020
a0ad9c0
memo fairseq/criterions/cross_entropy.py
smiyawaki0820 Jun 1, 2020
657913d
add eval_step()
smiyawaki0820 Jun 1, 2020
4116797
update run.sh
smiyawaki0820 Jun 1, 2020
a861b31
update train.py
smiyawaki0820 Jun 1, 2020
82b590f
add srl eval
smiyawaki0820 Jun 2, 2020
7a5b53f
update sh/
smiyawaki0820 Jun 2, 2020
051d22c
update train.py
smiyawaki0820 Jun 2, 2020
4119eda
data 10% train
smiyawaki0820 Jun 3, 2020
ffcea20
update run.sh
smiyawaki0820 Jun 3, 2020
cca41a5
update train.py
smiyawaki0820 Jun 3, 2020
26 changes: 24 additions & 2 deletions README.md
@@ -1,4 +1,26 @@
-# Introduction
+# Experimental Settings (my extension)
+
+## Experimental Settings
+* Various experimental values for SRL are written in `srl/experiment.config`
+
+
+## fairseq-Preprocess
+* using WSJ as test set (need to rewrite the following file to use Brown)
+``` running fairseq-preprocess
+$ bash preprocess/create_preprocess_data.sh
+```
+
+## fairseq-Train
+* if change the save dir, rewrite `OUT=*` in `config.sh`
+``` running fairseq-train
+$ bash run.sh -g [GPU_ID] -m [MODE=train]
+```
+
+
+
+
+
+# Introduction (source)

 Source code for the paper:
 **Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data**
@@ -31,7 +53,7 @@ sh download.sh
 ## Train with the pre-trained model
 ```
 cd fairseq-gec
-pip install --editable
+pip install --editable .
 sh train.sh \${device_id} \${experiment_name}
 ```

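The README addition invokes `run.sh` with `-g` and `-m` flags, but `run.sh` itself is not shown in this diff. As a rough illustration only, flags of that shape could be parsed with bash's `getopts` builtin as sketched below. The function name `parse_run_args`, the variable names `GPU_ID` and `MODE`, and the default values are assumptions made for the example, not the PR's actual code.

```bash
#!/usr/bin/env bash
# Hypothetical sketch: parse the -g (GPU id) and -m (mode) flags that the
# README mentions. This is NOT the run.sh from this pull request.

parse_run_args() {
  # Assumed defaults; the real script may choose different ones.
  GPU_ID=0
  MODE=train
  local opt
  local OPTIND=1   # reset so the function can be called more than once
  while getopts "g:m:" opt; do
    case "$opt" in
      g) GPU_ID=$OPTARG ;;
      m) MODE=$OPTARG ;;
      *) echo "usage: run.sh -g GPU_ID -m MODE" >&2; return 1 ;;
    esac
  done
}

parse_run_args "$@" || exit 1
echo "gpu=${GPU_ID} mode=${MODE}"
```

Called as `bash run.sh -g 1 -m train`, a script built this way would set `GPU_ID=1` and `MODE=train`; omitting both flags leaves the assumed defaults in place.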
10 changes: 7 additions & 3 deletions config.sh
@@ -1,4 +1,4 @@
-#!/usr/bin/env bash
+#!/usr/bin/bash

 device=0
 if [ $# -ge 1 ]; then
@@ -10,8 +10,12 @@ if [ $# -ge 2 ]; then
 exp=$2
 fi

-DATA='data' # input dir
-OUT='out' # output dir
+exp='_small'
+OUT='small' # output dir
+DATA='datasets/preprocessed/conll05' # input dir
+
+#exp=''
+#OUT=srl

 DATA_BIN=$OUT/data_bin
 DATA_RAW=$OUT/data_raw
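One detail in the `config.sh` change: `exp='_small'` is assigned after the `if [ $# -ge 2 ]` block that reads `$2`, so a second positional argument would be overwritten. Switching between the small and full settings also means editing the file (the commented-out `#exp=''` / `#OUT=srl` lines). A possible alternative, sketched here under assumptions, keeps the PR's values as defaults while letting arguments and environment variables override them. The function name `load_config` is hypothetical, and treating the first argument as the device id is inferred from the visible `device=0` / `if [ $# -ge 1 ]` lines rather than confirmed by the diff.

```bash
#!/usr/bin/env bash
# Hypothetical sketch: defaults from this PR, overridable without editing
# the file. Not the config.sh from this pull request.

load_config() {
  device=${1:-0}      # first argument: GPU id (assumed), default 0
  exp=${2-_small}     # second argument: experiment suffix; '' is respected
  # OUT and DATA fall back to the PR's values unless already set.
  OUT=${OUT:-small}
  DATA=${DATA:-datasets/preprocessed/conll05}
  DATA_BIN=$OUT/data_bin
  DATA_RAW=$OUT/data_raw
}

load_config "$@"
echo "device=$device exp=$exp OUT=$OUT DATA_BIN=$DATA_BIN"
```

The `${2-_small}` form (without a colon) substitutes the default only when the argument is absent, so passing an explicitly empty string gives `exp=''`, matching the commented-out full-data setting. Exporting `OUT=srl` before sourcing would then switch the output directory without touching the file.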