# Professor Forcing paper code
https://arxiv.org/pdf/1610.09038.pdf
- Generator and discriminator implemented
- Char-level PTB tested
- Jupyter Notebook with results
- Word-level tests
- Sequential MNIST (also mentioned in the paper)
- Publish pre-trained model
## Requirements
- PyTorch 0.4 (older versions will not work)
- TensorboardX for tensorboard usage (optional)
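A pip-based setup along these lines should work (the exact version pin is illustrative):

```
pip install torch==0.4.1 tensorboardX
```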
## Data
No special preprocessing is required; just create a file with tokens separated by spaces. The original paper uses char-level language modelling, so in that case you should create a single file with content like 'h e l l o _ w o r l d !'
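As a minimal sketch of such preprocessing (the file names are hypothetical, and newlines are simply dropped):

```python
# Read raw text and emit space-separated character tokens,
# replacing real spaces with '_' as in the example above.
# "input.txt" and "train.txt" are hypothetical file names.
with open("input.txt", encoding="utf-8") as f:
    text = f.read()

tokens = " ".join("_" if ch == " " else ch for ch in text if ch != "\n")

with open("train.txt", "w", encoding="utf-8") as f:
    f.write(tokens)
```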
## Training
```
python train.py
```
Useful command-line arguments:
- `-cuda` for GPU.
- `-adversarial` for training both generator and discriminator (otherwise `model.generator` will not be initialized and trained).
- `-data_path` and `-vocab_path`. The vocab file is created if not provided and saved in the `vocab.pt` file.
- `-save_path`. Path to save the model. The model is saved after each epoch, and info about the model itself and its results is appended to its name.

For more parameters, consult `opts.py`. An example training run is shown below.
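For instance, an adversarial training run on GPU might look like this (the file paths are illustrative):

```
python train.py -data_path train.txt -vocab_path vocab.pt -save_path model.pt -adversarial -cuda
```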
## Sampling
```
python sampler.py
```
For sampling, the `-checkpoint`, `-data_path`, and `-vocab_path` arguments must be provided.
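An illustrative invocation (the paths are hypothetical and should match those used during training):

```
python sampler.py -checkpoint model.pt -data_path train.txt -vocab_path vocab.pt
```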