Skip to content

Commit 3a99724

Browse files
authored
Create README.md
1 parent 02f69d5 commit 3a99724

File tree

1 file changed

+15
-0
lines changed

1 file changed

+15
-0
lines changed

Diff for: README.md

+15
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,15 @@
1+
# xlnet.jl
2+
3+
This repository contains the [Knet](https://github.com/denizyuret/Knet.jl) implementation of XLNet Pretraining model. (Z. Yang Et al.) Original implementation can be found [here](https://github.com/zihangdai/xlnet).
4+
5+
## Usage
6+
7+
Two main models are exported from ```XLNetModel``` and ```XLNetClassifier```. ```XLNetClassifier``` model can be constructed for arbitary number of classes, and also easy to use interface is provided for saving and loading model weights.
8+
Only classification downstream task is implemented for know but ```XLNetModel``` can be used for text modeling, but you need to specify special tokens carefully.
9+
10+
Detailed usage is demonstrated in the [notebook](xlnet_sentiment_classification.ipynb) for IMDB sentiment classification dataset.
11+
12+
## Todo
13+
14+
Models for other downstream tasks (Question-Answering etc.) will be be implemented.
15+

0 commit comments

Comments
 (0)