Unofficial PyTorch implementation of Model-Agnostic Meta-Learning (MAML) for Fast Adaptation of Deep Networks
- python 3.6+
- torch 1.2+
- torchvision 0.2+
- dataset (I downloaded the mini-ImageNet images from the following GitHub link.)
- tqdm 4.32+
I designed the MAML model in sklearn style. Note that the dataset generator is copied from the dragen1860 GitHub repo. The example below shows how to train and evaluate the model.
import os
import torch
import torch.nn as nn
from dragen_imagenet import MiniImagenet
from base_models.conv4 import conv4
from maml import maml
# Set parameters.
n, k = 5, 1                # n-way classification with k support examples per class
num_inner_loop = 5         # inner-loop adaptation steps during meta-training
num_inner_loop_test = 10   # inner-loop adaptation steps at test time
inner_lr = 1e-2            # inner-loop (task-level) learning rate
outer_lr = 1e-4            # outer-loop (meta) learning rate
num_batch = 4              # tasks per meta-update (alternatively 2)
max_iter = 60000           # maximum number of meta-training iterations
use_cuda = True
# Define model. You can use any neural network-based model.
model = conv4(image_size=84, num_channels=3, num_classes=n,
              hidden_dim=32, use_dropout=False)
# Define loss function.
loss_f = torch.nn.functional.cross_entropy
# Define MAML.
maml_model = maml(n, k, model, loss_f, num_inner_loop, inner_lr, outer_lr, use_cuda)
# Load training dataset.
tr_dataset = MiniImagenet(batchsz=max_iter // 10)
# Fit the model according to the given dataset.
maml_model.fit(tr_dataset, num_batch)
# Load test dataset.
ts_dataset = MiniImagenet(batchsz=600, mode="test")
maml_model.eval()
# Predict and calculate accuracy.
acc = maml_model.prediction_acc(ts_dataset, num_inner_loop_test)
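For context, here is a minimal, self-contained sketch of the MAML meta-update that a single fit step performs conceptually. The toy linear model and random task data are hypothetical; the snippet only illustrates how num_inner_loop, inner_lr, outer_lr, and num_batch interact, not the actual code in maml.py.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
n_way, feat_dim = 5, 84
inner_lr, outer_lr = 1e-2, 1e-4
num_inner_loop, num_batch = 5, 4

# Meta-parameters of a toy linear classifier (stand-ins for the conv4 weights).
W = (0.01 * torch.randn(n_way, feat_dim)).requires_grad_()
b = torch.zeros(n_way, requires_grad=True)
meta_opt = torch.optim.Adam([W, b], lr=outer_lr)

def sample_task():
    # Hypothetical 5-way 1-shot task with 15 query examples per class.
    xs, ys = torch.randn(n_way, feat_dim), torch.arange(n_way)
    xq, yq = torch.randn(n_way * 15, feat_dim), torch.arange(n_way).repeat_interleave(15)
    return xs, ys, xq, yq

# One meta-update over num_batch tasks.
meta_opt.zero_grad()
for _ in range(num_batch):
    xs, ys, xq, yq = sample_task()
    fast_W, fast_b = W, b
    # Inner loop: adapt fast weights on the support set with the task-level learning rate.
    for _ in range(num_inner_loop):
        support_loss = F.cross_entropy(xs @ fast_W.t() + fast_b, ys)
        gW, gb = torch.autograd.grad(support_loss, (fast_W, fast_b), create_graph=True)
        fast_W, fast_b = fast_W - inner_lr * gW, fast_b - inner_lr * gb
    # Outer loop: the query loss of the adapted weights backpropagates to W and b.
    query_loss = F.cross_entropy(xq @ fast_W.t() + fast_b, yq) / num_batch
    query_loss.backward()
meta_opt.step()
```

Because the inner-loop updates are built with create_graph=True, the query-set loss backpropagates through them to the meta-parameters (second-order MAML).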
Almost No Inner Loop (ANIL) is a recently introduced variant of MAML (see ref. 3). It removes the inner loop for all but the last layer of the base model. My MAML implementation can be extended to ANIL by modifying lines 24 and 63 of maml.py: change line 24 to `self.weight_name = [name for name, _ in list(self.model.named_parameters())[-2:]]` and line 63 to `list(self.model.parameters())[-2:]`, and the code becomes ANIL. A sketch of the resulting inner loop is shown below.
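To make the effect of those two changes concrete, here is a minimal, self-contained sketch of an ANIL-style inner-loop step. The toy Sequential model, support data, and learning rate are hypothetical; the snippet only shows what the two modified lines select, not the repo's actual inner loop.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy stand-in for the conv4 base model: a small body plus a linear head.
model = nn.Sequential(nn.Linear(84, 64), nn.ReLU(), nn.Linear(64, 5))
inner_lr = 1e-2

# What the modified line 24 selects: the names of the last layer's weight and bias.
head_names = [name for name, _ in list(model.named_parameters())[-2:]]
print(head_names)  # -> ['2.weight', '2.bias']

# Hypothetical support set for one 5-way 1-shot task.
x_support, y_support = torch.randn(5, 84), torch.arange(5)

# What the modified line 63 adapts: only the head parameters receive fast weights;
# the body acts as a fixed (meta-learned) feature extractor during adaptation.
head_params = list(model.parameters())[-2:]
loss = F.cross_entropy(model(x_support), y_support)
grads = torch.autograd.grad(loss, head_params, create_graph=True)
fast_head = [p - inner_lr * g for p, g in zip(head_params, grads)]
```

This mirrors ANIL's behaviour: the inner loop touches only the task-specific head, while the outer loop still meta-learns all parameters.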
Note that I set outer_lr to 1e-4, whereas the original paper uses 1e-3. With my code, training is unstable when outer_lr is 1e-3; similar instability has also been reported in previous work. Reducing outer_lr makes training reliable. I did not reach the same performance as the paper, but I expect that increasing the number of iterations would close the gap.
| Model | Test Acc. (%) |
|---|---|
| MAML (paper) | 48.70 |
| MAML (my implementation) | 33.79 |
1. Finn, C., Abbeel, P., & Levine, S. (2017). Model-agnostic meta-learning for fast adaptation of deep networks. ICML.
2. Antoniou, A., Edwards, H., & Storkey, A. (2018). How to train your MAML. ICLR.
3. Raghu, A., Raghu, M., Bengio, S., & Vinyals, O. (2019). Rapid learning or feature reuse? Towards understanding the effectiveness of MAML. arXiv preprint arXiv:1909.09157.