Skip to content
This repository was archived by the owner on May 28, 2025. It is now read-only.

Commit ab425e4

Browse files
authored
update license and documentation (#22)
* add license and contact info Signed-off-by: fred@pegasus <[email protected]> * update license Signed-off-by: fred@pegasus <[email protected]> * add link to paper Signed-off-by: fred@pegasus <[email protected]>
1 parent 38cb045 commit ab425e4

File tree

2 files changed

+9
-6
lines changed

2 files changed

+9
-6
lines changed

LICENSE

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
BSD 3-Clause License
22

3-
Copyright (c) 2020, Frederic Zhang
3+
Copyright (c) 2021, Frederic Zhang
44
All rights reserved.
55

66
Redistribution and use in source and binary forms, with or without

README.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -2,9 +2,9 @@
22
<img src="assets/teaser.gif" align="justify" width="500">
33

44
This repository contains the official PyTorch implementation for the paper
5-
> Frederic Z. Zhang, Dylan Campbell and Stephen Gould. _Efficient Two-Stage Detection of Human&ndash;Object Interactions with a Novel Unary&ndash;Pairwise Transformer._
5+
> Frederic Z. Zhang, Dylan Campbell and Stephen Gould. _Efficient Two-Stage Detection of Human&ndash;Object Interactions with a Novel Unary&ndash;Pairwise Transformer._ arXiv preprint arXiv:2112.01838.
66
7-
\[[__project page__](https://fredzzhang.com/unary-pairwise-transformers)\] \[[__preprint__]()\]
7+
\[[__project page__](https://fredzzhang.com/unary-pairwise-transformers)\] \[[__preprint__](https://arxiv.org/pdf/2112.01838.pdf)\]
88

99
## Abstract
1010
> ...<br/>However, the success of such one-stage HOI detectors can largely be attributed to the representation power of transformers. We discovered that when equipped with the same transformer, their two-stage counterparts can be more performant and memory-efficient, while taking a fraction of the time to train. In this work, we propose the Unary&ndash;Pairwise Transformer, a two-stage detector that exploits unary and pairwise representa-tions for HOIs. We observe that the unary and pairwise parts of our transformer network specialise, with the former preferentially increasing the scores of positive examples and the latter decreasing the scores of negative examples. We evaluate our method on the HICO-DET and V-COCO datasets, and significantly outperform state-of-the-art approaches. At inference time, our model with ResNet50 approaches real-time performance on a single GPU.
@@ -29,6 +29,8 @@ We provide weights for UPT models pre-trained on HICO-DET and V-COCO for potenti
2929
|UPT-R101-DC5|V-COCO|`61.3`|`67.1`|`0.131s`|[weights](https://drive.google.com/file/d/17ivP1npCR6jkxOQ3mfN83hrQfeVkiDC9/view?usp=sharing)|[weights](https://drive.google.com/file/d/1uenwkJ_0dSb_nb4HAlMUZ-bMjxD55XTR/view?usp=sharing)|
3030

3131
The inference speed was benchmarked on a GeForce RTX 3090. Note that weights of the UPT model include those of the detector (DETR). You do not need to download the DETR weights, unless you want to train the UPT model from scratch. Training UPT-R50 with 8 GeForce GTX TITAN X GPUs takes around `5` hours on HICO-DET and `40` minutes on V-COCO, almost a tenth of the time compared to other one-stage models such as [QPIC](https://github.com/hitachi-rd-cv/qpic).
32+
## Contact
33+
For general inquiries regarding the paper and code, please post them in [Discussions](https://github.com/fredzzhang/upt/discussions). For bug reports and feature requests, please post them in [Issues](https://github.com/fredzzhang/upt/issues). You can also contact me at [email protected].
3234
## Prerequisites
3335
1. Install the lightweight deep learning library [Pocket](https://github.com/fredzzhang/pocket). The recommended PyTorch version is 1.9.0.
3436
2. Download the repository and the submodules.
@@ -59,7 +61,8 @@ git submodule update
5961
cd /path/to/upt/vcoco
6062
ln -s /path/to/coco ./mscoco2014
6163
```
62-
64+
## License
65+
UPT is released under the [BSD-3-Clause License](./LICENSE).
6366
## Inference
6467
We have implemented inference utilities with different visualisation options. Provided you have downloaded the model weights to `checkpoints/`, run the following command to visualise detected instances together with the attention maps from the cooperative and competitive layers. Use the flag `--index` to select images, and `--box-score-thresh` to modify the filtering threshold on object boxes.
6568
```bash
@@ -93,7 +96,7 @@ If you find our work useful for your research, please consider citing us:
9396
@article{zhang2021upt,
9497
author = {Frederic Z. Zhang and Dylan Campbell and Stephen Gould},
9598
title = {Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer},
96-
journal = {arXiv preprint},
99+
journal = {arXiv preprint arXiv:2112.01838},
97100
year = {2021}
98101
}
99102
@@ -105,4 +108,4 @@ If you find our work useful for your research, please consider citing us:
105108
year = {2021},
106109
pages = {13319-13327}
107110
}
108-
```
111+
```

0 commit comments

Comments
 (0)