You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+129-10
Original file line number
Diff line number
Diff line change
@@ -9,37 +9,156 @@ This is the official codebase of the paper
9
9
10
10
## Overview
11
11
12
+
*GeomEtry-Aware Relational Graph Neural Network (**GearNet**)* is a simple yet effective structure-based protein encoder.
13
+
It encodes spatial information by adding different types of sequential or structural edges and then performs relational message passing on protein residue graphs, which can be further enhanced by an edge message passing mechanism.
14
+
Though conceptually simple, GearNet augmented with edge message passing can achieve very strong performance on several benchmarks in a supervised setting.
15
+
16
+

17
+
18
+
Five different geometric self-supervised learning methods based on protein structures are further proposed to pretrain the encoder, including **Multivew Contrast**, **Residue Type Prediction**, **Distance Prediction**, **Angle Prediction**, **Dihedral Prediction**.
19
+
Through extensively benchmarking these pretraining techniques on diverse
20
+
downstream tasks, we set up a solid starting point for pretraining protein structure representations.
21
+
22
+

23
+
12
24
This codebase is based on PyTorch and [TorchDrug] ([TorchProtein](https://torchprotein.ai)). It supports training and inference
Here are the results of GearNet w/ and w/o pretraining on standard benchmark datasets. All the results are obtained with 4 A100 GPUs (40GB). Note results may be slightly different if the model is trained with 1 GPU and/or a smaller batch size.
93
+
More detailed results are listed in the paper.
94
+
95
+
<table>
96
+
<tr>
97
+
<th>Method</th>
98
+
<th>EC</th>
99
+
<th>GO-BP</th>
100
+
<th>GO-MF</th>
101
+
<th>GO-CC</th>
102
+
</tr>
103
+
<tr>
104
+
<th>GearNet</th>
105
+
<td>0.730</td>
106
+
<td>0.356</td>
107
+
<td>0.503</td>
108
+
<td>0.414</td>
109
+
</tr>
110
+
<tr>
111
+
<th>GearNet-Edge</th>
112
+
<td>0.810</td>
113
+
<td>0.403</td>
114
+
<td>0.580</td>
115
+
<td>0.450</td>
116
+
</tr>
117
+
<tr>
118
+
<th>Multiview Contrast</th>
119
+
<td>0.874</td>
120
+
<td>0.490</td>
121
+
<td>0.654</td>
122
+
<td>0.488</td>
123
+
</tr>
124
+
<tr>
125
+
<th>Residue Type Prediction</th>
126
+
<td>0.843</td>
127
+
<td>0.430</td>
128
+
<td>0.604</td>
129
+
<td>0.465</td>
130
+
</tr>
131
+
<tr>
132
+
<th>Distance Prediction</th>
133
+
<td>0.839</td>
134
+
<td>0.448</td>
135
+
<td>0.616</td>
136
+
<td>0.464</td>
137
+
</tr>
138
+
<tr>
139
+
<th>Angle Prediction</th>
140
+
<td>0.853</td>
141
+
<td>0.458</td>
142
+
<td>0.625</td>
143
+
<td>0.473</td>
144
+
</tr>
145
+
<tr>
146
+
<th>Dihedral Prediction</th>
147
+
<td>0.859</td>
148
+
<td>0.458</td>
149
+
<td>0.626</td>
150
+
<td>0.465</td>
151
+
</tr>
152
+
</table>
153
+
154
+
## Citation
155
+
If you find this codebase useful in your research, please cite the following paper.
156
+
157
+
```bibtex
158
+
@article{zhang2022protein,
159
+
title={Protein representation learning by geometric structure pretraining},
160
+
author={Zhang, Zuobai and Xu, Minghao and Jamasb, Arian and Chenthamarakshan, Vijil and Lozano, Aurelie and Das, Payel and Tang, Jian},
0 commit comments