
Range-aware Graph Positional Encoding via High-order Pretraining: Theory and Practice


Publication

Paper: https://openreview.net/forum?id=tN0n5BuLEI

Overview

This repository contains the implementation of HOPE-WavePE (High-Order Permutation-Equivariant Autoencoder for Wavelet Positional Encoding), a novel graph representation learning method. HOPE-WavePE extends Wavelet Positional Encoding (WavePE) with a multi-resolution autoencoder that captures both local and global graph structure.
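
To make the wavelet component concrete, here is a minimal NumPy sketch of spectral graph wavelets built from heat-kernel filters exp(-s·λ) on the normalized Laplacian. It only illustrates the multi-resolution idea behind WavePE; the function and variable names are ours, not the repository's API:

    import numpy as np

    def wavelet_tensors(adj, scales=(1.0, 2.0, 4.0)):
        """Heat-kernel graph wavelets W_s = U exp(-s * Lambda) U^T.

        Returns one n x n wavelet matrix per scale; small scales capture
        local structure, large scales capture more global structure.
        """
        deg = adj.sum(axis=1)
        d_inv_sqrt = np.where(deg > 0, deg, np.inf) ** -0.5
        # Symmetric normalized Laplacian: L = I - D^{-1/2} A D^{-1/2}
        lap = np.eye(len(adj)) - d_inv_sqrt[:, None] * adj * d_inv_sqrt[None, :]
        eigval, eigvec = np.linalg.eigh(lap)
        return np.stack(
            [eigvec @ np.diag(np.exp(-s * eigval)) @ eigvec.T for s in scales]
        )

    # Toy example: a 4-node path graph
    A = np.array([[0., 1., 0., 0.],
                  [1., 0., 1., 0.],
                  [0., 1., 0., 1.],
                  [0., 0., 1., 0.]])
    print(wavelet_tensors(A).shape)  # (3, 4, 4): one matrix per scale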

This project is designed for unsupervised pretraining on graph datasets, making it adaptable to a wide range of graph-based machine learning tasks, including:

  • Molecular property prediction
  • Materials discovery
  • Social network analysis
  • Bioinformatics

Features

  • Graph-agnostic pretraining: Learns structural representations independent of node/edge features.
  • Wavelet-based encoding: Captures multi-resolution graph structures.
  • High-order autoencoder: Preserves long-range dependencies in graphs.
  • Flexible training pipeline: Supports multiple datasets (e.g., MoleculeNet, TU Dataset, ZINC, LRGB).
  • Supports Graph Transformers & MPNNs: Integrates with GNNs for downstream tasks (see the sketch below).
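
As a sketch of the last point: per-node positional encodings produced by the pretrained autoencoder can be concatenated to node features before any message-passing or attention layer. The snippet below is a generic PyTorch illustration with made-up dimensions, not the repository's API:

    import torch
    import torch.nn as nn

    class PEAugmentedLayer(nn.Module):
        """Concatenate node features with per-node positional encodings,
        then project; a downstream GNN or Transformer would follow."""
        def __init__(self, feat_dim, pe_dim, hidden=64):
            super().__init__()
            self.proj = nn.Linear(feat_dim + pe_dim, hidden)

        def forward(self, x, pe):
            return torch.relu(self.proj(torch.cat([x, pe], dim=-1)))

    x = torch.randn(10, 16)   # 10 nodes, 16-dim features
    pe = torch.randn(10, 8)   # 8-dim positional encoding per node
    print(PEAugmentedLayer(16, 8)(x, pe).shape)  # torch.Size([10, 64])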

Installation

Ensure you have Python 3.8+ installed.

  1. Clone this repository:

    git clone https://github.com/HySonLab/WaveletPE.git
    cd WaveletPE
  2. Create a virtual environment (optional but recommended):

    python -m venv venv
    source venv/bin/activate  # On Windows, use: venv\Scripts\activate
  3. Install dependencies:

    pip install -r requirements.txt

Usage

1. Pretraining the HOPE-WavePE Autoencoder

Pretrain the autoencoder on a dataset (e.g., MolPCBA) with:

python train.py --dataset molpcba --epochs 100

2. Fine-tuning on a Downstream Task

After pretraining, you can fine-tune the model on a specific graph-based task. For example, to train on the ZINC molecular dataset:

python train_zinc.py --pretrained_model checkpoints/hope_wavepe.pth
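
To sanity-check a checkpoint before fine-tuning, a generic PyTorch snippet such as the following can be used; the state-dict layout shown is an assumption, not a documented format:

    import torch

    # Load the pretrained weights (path as in the command above).
    state = torch.load("checkpoints/hope_wavepe.pth", map_location="cpu")

    # Print a few parameter names and shapes to verify the checkpoint loaded.
    if isinstance(state, dict):
        for name, value in list(state.items())[:5]:
            print(name, getattr(value, "shape", type(value).__name__))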

3. Running Inference

To run inference using a pretrained model:

python example.py --model checkpoints/hope_wavepe.pth --input data/sample_graph.json

4. Evaluating the Model

Evaluate the pretrained model on a benchmark dataset (e.g., TU Dataset, LRGB):

python downstream.py --dataset tu-proteins --model checkpoints/hope_wavepe.pth

Directory Structure

wavepe-master/
├── encoder/              # Implementation of HOPE-WavePE autoencoder
├── layer/                # Graph layers and model components
├── net/                  # Neural network model definitions
├── data.py               # Graph data processing
├── train.py              # Training script for pretraining
├── train_zinc.py         # Training script for ZINC dataset
├── downstream.py         # Fine-tuning on downstream tasks
├── transform.py          # Graph transformations and preprocessing
├── utils/                # Helper functions
├── logs/                 # Training logs
├── checkpoints/          # Model checkpoints
├── example.py            # Running inference
├── config/               # Configuration files
├── scripts/              # Additional scripts for automation
├── requirements.txt      # Dependencies
└── README.md             # This documentation

Citation

If you use this work in your research, please cite the corresponding paper:

@inproceedings{nguyen2025rangeaware,
  title     = {Range-aware Positional Encoding via High-order Pretraining: Theory and Practice},
  author    = {Viet Anh Nguyen and Nhat Khang Ngo and Hy Truong Son},
  booktitle = {NeurIPS 2024 Workshop on Symmetry and Geometry in Neural Representations},
  year      = {2025},
  url       = {https://openreview.net/forum?id=tN0n5BuLEI}
}

Contact

For questions or issues, please reach out to Truong-Son Hy at [email protected].