TopoGen3D: Topology-Aware Open3D Code Generation from 2D Images

TopoGen3D is an innovative pipeline that transforms 2D images of shapes (such as a torus or a Klein bottle) into fully-functional Open3D code. By leveraging a fine-tuned CLIP model and IBM’s Granite-3B-Code-Instruct-2k model, TopoGen3D automatically identifies the best matching shape based on accuracy scores and then generates corresponding Open3D code. The resulting code can be rendered in a web environment or visualized using PyVista.

Overview

TopoGen3D automates the process of generating 3D models from 2D images by combining advanced deep learning techniques with state-of-the-art 3D rendering tools. The pipeline consists of three main steps:

2D Image Input: Provide an image of a shape.
Shape Detection & Scoring: A fine-tuned CLIP model evaluates and scores potential shapes.
3D Code Generation: The highest-scoring shape description is fed into IBM Granite-3B-Code-Instruct-2k to generate Open3D code.

Features

Topology-Aware Recognition: Accurately identifies complex topological shapes such as torus and Klein bottles.
Automated Code Generation: Transforms shape descriptions into executable Open3D code using a powerful IBM model.
Versatile Visualization: Render the generated 3D model in a web application or using PyVista for interactive visualization.
Modular Design: Easily integrates with various components such as CLIP, IBM Granite, and Open3D.

How It Works

Input Image:
A 2D image of a shape is provided as input (e.g., a torus or a Klein bottle).
Shape Scoring with CLIP:
A fine-tuned CLIP model processes the image and generates accuracy scores for a range of possible shapes based on a predefined JSON knowledge base.
- All possible shapes and their confidence scores are printed.
- The shape with the highest score is selected.
Code Generation with IBM Granite:
The best scoring shape description is saved to a file and then fed to the IBM Granite-3B-Code-Instruct-2k model, which generates Open3D code to create a 3D model of the identified shape.
Visualization:
The generated Open3D code can be rendered either in a web environment or visualized using PyVista.

Requirements

Python 3.8+
Transformers
Open3D
PyVista
Pillow
Other dependencies as listed in requirements.txt (if provided)

Installation

Clone the Repository:

git clone https://github.com/yourusername/TopGen.git
cd TopGen

Create and Activate a Virtual Environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install Dependencies
```
pip install -r requirements.txt
```

Usage

Step 1: Generate Image Description

Input:

Run the script to process a 2D image and generate a shape description:

python main.py

output:

Detected shapes:
torus (0.38): A doughnut-shaped surface of revolution, genus 1, with a hole in the center.
mobius strip (0.25): A surface with only one side and one boundary, non-orientable.

Best shape:
torus (0.38): A doughnut-shaped surface of revolution, genus 1, with a hole in the center.

This script calls clip.py with the image path, which detects shapes and writes the highest confidence description to description.txt.

Step 2: Generate 3D Open3D Code

The same main.py then calls granite_code.py, which reads the description and uses IBM Granite to generate the Open3D code.

output:

import open3d as o3d
import numpy as np

# Create a doughnut-shaped surface of revolution
doughnut = o3d.geometry.DoughnutSurfaceMesh(
    radius=1.0,
    thickness=0.1,
    resolution=100,
    genus=1
)

# Create a hole in the center
hole = o3d.geometry.TriangleMesh.create_sphere(
    radius=0.05,
    resolution=10
)

# Create a mesh by subtracting the hole from the doughnut
mesh = doughnut.create_mesh_from_triangle_mesh(hole)

# Visualize the mesh
o3d.visualization.draw_geometries([mesh])

Step 3: Visualize the Generated Model

After code generation, use a visualization script (e.g., visualization.py) to render the 3D model:

python visualization.py

Contributors

License

This project is licensed under the MIT License.

Feel free to modify this README to suit your project's specific needs or to add any additional instructions or details about your workflow. Enjoy generating 3D models from 2D images with TopoGen3D!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
__pycache__		__pycache__
fine_tuned_model/runs		fine_tuned_model/runs
images		images
output		output
README.md		README.md
blip.py		blip.py
clip.py		clip.py
clip_description.txt		clip_description.txt
demo.gif		demo.gif
export_mesh.py		export_mesh.py
granite_code.py		granite_code.py
granite_code_finetune.py		granite_code_finetune.py
main.py		main.py
requirements.txt		requirements.txt
sample_3d_model.jpeg		sample_3d_model.jpeg
sample_output.jpeg		sample_output.jpeg
shape_knowledge.json		shape_knowledge.json
test.py		test.py
test2.py		test2.py
train_data.json		train_data.json
visualization.py		visualization.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TopoGen3D: Topology-Aware Open3D Code Generation from 2D Images

Table of Contents

Overview

Features

How It Works

Requirements

Installation

Usage

Step 1: Generate Image Description

Input:

Step 2: Generate 3D Open3D Code

output:

Step 3: Visualize the Generated Model

Contributors

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

vijaysr4/IBM_Granite_Open3d_code_gen_for_TDA_assistance

Folders and files

Latest commit

History

Repository files navigation

TopoGen3D: Topology-Aware Open3D Code Generation from 2D Images

Table of Contents

Overview

Features

How It Works

Requirements

Installation

Usage

Step 1: Generate Image Description

Input:

Step 2: Generate 3D Open3D Code

output:

Step 3: Visualize the Generated Model

Contributors

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages