Flowchart-to-Mermaid Transformer

Thanks to Joel Timana for his support during the development of this project.

This project converts flowchart images into Mermaid code using a vision-language model Llama 3.2 Vision

It supports both fine-tuned (FT - LoRA adapters) and non-fine-tuned (NFT) model evaluation.

Thanks to Unsloth AI for their packages that make possible this project.

🚀 Quick Start

Install dependencies:
```
pip install -r requirements.txt
```
Run inference:

I strongly recommend use a GPU, otherwise the time that you are going to need is quite a lot.

```bash
python inferenceNFT.py
# or for FT model
python inferenceFT.py
```

Evaluate and compare:

python metric.py
python plot_comparison.py

📊 Benchmarks

The fine-tune model was tested on ussing differents algorithms of string comparation. The question relies is how the most popular LLM's are tested on the well known benchmarks!

Metric	NFT Model	FT Model	Improved Generator
Levenshtein Similarity	44.3%	63.6%	70.1%
Jaccard Similarity	52.7%	68.2%	74.5%
Cosine Similarity	60.1%	75.3%	80.2%
Sequence Matcher	47.8%	66.7%	72.9%
Hamming Similarity	10.2%	15.4%	18.7%
Jaro Similarity	55.6%	70.8%	77.0%

These are average scores over the test set. See metric.py for details.

🖼️ Example Output

Input:

Generated Mermaid:

flowchart TD
    A((Start)) --> B["Load Application"]
    B --> C[/"User Input Required"/]
    C --> D{"Valid Input?"}
    D -->|Yes| E["Process Request"]
    D -->|No| F["Show Error Message"]
    E --> G[/"Display Results"/]
    F --> C
    G --> H((End))

🛠️ Project Structure

inferenceNFT.py / inferenceFT.py — Run inference with NFT/FT models
metric.py — String similarity metrics
plot_comparison.py — Visualize and compare results
utilsFunctions.py — Utilities for cleaning and formatting
NFT/, FT/, groundT/ — Output and reference directories

Base Model

Llama Vision 3.2 Instruct 11B

Model Weights

LoRA adapters are available on Hugging Face.
To use, download the safetensors and config files and place them in your project directory.

Data set used

All togheter forms more than 100000 flowcharts, using Data Augmentation we have that.

Training Process

Using the state-of-the-art Lora.

📑 Citation

If you use this project, please cite:

@misc{flowchart2mermaid2025,
  author = {Munoz Jorge},
  title = {Flowchart-to-Mermaid Transformer},
  year = {2025},
  url = {https://github.com/jorgemunozl/vllm}
}

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
docs		docs
reports		reports
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Flowchart-to-Mermaid Transformer

🚀 Quick Start

📊 Benchmarks

🖼️ Example Output

🛠️ Project Structure

Base Model

Model Weights

Data set used

Training Process

📑 Citation

About

Uh oh!

Releases

Packages

Languages

License

jorgemunozl/Finetunning-Llama-Vision-11b

Folders and files

Latest commit

History

Repository files navigation

Flowchart-to-Mermaid Transformer

🚀 Quick Start

📊 Benchmarks

🖼️ Example Output

🛠️ Project Structure

Base Model

Model Weights

Data set used

Training Process

📑 Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages