
Releases: vulnerability-lookup/VulnTrain

Release 1.2.0

11 Mar 07:31
v1.2.0
d405b7d

Changes

  • Dataset generation: CVSS scores are now extracted from GitHub and PySec security advisories.
  • Dataset generation: CVSS scores, CPEs, the title, and the description (summary) are now extracted from CSAF documents.
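
The CSAF extraction can be sketched as follows. This is a minimal illustration assuming a CSAF 2.0 JSON document already parsed into a dict; the field names follow the CSAF 2.0 schema, but the helper name `extract_from_csaf` is illustrative, not VulnTrain's actual function:

```python
def extract_from_csaf(doc: dict) -> dict:
    """Pull the title, summary, CVSS base scores, and CPEs out of a
    CSAF 2.0 document (illustrative sketch, not VulnTrain's code)."""
    out = {
        "title": doc.get("document", {}).get("title"),
        "summary": None,
        "cvss": [],
        "cpe": [],
    }
    # The summary is a document-level note with category "summary".
    for note in doc.get("document", {}).get("notes", []):
        if note.get("category") == "summary":
            out["summary"] = note.get("text")
    # Each vulnerability may carry one or more score objects.
    for vuln in doc.get("vulnerabilities", []):
        for score in vuln.get("scores", []):
            cvss = score.get("cvss_v3") or score.get("cvss_v2")
            if cvss and "baseScore" in cvss:
                out["cvss"].append(cvss["baseScore"])
    # CPEs live in the product tree's full_product_names entries.
    for fpn in doc.get("product_tree", {}).get("full_product_names", []):
        cpe = fpn.get("product_identification_helper", {}).get("cpe")
        if cpe:
            out["cpe"].append(cpe)
    return out
```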

Release 1.1.0

27 Feb 07:44
v1.1.0
c94d3d0

News

  • Trainers: Support for roberta-base for the text classifier, with improved
    settings for TrainingArguments.
  • Validators: Added a validator for severity classification.

Release 1.0.0

25 Feb 07:40
v1.0.0
3f11a97

News

  • Introduced a new trainer to automatically classify vulnerabilities based on their descriptions,
    even when CVSS scores are unavailable.
  • Added CVSS parsing to the dataset generation script.
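
Where CVSS scores are available, training labels for such a classifier can be derived with the standard qualitative rating scale from the CVSS v3.1 specification. A minimal sketch (the function name is illustrative, not VulnTrain's actual code):

```python
def cvss_to_severity(base_score: float) -> str:
    """Map a CVSS v3.x base score to its qualitative severity rating,
    per the CVSS v3.1 specification (None / Low / Medium / High / Critical)."""
    if not 0.0 <= base_score <= 10.0:
        raise ValueError(f"CVSS base score out of range: {base_score}")
    if base_score == 0.0:
        return "None"
    if base_score <= 3.9:
        return "Low"
    if base_score <= 6.9:
        return "Medium"
    if base_score <= 8.9:
        return "High"
    return "Critical"
```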

Changes

  • Refactored the project structure for better organization.
  • Improved CPE parsing.
  • Enhanced the dataset generation script.
  • Optimized the trainer for text generation on vulnerability descriptions.
  • Improved command-line argument parsing.
  • Improved the process of pushing the tokenizer and trainer to Hugging Face.

Release 0.5.1

21 Feb 23:02
v0.5.1
2a250c1

Fixed configuration module name.

Release 0.5.0

21 Feb 22:43
v0.5.0
6aaa31f

Added support for a configuration file.

Release 0.4.0

21 Feb 17:19
v0.4.0
d922d3a

The dataset generation step now uses data from GitHub Advisories, and the VulnExtractor cleans the summary and details fields.
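
A cleaning step of this kind typically strips markup remnants and normalizes whitespace before the text is used for training. A plausible sketch (hypothetical helper, not the actual VulnExtractor code):

```python
import re

def clean_text(text: str) -> str:
    """Normalize advisory text: drop inline HTML tags and collapse
    runs of whitespace (illustrative sketch of a cleaning step)."""
    text = re.sub(r"<[^>]+>", " ", text)  # strip HTML tag remnants
    text = re.sub(r"\s+", " ", text)      # collapse whitespace runs
    return text.strip()
```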

Release 0.3.0

20 Feb 22:31
v0.3.0
35918af

News

Dataset generation: allow specifying a commit message when uploading to Hugging Face.

Validation: Added a simple validation script for a model optimized for text generation. The script
can pull a model and send tasks to it via a Pipeline.

Changes

Training step: added a choice of models: gpt2, distilgpt2, meta-llama/Llama-3.3-70B-Instruct, and distilbert-base-uncased.

Various improvements to command-line parsing.

Release 0.2.0

20 Feb 06:55
v0.2.0
e169baf

News

  • Added a trainer.
  • Experimenting with distilbert-base-uncased (AutoModelForMaskedLM) and gpt2 (AutoModelForCausalLM),
    with the goal of generating text.

Changes

  • Various improvements to the dataset generator; added a command-line parser.

Release 0.1.0

19 Feb 15:27
v0.1.0
bb53ba1

First release, with upload of datasets to Hugging Face.

Datasets are built from NIST data, with enrichment from FKIE and vulnrichment.