XLS backend by vasdommes · Pull Request #1475 · fastmachinelearning/hls4ml

vasdommes · 2026-05-14T19:15:52Z

Description

This PR adds an XLS backend. It is based on PR #1343, with most of the code rewritten and new features added.

Google XLS is an open-source (Apache 2) High-Level Synthesis toolchain that produces an RTL (Verilog or SystemVerilog) design from a high-level description (DSLX or C++).

Adding XLS as a new hls4ml backend makes it possible to generate RTL without vendor-specific dependencies and to benefit from the developments that XLS brings to the HLS field.

XLS workflow

XLS backend performs the following transformations:

write(): hls4ml representation -> DSLX project
compile(): DSLX -> XLS IR -> Optimized XLS IR
build(): Optimized XLS IR -> (System)Verilog -> IP

The DSLX -> IR -> (System)Verilog conversion is done by XLS.
The IP is generated by Vivado. For another vendor, the IP can be generated manually from the Verilog file.

XLS features

XLS backend supports the following layers: Input, ApplyAlpha, BatchNormalization, Dense, Conv1D, DepthwiseConv1D, Conv2D, DepthwiseConv2D, Pooling1D, Pooling2D, GlobalPooling1D, GlobalPooling2D, Merge, Concatenate, Dot, Activation, HardActivation, ParametrizedActivation, PReLU, Reshape, Softmax, Transpose, TernaryTanh.

You can override default codegen options as follows:

import hls4ml

# `model` is an existing Keras model
config = hls4ml.utils.config_from_keras_model(model)
# This sets hls_model.config['XLSCodegenFlags']
hls_model = hls4ml.converters.convert_from_keras_model(
    model, hls_config=config, backend='XLS',
    xls_codegen_flags={'delay_model': 'asap7', 'generator': 'pipeline', 'use_system_verilog': False}
)

The DSLX standard library provides only a signed FixedPoint type (similar to ap_fixed), so unsigned types are not supported.
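
To illustrate what a signed-only fixed-point type implies, here is a minimal sketch in plain Python. It assumes ap_fixed-style parameters (total width, and integer bits that include the sign bit). The helpers signed_fixed_range and quantize are hypothetical names, not part of hls4ml or DSLX, and the round-to-nearest, saturating behaviour is chosen for simplicity rather than taken from the backend.

```python
from fractions import Fraction


def signed_fixed_range(width: int, integer: int):
    """Return (min, max, step) of a signed two's-complement fixed-point type.

    Assumption: ap_fixed-like semantics, where `width` is the total number
    of bits and `integer` counts the bits left of the binary point,
    sign bit included.
    """
    step = Fraction(2) ** (integer - width)  # weight of the least significant bit
    lo = -(2 ** (width - 1)) * step
    hi = (2 ** (width - 1) - 1) * step
    return lo, hi, step


def quantize(value: float, width: int, integer: int) -> float:
    """Round `value` to the nearest representable step, saturating at the limits."""
    lo, hi, step = signed_fixed_range(width, integer)
    raw = round(Fraction(value) / step)
    return float(min(max(raw * step, lo), hi))


# An unsigned 8-bit value with 4 integer bits can reach 15.9375.
# A signed type of the same width saturates at 7.9375 ...
print(quantize(15.9375, width=8, integer=4))
# ... so one extra bit is needed to hold the same value in a signed type.
print(quantize(15.9375, width=9, integer=5))
```

In this sketch, representing a formerly unsigned value in a signed type costs one additional bit of width.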

Currently, XLS backend implements only IOType: io_parallel. Strategy is ignored.
All operations are fully unrolled.

io_stream could be implemented via DSLX procs. @calad0i and I are going to work on that after finishing this PR.

Other changes

I made some minor changes in non-XLS code:

Fixed #1443 (test_softmax.py does not test argmax and latency implementations; latency fails), since this was needed to test all softmax implementations in XLS.
Added a hook to ModelGraph to call a custom backend.get_top_function(). This is needed for XLS because it uses the optimized XLS IR file instead of the .so library generated by other backends.
Updated docs/ir/attributes.rst. Aside from adding XLS, this commit adds some other missing layers and attributes.

Dependencies

XLS backend uses xls-python to access XLS API. It is enabled by dependency group xls:

pip install hls4ml[xls]

xls-python comes with batteries included (libxls.so and the DSLX standard library), so no separate XLS installation is required.
The code has been tested with xls-python version 0.1.9875.

Known issues

The XLS backend doesn't work with a Dense layer imported from a PyTorch Linear layer because of a shape mismatch: PyTorch stores Linear weights as (out_features, in_features), while hls4ml Dense layers use the Keras-style layout (in_features, out_features).

Repro: add XLS backend to test_pytorch_api.py/test_squeeze and run the test.

Note that the weights in this test are constant, and other backends flatten them without checking shape.
So, it is unclear whether they handle this situation correctly or not.
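
The layout difference can be shown with a small pure-Python sketch. It is only an illustration of the two conventions, not code from hls4ml or the XLS backend. The helper names transpose and dense_keras_layout and the example numbers are made up for this purpose.

```python
def transpose(matrix):
    """Swap the two axes of a nested-list matrix."""
    return [list(column) for column in zip(*matrix)]


def dense_keras_layout(x, weights, bias):
    """Compute y[j] = sum_i x[i] * weights[i][j] + bias[j].

    `weights` must use the Keras-style layout (in_features, out_features).
    """
    n_in, n_out = len(weights), len(weights[0])
    if len(x) != n_in:
        raise ValueError(f"expected {n_in} inputs, got {len(x)}")
    return [sum(x[i] * weights[i][j] for i in range(n_in)) + bias[j] for j in range(n_out)]


# PyTorch-style Linear weight: (out_features=2, in_features=3)
torch_weight = [[1, 2, 3],
                [4, 5, 6]]
bias = [10, 20]
x = [1, 0, -1]

# Transposing gives the (in_features, out_features) layout a Dense layer expects.
y = dense_keras_layout(x, transpose(torch_weight), bias)
print(y)

# Passing the PyTorch layout unchanged is caught by the shape check.
try:
    dense_keras_layout(x, torch_weight, bias)
except ValueError as err:
    print("shape mismatch:", err)
```

A backend that flattens constant weights without checking the shape would not hit the check in the last call, which is why the behaviour of the other backends remains unclear here.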

Type of change

Documentation update
New feature (non-breaking change which adds functionality)

Tests

XLS has been added to the following tests:
test_activations.py, test_auto_precision.py, test_binary_cnn.py, test_causalpadding.py, test_depthconv1d.py, test_depthconv2d.py, test_keras_api.py, test_keras_v3_api.py, test_merge.py, test_multi_dense.py, test_pointwiseconv.py, test_pooling.py, test_pytorch_api.py, test_reshape.py, test_sepconv1d.py, test_sepconv2d.py, test_softmax.py.

Test Configuration

Add xls dependency, e.g.

pip install ".[da,testing,testing-keras2,sr,optimization,xls]"
# or: pip install ".[da,testing,testing-keras3,sr,xls]"

and run tests, e.g.:

pytest test/pytest --randomly-dont-reset-seed -k XLS

Notes on performance

Some test cases are very slow for XLS (e.g. ~30 minutes vs ~10 seconds on other backends).
This happens because XLS generates (in model.compile()) and uses (in model.predict()) an optimized XLS IR code, where all loops are fully unrolled. The resulting file can be huge and thus slow for the likes of Conv2D.

During development, I made the tests faster by reducing dimensions in some of them.
For example, in test_keras_api.py/test_conv2d I replaced

input_shape = (28, 28, 3)
filters=32

with

input_shape = (14, 14, 3)
filters=8

I haven't pushed such changes, but that could be one of the ways of speeding things up.
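
As a rough way to see why the reduced dimensions help, the sketch below counts the multiply-accumulate operations that a fully unrolled Conv2D would contain for both configurations. The kernel size (3x3), stride 1 and 'valid' padding are assumptions made for this estimate, since the test's actual settings are not listed above. The helper conv2d_macs is hypothetical, and the count is only a proxy for IR size.

```python
def conv2d_macs(input_shape, filters, kernel=(3, 3)):
    """Count multiply-accumulates of a Conv2D layer.

    Assumptions: channels-last input (height, width, channels),
    stride 1 and 'valid' padding.
    """
    height, width, channels = input_shape
    k_h, k_w = kernel
    out_h, out_w = height - k_h + 1, width - k_w + 1
    return out_h * out_w * filters * k_h * k_w * channels


original = conv2d_macs((28, 28, 3), filters=32)
reduced = conv2d_macs((14, 14, 3), filters=8)
print(original)                       # 584064
print(reduced)                        # 31104
print(round(original / reduced, 1))   # 18.8
```

Under these assumptions, the reduced test unrolls roughly 19 times fewer operations.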

Checklist

I have read the guidelines for contributing.
I have commented my code, particularly in hard-to-understand areas.
I have made corresponding changes to the documentation.
My changes generate no new warnings.
I have installed and run pre-commit on the files I edited or added.
I have added tests that prove my fix is effective or that my feature works.

# Conflicts:
# docs/requirements.txt
# hls4ml/backends/__init__.py
# hls4ml/model/graph.py
# hls4ml/report/__init__.py
# test/pytest/test_activations.py
# test/pytest/test_keras_api.py
# test/pytest/test_softmax.py

See fastmachinelearning#1443.
Setting 'strategy' for Softmax layer did not affect anything, and the code always chose the default implementation=stable.
TODO: all backends fail when implementation=latency (low accuracy, probably due to overflow).

Updated by running docs/attr_doc_gen.py.
Added XLS backend and other things added to hls4ml since the last update of attributes.rst (Dec 2024):
- Libero backend
- New layers, e.g.: BipolarQuant, Cropping1D, Cropping2D
- New attributes, e.g.: n_inner and n_outer for Softmax.

Girjoaba and others added 30 commits July 16, 2025 18:04

init: xls backend (not working), implemented xls specific layer information in writer

f88faa0

test

4496252

feat: loading weights and creating infrastructure added to writer

d8b2415

feat: init writer complete

fbe2e82

feat: first end2end working test

f9b2863

fix: vector input support, change back to current directory

0952f1b

refactoring: predict function call

b32405b

feat: solo relu activation test pass

6d91666

debt cleanup: split dslx templates in multiple files

86dd94a

refactoring: simplified writer -> attribute factory written as an opt pass, merge of dense_relu written as an opt pass

0b9ad57

feat: softmax xls implementation of table lookup

a78bd1b

integrated strategies for the softmax implementation

b24e581

bugfix: softmax latency implementation

039c514

cleanup: removed junk file

dc8f5a9

feat: stable softmax 1 specific precision working

248c0f0

cleanup: removed junk

4ff3f94

feat: integrated stable softmax with all layers

9a73968

feat: softmax stable and argmax working any bit precision combination

5684d26

feat: xls utilization report parsing with vivado

ff02a02

wip: cnn

c6a9ccd

feat: prepared writer weights for CNNs

81af6b6

feat: conv2d_latency is now code generated

5c42f5c

reverted look_up tables

092a809

feat: timing report when building the project

90c3231

Fix input type for top function in DSLX

a7fbae3

This fixes two test cases in test_softmax.py (one of them still fails due to another error).
TODO: check layer.class_name == 'Input' instead of taking layers[0]?

Add fix_softmax_table_size to XLS optimization passes

9e7afe0

This fixes DSLX compilation error in test_softmax.py

Fix after merge: remove get_shape() and dim names from XLS.

a5cccbe

These things were removed in fastmachinelearning#1321

XLS: fix softmax_latency parametric types, refactor func_call generation

75d5d40

vasdommes added 28 commits May 12, 2026 15:51

XLS: implement Dot layer

3fd9243

XLS: implement Concatenate layer (1d, 2d, 3d)

460039d

XLS: fixed thresholded_relu/relu compilation error when output width/precision is different from input.

eb41b4b

XLS: implement BatchNormalization layer.

8f14f5f

Add more XLS tests

1af1021

Disable XLS in test_pytorch_api.py/test_squeeze, add TODO about shape mismatch.

74fa4a3

XLS: fix binary_tanh and ternary_tanh, allow different threshold precision for thresholded_relu

a1a13cd

XLS: convert XnorPrecisionType and ExponentPrecisionType to FixedPrecision, make assertion for TensorVariable precision less strict.

5743912

XLS: support multiple layer inputs and outputs

d4cf878

TODO: implement and test layers that actually use that, e.g. Bidirectional (multiple output) or Merge (multiple input).

XLS: custom weights and bias name

0e57ca3

XLS: fix XnorPrecisionType handling.

92de294

QKeras can generate weights of XnorPrecisionType {0, 1}, which encode values {-1, 1}. See e.g. test_binary_cnn.py
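
The {0, 1} to {-1, 1} encoding mentioned in this commit message can be illustrated with a short sketch. The function names decode_xnor and xnor are hypothetical and do not come from hls4ml or QKeras. The sketch only shows the arithmetic of the encoding, not how the backend converts the type.

```python
def decode_xnor(bit: int) -> int:
    """Map a stored bit to the value it encodes: 0 -> -1, 1 -> +1."""
    if bit not in (0, 1):
        raise ValueError(f"expected 0 or 1, got {bit}")
    return 2 * bit - 1


def xnor(a: int, b: int) -> int:
    """Bitwise XNOR of two single bits."""
    return 1 - (a ^ b)


# Multiplying two encoded values corresponds to XNOR on their bits,
# which is where the type gets its name.
for a in (0, 1):
    for b in (0, 1):
        assert decode_xnor(xnor(a, b)) == decode_xnor(a) * decode_xnor(b)

print([decode_xnor(bit) for bit in (0, 1, 1, 0)])  # [-1, 1, 1, -1]
```

Treating such a weight as a plain number without decoding would read -1 as 0, which is the kind of error this commit addresses.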

XLS BuildAttr.transform(): use raise ... from to show original exception location

2b9f104

XLSBackend.predict(): fix output conversion for bit_count > 64

22861fc

Fixes test_sepconv2d.py for XLS
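
The commit does not show how the conversion was fixed. As a hedged illustration of why widths above 64 bits need care, the sketch below decodes a raw two's-complement bit pattern of arbitrary width using Python's unbounded integers. The function raw_to_float and its parameters are hypothetical and are not the XLSBackend API.

```python
def raw_to_float(raw: int, width: int, frac_bits: int) -> float:
    """Interpret `raw` as a signed fixed-point number.

    `raw` is the unsigned bit pattern of a `width`-bit two's-complement
    value with `frac_bits` fractional bits. Python integers are unbounded,
    so this works for widths above 64, where a fixed 64-bit integer type
    would silently drop the upper bits.
    """
    if not 0 <= raw < (1 << width):
        raise ValueError("raw value does not fit in the given width")
    if raw >= 1 << (width - 1):      # sign bit set: value is negative
        raw -= 1 << width
    return raw / (1 << frac_bits)


# A 72-bit value with 8 fractional bits.
print(raw_to_float(384, width=72, frac_bits=8))               # 1.5
print(raw_to_float((1 << 72) - 256, width=72, frac_bits=8))   # -1.0
```

The division is done on exact integers, so the result is rounded only once, when it is converted to a float.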

Do not throw ModuleNotFoundError for xls when XLSBackend is not actually used.

7ead10e
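
One common way to get this behaviour is to defer the import until the optional module is first needed. The sketch below shows that general pattern with the standard library only. It is an assumption about the technique, not the code from this commit, and the LazyModule class is a hypothetical helper.

```python
import importlib


class LazyModule:
    """Import a module on first attribute access instead of at construction."""

    def __init__(self, name: str):
        self._name = name
        self._module = None

    def __getattr__(self, attr):
        # Called only for attributes that are not set on the instance itself.
        if self._module is None:
            self._module = importlib.import_module(self._name)
        return getattr(self._module, attr)


# Creating the wrapper never fails, even if the module is not installed.
optional = LazyModule("module_that_is_not_installed_in_this_sketch")

# The error appears only when the module is actually used.
try:
    optional.anything
except ModuleNotFoundError as err:
    print("raised on first use:", err)

# An installed module works transparently through the same wrapper.
print(LazyModule("math").sqrt(4.0))  # 2.0
```

With this pattern, users of other backends never trigger the import, so a missing optional dependency stays invisible to them.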

XLSBackend: get rid of os.chdir() calls

103bc18

XLSBackend: save top function to model._xls_top_function, clear on compile().

91156f6

This avoids reparsing .opt.ir file on subsequent model.predict() calls.

Merge branch 'refs/heads/main' into xls_backend

479032e

Revert unrelated cosmetic changes

137c988

Add .idea to .gitignore

8c6ecf8

XLS: fix description in build_prj.tcl

efe2aca

Add XLS to docs

08a32bf

pre-commit

2425a5f

setup.rst: add example pip install hls4ml[xls]

e60a23d

Add xls to EXTRA_DEPS in pytest/ci-template.yml

4c41b3a

cosmetics: inline variable

c94b146

XLS docs: add xls_codegen_flags example

fbe97e0

pre-commit

12b32a7

jmitrevs added the "please test" label (Trigger testing by creating local PR branch) May 16, 2026

Reduce XLS dimensions in test_pointwiseconv.py and test_pooling.py to speed up tests.

8578a62

This should fix timeout failure on CI: https://gitlab.cern.ch/fastmachinelearning/hls4ml/-/jobs/75309630
Note that XLS tests can be slow due to big (fully unrolled) IR size.

vasdommes wants to merge 104 commits into fastmachinelearning:main from vasdommes:xls_backend
