[aes] Base RTL implementation of GCM extension #2

vogelpi · 2024-11-08T13:51:50Z

This PR contains the base RTL implementation of the AES-GCM extension. The GHASH implementation in this PR is unhardened against SCA. The hardened implementation will be part of a follow-up PR.

This implementation has been tested using a basic Verilator testbench and some NIST vectors.

Support for AES-GCM will be added in a backward compatible manner. Existing software won't need to change. Signed-off-by: Pirmin Vogel <[email protected]>

Signed-off-by: Pirmin Vogel <[email protected]>

andrea-caforio · 2024-11-11T10:06:05Z

hw/ip/aes/rtl/aes_control.sv

@@ -79,6 +85,13 @@ module aes_control
  output logic                      cipher_data_out_clear_o,
  input  logic                      cipher_data_out_clear_i,

+  // GHASH control and sync
+  output sp2v_e                     ghash_in_valid_o,


These sparse two-value signals are only used in the AES block, why is that?

We use sparse two-value signals for FI hardening purposes. Implementing that hardening is quite cumbersome at the moment. Since the GHASH block is not yet final, I did not yet do the hardening there yet. But I wanted to keep the interface between the cores as stable as possible.

andrea-caforio · 2024-11-11T10:08:11Z

hw/ip/aes/rtl/aes_control_fsm.sv

-        aes_ctrl_ns  = !cipher_dec_key_gen_i ? CTRL_PRNG_UPDATE : CTRL_FINISH;
-      end
-
-      CTRL_PRNG_UPDATE: begin


Why is this state being superseded by CTRL_GHASH_READY?

I wanted to properly decouple the valid from the ready in the various handshake pairs. After investigating the FSM I realized I could do something similar to how we handled the clearing PRNG. Then I realized that the clearing PRNG actually doesn't need a handshake for the updating function (it's always ready, see separate commit). That's why I changed the integration of the PRNG and then re-purposed this FSM state to integrate the GHASH block.

andrea-caforio · 2024-11-11T10:09:43Z

hw/ip/aes/rtl/aes_core.sv

+      end
+
+      // Avoid aggressive synthesis optimizations.
+      logic [3:0][3:0][7:0] ghash_state_done_buf [NumShares];


In the worst case, what could be optimized away if the signal is not being buffered?

The mux could be optimized away, because most tools realize that in case where the output is not marked as valid, we anyway don't store it into the output registers. Without the mux, the two shares of the GHASH state will constantly get compared at the output.

But for the real hardened design, I think we won't need this. We can re-use existing logic inside GHASH to add the shares at the very end of the computation. This will take one more clock cycle but it will be more efficient area wise.

I've now removed this. It saves around 300 GEs :-)

vogelpi · 2024-11-11T11:10:55Z

Thanks for your review @andrea-caforio !

This commit adds a first implementation of the GHASH module required for AES-GCM support. This first version uses a single, pipelined GF(2^128) multiplier and 3 128-bit registers for the GHASH state, the hash subkey and the encrypted initial counter block J_0 (= S). The latency of the GF multiplier is matched to the latency of the cipher core. Signed-off-by: Pirmin Vogel <[email protected]>

For historical reasons, the clearing PRNG had a req/ack interface for both the pseudo-random data and the reseed interface. The former is actually not required as the PRNG can always provide suitable pseudo- random data, even in case of an outstanding reseed request. This commit thus simplifies the req/ack interface to a single update signal, similar to what is used for the masking PRNG. In addition to simplifying the code, this enables re-using the main control FSM state previously used for performing the handshake with the clearing PRNG, to perform an actually required handshake with another functional block such as the GHASH block used for AES-GCM. Signed-off-by: Pirmin Vogel <[email protected]>

Signed-off-by: Pirmin Vogel <[email protected]>

In contrast to the regular CTR mode where the counter performs inc128(), the counter only performs inc32() in GCM, i.e., the counter wraps at 32 bits. Signed-off-by: Pirmin Vogel <[email protected]>

This commit optimizes the clearing logic of the GHASH block to clear internal registers by loading the cipher core output after the cipher output has cleared its internal state. This allows reducing the internal muxing logic by 2x 128-bit wide 2-to-1 muxes and the big 128-bit wide 5-to-1 state mux looses one input, too. When using the open source synthesis flow, this helps reducing the area by roughly 1 kGE. Signed-off-by: Pirmin Vogel <[email protected]>

vogelpi · 2024-11-12T16:31:01Z

CHANGE AUTHORIZED: hw/ip/aes/data/aes.hjson
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_control.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_control_fsm.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_control_fsm_n.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_control_fsm_p.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_core.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_ctr.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_ctr_fsm.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_ctr_fsm_n.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_ctr_fsm_p.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_ctrl_gcm_reg_shadowed.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_ctrl_reg_shadowed.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_ghash.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_pkg.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_prng_clearing.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_reg_pkg.sv
CHANGE AUTHORIZED: hw/ip/aes/rtl/aes_reg_top.sv

vogelpi added 2 commits November 8, 2024 14:49

[aes] Bump minor version and return to D1/V1 for AES-GCM

7c90d2e

Support for AES-GCM will be added in a backward compatible manner. Existing software won't need to change. Signed-off-by: Pirmin Vogel <[email protected]>

[aes] Extend CSRs for AES-GCM support

7fef8b2

Signed-off-by: Pirmin Vogel <[email protected]>

vogelpi force-pushed the aes-gcm-base-rtl branch from 998629f to 3aaa967 Compare November 8, 2024 16:20

vogelpi requested a review from andrea-caforio November 8, 2024 16:25

andrea-caforio reviewed Nov 11, 2024

View reviewed changes

vogelpi added 5 commits November 11, 2024 16:19

[aes] Integrate GHASH module into design and interface main controller

ff67697

Signed-off-by: Pirmin Vogel <[email protected]>

[aes] Add support for inc32() to the counter module

71167e2

In contrast to the regular CTR mode where the counter performs inc128(), the counter only performs inc32() in GCM, i.e., the counter wraps at 32 bits. Signed-off-by: Pirmin Vogel <[email protected]>

vogelpi force-pushed the aes-gcm-base-rtl branch from 3aaa967 to d249304 Compare November 11, 2024 15:19

vogelpi merged commit 537e756 into aes-gcm-review Nov 12, 2024
13 of 17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[aes] Base RTL implementation of GCM extension #2

[aes] Base RTL implementation of GCM extension #2

vogelpi commented Nov 8, 2024

andrea-caforio Nov 11, 2024

vogelpi Nov 11, 2024

andrea-caforio Nov 11, 2024

vogelpi Nov 11, 2024

andrea-caforio Nov 11, 2024

vogelpi Nov 11, 2024

vogelpi Nov 11, 2024

vogelpi commented Nov 11, 2024

vogelpi commented Nov 12, 2024

[aes] Base RTL implementation of GCM extension #2

[aes] Base RTL implementation of GCM extension #2

Conversation

vogelpi commented Nov 8, 2024

andrea-caforio Nov 11, 2024

Choose a reason for hiding this comment

vogelpi Nov 11, 2024

Choose a reason for hiding this comment

andrea-caforio Nov 11, 2024

Choose a reason for hiding this comment

vogelpi Nov 11, 2024

Choose a reason for hiding this comment

andrea-caforio Nov 11, 2024

Choose a reason for hiding this comment

vogelpi Nov 11, 2024

Choose a reason for hiding this comment

vogelpi Nov 11, 2024

Choose a reason for hiding this comment

vogelpi commented Nov 11, 2024

vogelpi commented Nov 12, 2024