JSON collaterals serialization with Bytemuck, and implementation of their corresponding ZeroCopy types #19

preston4896 · 2025-05-14T06:03:27Z

Overview

The pod module in dcap-rs provides an efficient method for handling Intel SGX/TDX attestation data structures (TcbInfo and EnclaveIdentity) using a zero-copy approach. This is particularly valuable in memory-constrained environments like Solana programs.

Instead of deserializing data into deeply nested Rust structs (which require significant heap allocations), the zero-copy approach allows direct access to data within a contiguous byte buffer, reading only what's needed without allocating new memory for each component.

Serialization Format and Process

General Binary Format

All serialized structures follow a common pattern:

[Signature (64 bytes)] | [Main Header (fixed-size)] | [Variable Payload]

The main header contains metadata needed to interpret the variable payload, including lengths, counts, and flags. The payload contains all variable-length and nested data in a carefully structured format.
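
As a rough illustration of this layout, the following sketch splits a buffer into its three regions without copying. It uses only the standard library; `HEADER_LEN`, `Regions`, and `split_regions` are hypothetical names chosen for this example and are not taken from dcap-rs, where the header size depends on the concrete header type.

```rust
use std::convert::TryFrom;

/// Length of the signature prefix, as given in the layout above.
pub const SIGNATURE_LEN: usize = 64;
/// Hypothetical fixed header size, chosen only for this sketch.
pub const HEADER_LEN: usize = 16;

/// Borrowed view over the three regions of a serialized buffer.
#[derive(Debug)]
pub struct Regions<'a> {
    pub signature: &'a [u8; SIGNATURE_LEN],
    pub header: &'a [u8],
    pub payload: &'a [u8],
}

/// Splits `bytes` into signature, header, and payload without copying.
/// Returns `None` if the buffer is too short to hold the fixed-size parts.
pub fn split_regions<'a>(bytes: &'a [u8]) -> Option<Regions<'a>> {
    if bytes.len() < SIGNATURE_LEN + HEADER_LEN {
        return None;
    }
    let (signature, rest) = bytes.split_at(SIGNATURE_LEN);
    let (header, payload) = rest.split_at(HEADER_LEN);
    Some(Regions {
        // The length was checked above, so this conversion cannot fail here.
        signature: <&[u8; SIGNATURE_LEN]>::try_from(signature).ok()?,
        header,
        payload,
    })
}
```

All three fields borrow from the input buffer, so the view cannot outlive it.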

Serialization Process

- For both TcbInfo and EnclaveIdentity, serialization begins with creating a fixed-size header structure.
- Header fields are populated with metadata from the original Rust struct.
- A payload byte vector is incrementally built by appending:
  - Fixed-size data (like TDX module data)
  - String data (sometimes stored as ASCII/hex strings)
  - Serialized nested structures (each with their own headers and payloads)
- Proper alignment is maintained using padding bytes where necessary.
- The signature, header, and payload are combined into a single byte vector.

For example, in SerializedTcbInfo::from_rust_tcb_info(), the function:

- Creates and populates a TcbInfoHeader
- Builds the payload by appending data for TDX module, TDX identities, and TCB levels
- Tracks offsets and sizes to populate header metadata fields
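
The general shape of such a serializer can be sketched as follows. This is a simplified stand-in, not the dcap-rs implementation: `SketchHeader`, `serialize`, the length-prefixed item encoding, and the 4-byte alignment are all assumptions made for illustration, and fields are written by hand with `to_le_bytes` instead of through bytemuck.

```rust
/// Rounds `offset` up to the next multiple of `align` (which must be a power of two).
pub fn align_up(offset: usize, align: usize) -> usize {
    (offset + align - 1) & !(align - 1)
}

/// Hypothetical header for this sketch: item count and payload length.
pub struct SketchHeader {
    pub item_count: u32,
    pub payload_len: u32,
}

/// Serializes `items` as signature | header | payload. Each item is stored as
/// a little-endian u32 length, the item bytes, and zero padding up to the
/// next 4-byte boundary.
pub fn serialize(signature: &[u8; 64], items: &[&[u8]]) -> Vec<u8> {
    // Build the payload first so its final length can go into the header.
    let mut payload = Vec::new();
    for item in items {
        payload.extend_from_slice(&(item.len() as u32).to_le_bytes());
        payload.extend_from_slice(item);
        // Pad so that the next length prefix starts on a 4-byte boundary.
        let padded_len = align_up(payload.len(), 4);
        payload.resize(padded_len, 0);
    }

    let header = SketchHeader {
        item_count: items.len() as u32,
        payload_len: payload.len() as u32,
    };

    // Combine signature, header, and payload into a single byte vector.
    let mut out = Vec::with_capacity(64 + 8 + payload.len());
    out.extend_from_slice(signature);
    out.extend_from_slice(&header.item_count.to_le_bytes());
    out.extend_from_slice(&header.payload_len.to_le_bytes());
    out.extend_from_slice(&payload);
    out
}
```

Building the payload before the header means the lengths and counts recorded in the header are known exactly when the header is written.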

Deserialization and Zero-Copy Access

Deserialization Process

- The signature is extracted from the first 64 bytes.
- The fixed-size header is read using bytemuck::try_from_bytes().
- Based on header metadata, slices of the payload are created for each section.
- These slices are used to create zero-copy view structs that reference the original buffer.

For example, TcbInfoZeroCopy::from_bytes():

pub fn from_bytes(bytes: &'a [u8]) -> Result<Self, ZeroCopyError> {
    // Reject buffers that are too short to hold the header
    // (the error variant name used in this excerpt is illustrative)
    let header_len = core::mem::size_of::<TcbInfoHeader>();
    if bytes.len() < header_len {
        return Err(ZeroCopyError::InvalidSliceLength);
    }

    // Extract header from start of bytes
    let (header_bytes, main_payload) = bytes.split_at(header_len);
    let header: &TcbInfoHeader = cast_slice(header_bytes)?;

    // Use header metadata to slice the payload into sections
    let mut current_offset = 0;
    let tdx_module_data_payload = if header.tdx_module_present == 1 {
        // Extract TDX module data based on length in header
        let len = header.tdx_module_data_len as usize;
        // `get` returns an Option, so convert a missing range into an error before `?`
        let slice = main_payload
            .get(current_offset..current_offset + len)
            .ok_or(ZeroCopyError::InvalidSliceLength)?;
        current_offset += len;
        Some(cast_slice(slice)?)
    } else {
        None
    };

    // Extract remaining sections similarly
    // ...

    Ok(Self {
        header,
        tdx_module_data_payload,
        tdx_module_identities_section_payload,
        tcb_levels_section_payload,
    })
}

Accessing Payload Data

The zero-copy structs provide accessor methods that:

- Use header metadata to locate data within the payload
- Interpret bytes directly from the original buffer
- Return either references to the data or parsed values

Examples from TcbInfoZeroCopy:

pub fn fmspc(&self) -> [u8; 6] { 
    hex::decode(from_utf8(&self.header.fmspc_hex).unwrap()).unwrap().try_into().unwrap()
}

pub fn tcb_levels(&self) -> TcbLevelIter<'a> {
    TcbLevelIter::new(
        self.tcb_levels_section_payload,
        self.header.tcb_levels_count,
    )
}
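
The `fmspc()` accessor above goes through `hex::decode`, which returns a heap-allocated `Vec<u8>`, and it panics on malformed input. If that matters in a given environment, the hex field could instead be decoded straight into a fixed-size array. The sketch below assumes the field holds exactly 12 ASCII hex characters (inferred from the field name `fmspc_hex` and the 6-byte return type); `decode_fmspc` is a hypothetical helper, not part of dcap-rs.

```rust
/// Decodes 12 ASCII hex characters into 6 bytes without allocating.
/// Returns `None` if any character is not a hex digit.
pub fn decode_fmspc(hex: &[u8; 12]) -> Option<[u8; 6]> {
    // Maps one ASCII hex digit to its 4-bit value.
    fn nibble(c: u8) -> Option<u8> {
        match c {
            b'0'..=b'9' => Some(c - b'0'),
            b'a'..=b'f' => Some(c - b'a' + 10),
            b'A'..=b'F' => Some(c - b'A' + 10),
            _ => None,
        }
    }

    let mut out = [0u8; 6];
    // 12 input bytes split into exactly 6 pairs of hex digits.
    for (i, pair) in hex.chunks(2).enumerate() {
        out[i] = (nibble(pair[0])? << 4) | nibble(pair[1])?;
    }
    Some(out)
}
```

Returning `Option` leaves the decision about malformed input to the caller instead of panicking inside the accessor.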

Handling Variable and Nested Data

Payload Structure

The payload is organized hierarchically:

- Top-level payload contains sections for different components (TDX module, TCB levels, etc.)
- Each section may contain multiple items (e.g., multiple TCB levels)
- Each item has its own header and payload, embedded within the parent's payload

The header at each level contains metadata needed to locate and interpret its children:

- Counts (number of items in a collection)
- Lengths (size of sections in bytes)
- Presence flags (for optional components)
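
To make the three kinds of metadata concrete, here is a minimal sketch of a header that carries a count, two lengths, and a presence flag, together with a helper that turns them into byte ranges within the payload. `SectionHeader`, its field names, and `section_ranges` are hypothetical and exist only for this example. The explicit `_padding` field reflects the usual requirement that a type cast directly from bytes (for example a bytemuck `Pod` type) has no implicit padding.

```rust
use std::ops::Range;

/// Hypothetical header carrying a count, two lengths, and a presence flag.
#[repr(C)]
#[derive(Clone, Copy, Debug, PartialEq)]
pub struct SectionHeader {
    pub item_count: u32,      // number of items in the items section
    pub items_len: u32,       // size of the items section in bytes
    pub optional_len: u32,    // size of the optional block in bytes
    pub optional_present: u8, // 1 if the optional block is present
    pub _padding: [u8; 3],    // explicit padding up to a multiple of 4 bytes
}

/// Size of the header in bytes; every field is accounted for explicitly.
pub const HEADER_SIZE: usize = std::mem::size_of::<SectionHeader>();

/// Computes where each section lives inside the payload. The optional block,
/// when present, comes first and is followed by the items section.
pub fn section_ranges(header: &SectionHeader) -> (Option<Range<usize>>, Range<usize>) {
    let mut offset = 0usize;
    let optional = if header.optional_present == 1 {
        let len = header.optional_len as usize;
        let range = offset..offset + len;
        offset += len;
        Some(range)
    } else {
        None
    };
    let items = offset..offset + header.items_len as usize;
    (optional, items)
}
```

The returned ranges can be passed to `slice::get`, which yields `None` instead of panicking when the header describes more data than the buffer holds.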

Iterators for Nested Structures

Complex nested collections use iterator types (e.g., TcbLevelIter, TdxModuleIdentityIter):

- Iterators are initialized with:
  - A byte slice containing the section payload
  - A count of items from the parent header
- The iterator maintains:
  - Current position in the byte slice
  - Index of the current item
  - Item count to determine when iteration is complete
- For each .next() call, the iterator:
  - Reads a fixed-size header at the current position
  - Uses header metadata to determine payload boundaries
  - Creates a zero-copy view struct for the item
  - Advances position accounting for padding and alignment

Example structure of TcbLevelIter:

use core::mem::{self, align_of};

pub struct TcbLevelIter<'a> {
    bytes: &'a [u8],       // Slice of payload containing all TCB levels
    count: u32,            // Total number of levels to iterate over
    current_index: usize,  // Index of the next item to yield
    current_offset: usize, // Byte offset of the next item's header
}

impl<'a> Iterator for TcbLevelIter<'a> {
    type Item = Result<TcbLevelZeroCopy<'a>, ZeroCopyError>;

    fn next(&mut self) -> Option<Self::Item> {
        if self.current_index >= self.count as usize {
            return None; // Iteration complete
        }

        // Extract header at current offset, passing exactly the header's bytes
        // (slice indexing in this excerpt panics if the buffer is truncated)
        let header_end = self.current_offset + mem::size_of::<TcbLevelHeader>();
        let header: &TcbLevelHeader = match cast_slice(&self.bytes[self.current_offset..header_end]) {
            Ok(h) => h,
            Err(e) => return Some(Err(e)),
        };

        // Total size of this item (header plus payload), based on header metadata
        let total_item_size = /* calculate from header fields */;
        let payload_slice = &self.bytes[header_end..self.current_offset + total_item_size];

        // Create view struct
        let result = TcbLevelZeroCopy::new(header, payload_slice);

        // Update state for next item
        self.current_index += 1;
        self.current_offset += total_item_size;

        // Align to required boundary for next header
        self.current_offset = align_up(self.current_offset, align_of::<TcbLevelHeader>());

        Some(result)
    }
}
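
For readers who want something they can compile, the same pattern is sketched below in a self-contained form using only the standard library. It is a toy stand-in for `TcbLevelIter`, not the dcap-rs code: the item layout (an 8-byte header holding a little-endian `id` and `payload_len`, followed by the payload and padding to 4 bytes), the `align_up` helper, and all type names are assumptions made for this example.

```rust
/// Rounds `offset` up to the next multiple of `align` (which must be a power of two).
fn align_up(offset: usize, align: usize) -> usize {
    (offset + align - 1) & !(align - 1)
}

#[derive(Debug, PartialEq)]
pub enum SketchError {
    Truncated,
}

/// Borrowed view of one item: the id from its header plus its payload bytes.
#[derive(Debug, PartialEq)]
pub struct ItemView<'a> {
    pub id: u32,
    pub payload: &'a [u8],
}

/// Item header: id (u32 LE) followed by payload_len (u32 LE).
const ITEM_HEADER_LEN: usize = 8;

pub struct ItemIter<'a> {
    bytes: &'a [u8],
    count: u32,
    current_index: u32,
    current_offset: usize,
}

impl<'a> ItemIter<'a> {
    pub fn new(bytes: &'a [u8], count: u32) -> Self {
        ItemIter { bytes, count, current_index: 0, current_offset: 0 }
    }
}

fn read_u32(b: &[u8]) -> u32 {
    u32::from_le_bytes([b[0], b[1], b[2], b[3]])
}

impl<'a> Iterator for ItemIter<'a> {
    type Item = Result<ItemView<'a>, SketchError>;

    fn next(&mut self) -> Option<Self::Item> {
        if self.current_index >= self.count {
            return None; // Iteration complete
        }
        let bytes: &'a [u8] = self.bytes;
        let header_end = self.current_offset + ITEM_HEADER_LEN;

        // Read the fixed-size header; `get` avoids a panic on truncated input.
        let header = match bytes.get(self.current_offset..header_end) {
            Some(h) => h,
            None => {
                self.current_index = self.count; // stop after the first error
                return Some(Err(SketchError::Truncated));
            }
        };
        let id = read_u32(&header[0..4]);
        let payload_end = header_end + read_u32(&header[4..8]) as usize;

        // Use the length from the header to find the payload boundaries.
        let payload = match bytes.get(header_end..payload_end) {
            Some(p) => p,
            None => {
                self.current_index = self.count;
                return Some(Err(SketchError::Truncated));
            }
        };

        // Advance past this item and its padding to the next header.
        self.current_index += 1;
        self.current_offset = align_up(payload_end, 4);
        Some(Ok(ItemView { id, payload }))
    }
}
```

Yielding `Result` items lets a caller stop at the first malformed item, for example by collecting into `Result<Vec<_>, _>`.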

Relationship Between Types

Original Rust Types vs. Zero-Copy Types

- Original Types (TcbInfo, EnclaveIdentity):
  - Full Rust structs with owned data
  - Support normal Rust operations (clone, serialize to JSON, etc.)
  - Use heap allocations for strings and collections
  - Simple to use but memory-intensive
- Zero-Copy Types (TcbInfoZeroCopy, EnclaveIdentityZeroCopy):
  - View types that borrow slices of the original byte array
  - No heap allocations during access
  - Lifetime tied to the source byte array
  - Provide accessor methods and iterators for efficient access
  - More complex to use but minimal memory overhead

Conversion Between Types

When needed, zero-copy types can be converted to original types (gaining usability but losing memory efficiency):

// In the conversion module:
pub fn tcb_info_from_zero_copy(view: &TcbInfoZeroCopy) -> Result<TcbInfo, ZeroCopyError> {
    // Convert zero-copy view to owned TcbInfo struct
    // ...
}

// Usage in parse_tcb_pod_bytes:
let tcb_info_view = TcbInfoZeroCopy::from_bytes(bytes)?;
let rust_tcb_info = tcb_info_from_zero_copy(&tcb_info_view)?;
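
The trade-off in such a conversion can be seen in a reduced example. The sketch below is not the body of `tcb_info_from_zero_copy`; `RecordView`, `Record`, `ConvertError`, and `record_from_view` are hypothetical types with two fields each, chosen to show where the owned form starts allocating.

```rust
/// Borrowed view: both fields are slices into the source buffer.
pub struct RecordView<'a> {
    pub id_hex: &'a [u8],  // ASCII hex string
    pub entries: &'a [u8], // packed little-endian u16 values
}

/// Owned counterpart: the String and the Vec each own a heap allocation.
#[derive(Debug, PartialEq)]
pub struct Record {
    pub id: String,
    pub entries: Vec<u16>,
}

#[derive(Debug, PartialEq)]
pub enum ConvertError {
    InvalidUtf8,
    OddLength,
}

/// Copies everything the view points at into an owned `Record`.
pub fn record_from_view<'a>(view: &RecordView<'a>) -> Result<Record, ConvertError> {
    // Validate the string bytes, then copy them into an owned String.
    let id = std::str::from_utf8(view.id_hex)
        .map_err(|_| ConvertError::InvalidUtf8)?
        .to_owned();

    // Each entry takes two bytes, so an odd length means malformed input.
    if view.entries.len() % 2 != 0 {
        return Err(ConvertError::OddLength);
    }
    let entries = view
        .entries
        .chunks(2)
        .map(|c| u16::from_le_bytes([c[0], c[1]]))
        .collect();

    Ok(Record { id, entries })
}
```

The resulting `Record` no longer borrows from the buffer, so it can outlive it, at the cost of one allocation per owned field.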

Summary

The zero-copy design in dcap-rs allows for efficient handling of complex attestation data:

- Space Efficiency: Data is accessed directly from the source buffer without duplicating structures.
- Minimal Allocations: The source buffer is the only allocation the views need; they borrow from it instead of copying.
- Performance: Avoids parsing entire structures when only portions are needed.
- Flexibility: Can convert zero-copy views into full Rust structs when necessary.

This approach is particularly valuable in memory-constrained environments like Solana programs, where traditional deserialization would be too resource-intensive.

preston4896 added 19 commits May 12, 2025 13:36

expose TcbInfoAndSignature fields, serialize version to u16

e02881f

fmspc and pceid deserialize as strings

f9bd528

fixed fmspc and pceid pck comparison

5a71e11

more bytes should be deserialized as string types

977324a

dont use map for getters so we dont depend on copy trait

544a92c

change tcbv3components ordering

9200192

removed borsh and use bytemuck instead

0262abb

removed compute digest

15efb22

first attempt at bytemuck serialization

ab0aff2

bytemuck tcb serialization passed testing

519b794

renamed tcb_info pod module

17ed021

zero-copy independant of rust native tcb_info

c2a3b9e

zero-copy feature

18d22f7

i64 for timestamps

c9db61a

incorrect tdx component lookup

39e2543

implemented zero copy tcb lookup

f16142c

features set update

1052a69

pod tcb_info utils should not depend on TcbStatus

6250baa

comments about space and alignment for pod types

a5e5135

preston4896 marked this pull request as ready for review May 15, 2025 05:04

preston4896 requested a review from udsamani May 15, 2025 05:04

preston4896 added 4 commits May 16, 2025 12:25

reorganize pod modules

a181c10

qe identity type update: parse hexstrings as strings

d3533b6

enclave identity bytemuck serialization

912c931

implemented getters to return hex string data as bytes array

8521a87

preston4896 changed the title ~~TcbInfo Serialization with Bytemuck, and implementation of TcbInfo ZeroCopy types~~ JSON collaterals serialization with Bytemuck, and implementation of their corresponding ZeroCopy types May 19, 2025

preston4896 added 3 commits May 20, 2025 15:02

modified tcb status enum ordering

50aae7e

change tee type byte order in verified output

42ac557

fixed converge_tcb_status_with_tdx_module

3561f23
