[FSU] Inference FSU with Shared memory #2969

Open
wants to merge 8 commits into base: main

Conversation

DonghakPark
Member

[FSU] Inference FSU with Shared memory

To reduce memory usage during inference by utilizing FSU, and to minimize speed degradation by loading weights during forwarding, this PR changes FSU to use shared memory. It also ensures that the existing swap in training mode still works as before.
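As a rough illustration of the shared-memory approach, here is a minimal sketch assuming a POSIX mmap of the weight bin file; the struct and function names below are hypothetical and are not nntrainer's actual API:

```cpp
// Illustrative only: map a weight region of the weight bin file into memory
// instead of reading it into a separately allocated buffer.
#include <fcntl.h>
#include <sys/mman.h>
#include <unistd.h>

#include <cstddef>
#include <stdexcept>

struct MappedWeight {
  void *base;    // page-aligned address returned by mmap (pass to munmap)
  void *data;    // start of the requested weight inside the mapping
  size_t length; // total mapped length
};

MappedWeight mapWeightRegion(const char *weight_bin_path, off_t offset,
                             size_t size) {
  int fd = open(weight_bin_path, O_RDONLY);
  if (fd < 0)
    throw std::runtime_error("failed to open weight bin");

  // mmap needs a page-aligned file offset, so align down and keep the
  // in-page remainder to locate the actual weight data.
  const off_t page = sysconf(_SC_PAGESIZE);
  const off_t aligned = offset - (offset % page);
  const size_t pad = static_cast<size_t>(offset - aligned);

  void *base = mmap(nullptr, size + pad, PROT_READ, MAP_SHARED, fd, aligned);
  close(fd); // the mapping stays valid after the fd is closed
  if (base == MAP_FAILED)
    throw std::runtime_error("mmap failed");

  return {base, static_cast<char *>(base) + pad, size + pad};
}

void unmapWeightRegion(const MappedWeight &w) { munmap(w.base, w.length); }
```

In this scheme only inference takes the mapping route; training keeps the existing swap path, as described in the commits below.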

Commit 1 : [FSU] Update FSU Forwarding (Load) Logic

  • Change FSU forwarding logic (load weights with look-ahead)

Commit 2 : [FSU] Update swap device & cache element

  • Update the swap device's functions to support FSU (inference)

Commit 3 : [FSU] Update FSU mem allocate Logic

  • Update Memory Allocation to Shared Mem

Commit 4 : [FSU] add FSU file offset info

  • Add the weight bin file offset so that it can be passed to the swap device

Commit 5 : [FSU] Apply Shared Mem & FSU

  • Update Logic to support both Inference Mode & Training Mode

This PR includes #2957, #2927, and #2949, so I will close the previous PRs.

DonghakPark and others added 4 commits February 25, 2025 15:35
Update FSU forwarding logic
- FSU will handle look-ahead tensors inside the pool,
- so we don't need to call Loadtensor for f + i (a rough sketch follows below)

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Donghak PARK <[email protected]>
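Hedged sketch of the look-ahead behaviour described in this commit (the class and callback names are assumptions, not the actual cache-pool interface): when layer `i` runs forward, the pool makes sure the weights for layers `i` through `i + lookahead` are resident, so forwarding no longer issues a separate load call per look-ahead tensor.

```cpp
// Illustrative only: a pool that keeps the next `lookahead` layers' weights
// loaded while forwarding. Names do not map to nntrainer classes.
#include <algorithm>
#include <cstddef>
#include <functional>

class LookaheadPool {
public:
  LookaheadPool(size_t num_layers, size_t lookahead,
                std::function<void(size_t)> load,
                std::function<void(size_t)> unload) :
    num_layers(num_layers), lookahead(lookahead), load(std::move(load)),
    unload(std::move(unload)) {}

  // Called once per layer during forwarding.
  void onForward(size_t layer) {
    const size_t last = std::min(layer + lookahead, num_layers - 1);
    for (size_t i = layer; i <= last; ++i)
      load(i); // expected to be a no-op when the weights are already resident
    if (layer > 0)
      unload(layer - 1); // the previous layer's weights are no longer needed
  }

private:
  size_t num_layers, lookahead;
  std::function<void(size_t)> load, unload;
};
```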
Add memory ptr for allocating shared mem
- add mem_ptr
- add an unmap array for managing unmapped ptrs (sketched below)

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Donghak PARK <[email protected]>
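A minimal sketch of the kind of bookkeeping this commit describes: remembering each mapped pointer so it is unmapped exactly once (the class, member, and method names are assumptions, not the actual swap-device members).

```cpp
// Illustrative only: track mapped pointers and their lengths so a pointer
// is never munmap'ed twice.
#include <sys/mman.h>

#include <cstddef>
#include <unordered_map>

class SharedMemTracker {
public:
  // Record a mapping created elsewhere (e.g. by mmap).
  void remember(void *mem_ptr, size_t len) { mapped[mem_ptr] = len; }

  // Unmap if still mapped; later calls with the same pointer are no-ops.
  void release(void *mem_ptr) {
    auto it = mapped.find(mem_ptr);
    if (it == mapped.end())
      return; // already unmapped or never tracked
    munmap(it->first, it->second);
    mapped.erase(it);
  }

private:
  std::unordered_map<void *, size_t> mapped; // mem_ptr -> mapped length
};
```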
I have changed the method from using dynamic memory allocation to using static memory allocation.
In order to prevent multiple frees, I added a map to check whether the mem_address has already been processed. Previously, memory was allocated through buf, but now it is being allocated directly.

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Co-authored-by: jijoong.moon <[email protected]>
Signed-off-by: Donghak PARK <[email protected]>
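The "map to check whether the mem_address has already been processed" mentioned above could look roughly like this sketch, under the assumption that memory is released with free(); the names are hypothetical, not the actual pool implementation.

```cpp
// Illustrative only: free an address at most once by tracking processed
// addresses in a map.
#include <cstdlib>
#include <mutex>
#include <unordered_map>

class FreeGuard {
public:
  void markAllocated(void *addr) {
    std::lock_guard<std::mutex> lock(mtx);
    processed[addr] = false; // allocated, not yet freed
  }

  // Frees addr only the first time; repeated calls are ignored.
  void freeOnce(void *addr) {
    std::lock_guard<std::mutex> lock(mtx);
    auto it = processed.find(addr);
    if (it == processed.end() || it->second)
      return; // unknown address or already freed
    std::free(addr);
    it->second = true;
  }

private:
  std::mutex mtx;
  std::unordered_map<void *, bool> processed; // addr -> freed?
};
```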
Make neuralnet pass the path and the weight offset (file offset) to the swap_device,
so that the weight file's offsets can be calculated (a rough sketch follows below).

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Co-authored-by: hyeonseok <[email protected]>
Signed-off-by: Donghak PARK <[email protected]>
Apply Shared mem & FSU
- in inference mode: read from the weight bin (at the weight offset)
- in training mode: same logic as swap (see the sketch below)

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Donghak PARK <[email protected]>
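Hedged sketch of the mode split this commit describes (the enum and loader callbacks are illustrative stand-ins): inference maps the weight straight out of the weight bin at its recorded offset, while training keeps the existing swap behaviour.

```cpp
// Illustrative only: choose between the weight-bin mapping path (inference)
// and the existing swap path (training).
#include <cstddef>
#include <functional>

enum class ExecMode { INFERENCE, TRAIN };

using Loader = std::function<void *(size_t /*weight index*/)>;

void *getWeightMemory(ExecMode mode, size_t idx,
                      const Loader &map_from_weight_bin, // e.g. via mmap
                      const Loader &read_from_swap) {    // existing swap path
  if (mode == ExecMode::INFERENCE)
    return map_from_weight_bin(idx); // shared, read-only mapping; no copy
  return read_from_swap(idx);        // training: unchanged swap logic
}
```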
Fix unittest failure in the training-case swap
- There was an issue in PutBuffer where the ptr could not be freed

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Donghak PARK <[email protected]>
Apply clang-format to changed files

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Donghak PARK <[email protected]>
Update FSU unittest
- For now, we should set the weight & input sizes to pagesize * N
- Later I will add a page-align algorithm (a rough sketch of the rounding follows below)

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Donghak PARK <[email protected]>
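Until a proper page-align step lands, the unittest sizes must already be multiples of the page size; the eventual rounding could be as simple as this sketch (the function name is hypothetical).

```cpp
// Illustrative only: round a byte size up to the next multiple of the
// system page size.
#include <unistd.h>

#include <cstddef>

size_t alignToPage(size_t bytes) {
  const size_t page = static_cast<size_t>(sysconf(_SC_PAGESIZE)); // often 4096
  return ((bytes + page - 1) / page) * page;
}

// e.g. alignToPage(5000) == 8192 on a 4 KiB-page system.
```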