[Refactor]Refactor of vllm_ascend/distributed module #5719

luoxiaolin712 · 2026-01-08T07:01:13Z

What this PR does / why we need it?

Based on RFC #5604.

This PR refactors vllm_ascend/distributed, moving all kv_transfer-related code into a dedicated folder, as has already been done in vLLM.

Does this PR introduce any user-facing change?

NA

How was this patch tested?

vLLM version: v0.13.0
vLLM main: vllm-project/vllm@2f4e654

github-actions · 2026-01-08T07:01:39Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

gemini-code-assist

Code Review

This pull request is a large-scale refactoring of the vllm_ascend/distributed module, moving files into a more organized directory structure and updating import paths accordingly. While this is a positive change for code organization, I've identified several incorrect import paths that will cause ImportError exceptions at runtime. These are critical issues that need to be fixed. I've provided specific comments and suggestions for each case.

gemini-code-assist · 2026-01-08T07:03:08Z

tests/ut/distributed/mooncake/test_config_data.py

+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend import (  # noqa: E402
    _convert_to_bytes, _parse_global_segment_size)


The import for _convert_to_bytes and _parse_global_segment_size is incorrect. They are being imported from the backend package, but the __init__.py for that package is empty, which will cause an ImportError. Please either update the __init__.py to expose these functions or change the import to be direct from their module.

Suggested change

-from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend import (  # noqa: E402
-    _convert_to_bytes, _parse_global_segment_size)
+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend.mooncake_backend import (  # noqa: E402
+    _convert_to_bytes, _parse_global_segment_size)
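The failure mode described in this comment can be reproduced in isolation. The sketch below is only an illustration: it builds a throwaway package named `demo_backend` (a hypothetical stand-in, not the real `vllm_ascend` package) and checks three cases: importing a name from a package whose `__init__.py` is empty, importing it directly from its module, and importing it from the package after `__init__.py` re-exports it. Whether the real `backend/__init__.py` is empty is the reviewer's claim and is not verified here.

```python
import importlib
import sys
import tempfile
from pathlib import Path


def check_import(init_source: str, statement: str) -> bool:
    """Return True if `statement` imports cleanly against a throwaway package.

    `demo_backend` and its contents are hypothetical stand-ins used only
    to illustrate the two fixes suggested in the review comment.
    """
    with tempfile.TemporaryDirectory() as tmp:
        pkg = Path(tmp) / "demo_backend"
        pkg.mkdir()
        (pkg / "__init__.py").write_text(init_source)
        (pkg / "mooncake_backend.py").write_text(
            "def _convert_to_bytes(value):\n    return int(value)\n")
        sys.path.insert(0, tmp)
        importlib.invalidate_caches()
        try:
            exec(statement, {})
            return True
        except ImportError:
            return False
        finally:
            # Undo the path change and drop cached modules so that each
            # call starts from a clean state.
            sys.path.remove(tmp)
            for name in [m for m in sys.modules
                         if m.split(".")[0] == "demo_backend"]:
                del sys.modules[name]


if __name__ == "__main__":
    # Empty __init__.py: the name is not exposed by the package.
    print(check_import("", "from demo_backend import _convert_to_bytes"))
    # Fix 1: import directly from the defining module.
    print(check_import(
        "", "from demo_backend.mooncake_backend import _convert_to_bytes"))
    # Fix 2: re-export the name from __init__.py.
    print(check_import(
        "from .mooncake_backend import _convert_to_bytes\n",
        "from demo_backend import _convert_to_bytes"))
```

Both fixes make the import succeed; importing directly from the module keeps `__init__.py` free of side effects, while re-exporting gives callers a shorter path.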

gemini-code-assist · 2026-01-08T07:03:08Z

vllm_ascend/distributed/kv_transfer/kv_pool/ascend_store/kv_transfer.py

 from vllm.logger import logger

-from vllm_ascend.distributed.kvpool.backend.backend import Backend
+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend import Backend


The import for Backend is incorrect. It is being imported from the backend package, but the __init__.py for that package is empty, which will cause an ImportError. Please either update the __init__.py to expose this class or change the import to be direct from its module.

Suggested change

-from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend import Backend
+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend.backend import Backend

gemini-code-assist · 2026-01-08T07:03:08Z

vllm_ascend/distributed/kv_transfer/kv_pool/ascend_store/pool_worker.py

+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend import Backend
+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend.memcache_backend import \
    MemcacheBackend
-from vllm_ascend.distributed.kvpool.backend.mooncake_backend import \
+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend import \
    MooncakeBackend


The imports for Backend and MooncakeBackend are incorrect. They are being imported from the backend package, but the __init__.py for that package is empty, which will cause an ImportError. Please either update the __init__.py to expose these classes or change the imports to be direct from their respective modules.

Suggested change

-from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend import Backend
-from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend.memcache_backend import \
-    MemcacheBackend
-from vllm_ascend.distributed.kvpool.backend.mooncake_backend import \
-from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend import \
-    MooncakeBackend
+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend.backend import Backend
+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend.memcache_backend import \
+    MemcacheBackend
+from vllm_ascend.distributed.kv_transfer.kv_pool.ascend_store.backend.mooncake_backend import \
+    MooncakeBackend

gemini-code-assist · 2026-01-08T07:03:08Z

vllm_ascend/ops/fused_moe/prepare_finalize.py


 from vllm_ascend.ascend_config import get_ascend_config
-from vllm_ascend.distributed.utils import fc3_all_gather_and_maybe_unpad_impl
+from vllm_ascend.distributed.kv_transfer.utils.utils import fc3_all_gather_and_maybe_unpad_impl


The import path for fc3_all_gather_and_maybe_unpad_impl is incorrect. This function was not moved to vllm_ascend.distributed.kv_transfer.utils.utils as part of the refactoring; it remains in vllm_ascend.distributed.utils. This change will cause an ImportError.

Suggested change

-from vllm_ascend.distributed.kv_transfer.utils.utils import fc3_all_gather_and_maybe_unpad_impl
+from vllm_ascend.distributed.utils import fc3_all_gather_and_maybe_unpad_impl
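A quick way to confirm which module actually defines a name after a refactor is to resolve it dynamically. The helper below, `resolves`, is a hypothetical utility written for this note and is not part of vLLM Ascend; it is demonstrated on standard-library modules so it runs anywhere. Applying it to the two `vllm_ascend` paths above would require the package to be installed in the environment.

```python
import importlib


def resolves(module_path: str, name: str) -> bool:
    """Return True if `module_path` imports and exposes attribute `name`.

    Note: this checks attributes only. A `from package import name`
    statement also falls back to importing a submodule called `name`,
    which this sketch does not attempt.
    """
    try:
        module = importlib.import_module(module_path)
    except ImportError:
        return False
    return hasattr(module, name)


if __name__ == "__main__":
    # Standard-library examples, used only to show the behavior.
    print(resolves("os.path", "join"))                 # name exists
    print(resolves("os.path", "no_such_function"))     # module ok, name missing
    print(resolves("no_such_package_xyz.utils", "x"))  # module missing
```

Running such a check against both the old and the new import path shows which one the refactor left in place, before any call site is changed.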

vllm_ascend/distributed/kv_transfer/__init__.py

luoxiaolin712 · 2026-01-09T09:09:54Z

The RFC has reached an agreement.

github-actions · 2026-01-09T09:50:35Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

wangxiyuan · 2026-01-13T09:24:28Z

@LCAIZJ @liziyu179

wangxiyuan · 2026-01-14T00:53:39Z

please fix the merge conflict.

lidenghui1110 · 2026-01-14T08:38:59Z

Organizing by function is OK with me. However, cpu_offload_manager in the distributed directory is relied on by cpu_offload_connector; maybe cpu_offload_manager should be moved into the cpu_offload dir after this refactor?

Pz1116 · 2026-01-14T09:31:05Z

Organizing by function is OK with me. However, cpu_offload_manager in the distributed directory is relied on by cpu_offload_connector; maybe cpu_offload_manager should be moved into the cpu_offload dir after this refactor?

My bad, moving cpu_offload_manager under the cpu_offload dir is exactly what we planned to do, but I missed this part when drawing the table in the RFC. We'll fix this shortly.

LCAIZJ · 2026-01-20T04:02:29Z

LGTM

github-actions bot added ci/build module:tests module:ops labels Jan 8, 2026

gemini-code-assist bot reviewed Jan 8, 2026

luoxiaolin712 changed the title ~~[RPC]Refactor of vllm_ascend/distributed module~~ [RFC]Refactor of vllm_ascend/distributed module Jan 8, 2026

luoxiaolin712 changed the title ~~[RFC]Refactor of vllm_ascend/distributed module~~ [Refactor]Refactor of vllm_ascend/distributed module Jan 8, 2026

Pz1116 reviewed Jan 8, 2026

vllm_ascend/distributed/kv_transfer/__init__.py (resolved)

luoxiaolin712 force-pushed the main branch from 09e122f to dbe97cb Compare January 9, 2026 03:24

wangxiyuan mentioned this pull request Jan 9, 2026

[RFC]: Refactor of vllm_ascend/distributed module #5604

Open

github-actions bot added the merge-conflicts label Jan 9, 2026

luoxiaolin712 closed this Jan 12, 2026

luoxiaolin712 force-pushed the main branch from 0863f38 to 920bbe9 Compare January 12, 2026 02:24

luoxiaolin712 reopened this Jan 12, 2026

github-actions bot removed the merge-conflicts label Jan 12, 2026

luoxiaolin712 closed this Jan 12, 2026

luoxiaolin712 reopened this Jan 12, 2026

luoxiaolin712 closed this Jan 12, 2026

luoxiaolin712 reopened this Jan 12, 2026

luoxiaolin712 closed this Jan 12, 2026

luoxiaolin712 force-pushed the main branch from ecc1a57 to 297f6de Compare January 12, 2026 06:27

luoxiaolin712 reopened this Jan 12, 2026

luoxiaolin712 closed this Jan 13, 2026

luoxiaolin712 force-pushed the main branch from 786cb57 to 297f6de Compare January 13, 2026 01:23

luoxiaolin712 reopened this Jan 13, 2026

wangxiyuan approved these changes Jan 13, 2026

luoxiaolin712 closed this Jan 13, 2026

luoxiaolin712 force-pushed the main branch from be9c7fa to eed9e36 Compare January 13, 2026 09:44

luoxiaolin712 reopened this Jan 13, 2026

luoxiaolin712 closed this Jan 14, 2026

luoxiaolin712 force-pushed the main branch from 71c4898 to e20813f Compare January 14, 2026 01:21

Refactor of vllm_ascend/distributed module

7f45782

Signed-off-by: lty <[email protected]>

luoxiaolin712 reopened this Jan 14, 2026

luoxiaolin712 closed this Jan 14, 2026

luoxiaolin712 reopened this Jan 14, 2026

luoxiaolin712 added 2 commits January 14, 2026 10:12

fix pre-commit

59fe641

Signed-off-by: lty <[email protected]>

fix pre-commit

a15a988

Signed-off-by: lty <[email protected]>

luoxiaolin712 requested a review from wangxiyuan January 14, 2026 07:52

luoxiaolin712 added 2 commits January 14, 2026 17:33

Refactor of vllm_ascend/distributed module

fd5d120

Signed-off-by: lty <[email protected]>

Refactor of vllm_ascend/distributed module

08be6d5

Signed-off-by: lty <[email protected]>

wangxiyuan approved these changes Jan 15, 2026

wangxiyuan merged commit 295018e into vllm-project:main Jan 15, 2026
16 checks passed
