
Conversation

@Potabk Potabk (Collaborator) commented Sep 25, 2025

What this PR does / why we need it?

Enable multi-threaded weight loading to speed up the test runs.
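
For context, each changed test passes the loader config shown in the review below; here is a minimal sketch, assuming the VllmRunner helper from tests/e2e/conftest.py and its generate_greedy method (the model name and test name are purely illustrative):

# Minimal sketch of the change applied to the E2E tests in this PR.
# Assumptions: VllmRunner comes from tests/e2e/conftest.py and exposes
# generate_greedy(); the model below is a placeholder, not necessarily
# the one the real tests use.
from tests.e2e.conftest import VllmRunner

def test_generation_with_multithread_weight_load():
    with VllmRunner(
        "Qwen/Qwen2.5-0.5B-Instruct",  # placeholder model
        model_loader_extra_config={
            "enable_multithread_load": True,  # turn on multi-threaded weight loading
            "num_threads": 8,                 # thread count used across the changed tests
        },
    ) as vllm_model:
        # The actual assertions stay the same as before this PR.
        vllm_model.generate_greedy(["Hello, my name is"], max_tokens=16)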

Does this PR introduce any user-facing change?

How was this patch tested?


👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

  • A PR should do only one thing; smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
  • Write the commit message by filling out the PR description, so that reviewers and future developers can understand the change.

If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.

Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request enables multi-threaded weight loading in numerous E2E tests to speed up CI execution. While this is a beneficial performance improvement, the implementation introduces significant code duplication by hardcoding the model_loader_extra_config dictionary in many places. I've added a review comment with a suggestion to refactor this into a shared, centralized constant to improve code maintainability and reduce redundancy.

Comment on lines +30 to +33
model_loader_extra_config={
    "enable_multithread_load": True,
    "num_threads": 8
},

Severity: high

This model_loader_extra_config dictionary is duplicated across at least 14 locations in 7 different files within this pull request. This widespread duplication creates a significant maintainability issue. For instance, changing the number of threads would require manually updating every occurrence.

To address this, I strongly recommend centralizing this configuration. You could define it as a constant in a shared module, such as tests/e2e/conftest.py, and then import it wherever it's needed.

Example of centralization:

# In a shared file like tests/e2e/conftest.py
import os

# Use half of the available CPUs, with a fallback for CI environments.
# This makes tests more portable.
NUM_LOADER_THREADS = (os.cpu_count() or 16) // 2

MULTI_THREAD_LOAD_CONFIG = {
    "enable_multithread_load": True,
    "num_threads": NUM_LOADER_THREADS,
}

Then, in each test, you would use:

# In your test file
from tests.e2e.conftest import VllmRunner, MULTI_THREAD_LOAD_CONFIG

# ...
with VllmRunner(
    # ...
    model_loader_extra_config=MULTI_THREAD_LOAD_CONFIG,
    # ...
) as vllm_model:
    ...

This approach not only resolves the duplication but also makes the thread count more dynamic and suitable for different environments by using os.cpu_count().

@Potabk Potabk added the ready (read for review) and ready-for-test (start test by label for PR) labels Sep 25, 2025
@Potabk Potabk (Collaborator, Author) commented Sep 26, 2025

This mechanism has little effect, so I'm closing this PR.

@Potabk Potabk closed this Sep 26, 2025