CVE-2025-24357

Description

vLLM is a library for LLM inference and serving. In vllm/model_executor/weight_utils.py, hf_model_weights_iterator loads model checkpoints downloaded from Hugging Face. It calls torch.load without setting the weights_only parameter, which defaults to False. If a checkpoint contains malicious pickle data, torch.load executes arbitrary code during unpickling. The vulnerability is fixed in v0.7.0.
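The unpickling behavior described above can be illustrated with the standard library alone. The following is a minimal sketch, not vLLM or PyTorch code: it uses Python's pickle module as a stand-in for the unrestricted torch.load path, and the names MaliciousPayload, RestrictedUnpickler, and safe_loads are hypothetical, introduced only for illustration. The allowlist is an assumption chosen for the example, not the list PyTorch uses.

```python
import io
import pickle


class MaliciousPayload:
    """Hypothetical attacker-controlled object embedded in a checkpoint."""

    def __reduce__(self):
        # pickle stores this (callable, args) pair and calls it on load.
        # eval("6 * 7") is a harmless stand-in for arbitrary attacker code.
        return (eval, ("6 * 7",))


class RestrictedUnpickler(pickle.Unpickler):
    """Unpickler that refuses every global not on an explicit allowlist."""

    # Illustrative allowlist (an assumption for this sketch).
    ALLOWED = {("collections", "OrderedDict")}

    def find_class(self, module, name):
        if (module, name) in self.ALLOWED:
            return super().find_class(module, name)
        raise pickle.UnpicklingError(f"blocked global: {module}.{name}")


def safe_loads(data: bytes):
    """Deserialize data while rejecting non-allowlisted callables."""
    return RestrictedUnpickler(io.BytesIO(data)).load()


blob = pickle.dumps(MaliciousPayload())

# Plain unpickling runs the embedded callable and returns its result.
result = pickle.loads(blob)

# The restricted loader rejects the same bytes before anything is called.
try:
    safe_loads(blob)
    blocked = False
except pickle.UnpicklingError:
    blocked = True

# Plain containers of primitives reference no globals, so they still load.
weights = safe_loads(pickle.dumps({"layer.weight": [0.1, 0.2]}))
```

Here result is 42 because the embedded eval call ran during loading, while blocked is True for the restricted loader. In PyTorch, the comparable control is passing weights_only=True to torch.load, which the PyTorch documentation describes as restricting the unpickler to tensors, primitive types, and similar safe objects.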

CVSS breakdown

CVSS 3.1

Attack Vector: Network
Attack Complexity: High
Privileges Required: None
User Interaction: Required
Scope: Unchanged
Confidentiality: High
Integrity: High
Availability: High
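The entry lists the individual metrics but no numeric score. As a rough sketch, the base score can be derived from those metrics with the CVSS v3.1 base-score equations for the Scope Unchanged case. The function names and the WEIGHTS table layout below are hypothetical; the numeric weights are taken from the CVSS v3.1 specification, and the resulting score is computed here rather than quoted from the advisory.

```python
import math

# Metric weights from the CVSS v3.1 specification (Scope Unchanged only).
WEIGHTS = {
    "AV": {"Network": 0.85, "Adjacent": 0.62, "Local": 0.55, "Physical": 0.2},
    "AC": {"Low": 0.77, "High": 0.44},
    "PR": {"None": 0.85, "Low": 0.62, "High": 0.27},
    "UI": {"None": 0.85, "Required": 0.62},
    "CIA": {"High": 0.56, "Low": 0.22, "None": 0.0},
}


def roundup(value: float) -> float:
    """Round up to one decimal place, as defined in the v3.1 specification."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score_scope_unchanged(av, ac, pr, ui, c, i, a) -> float:
    """Compute the CVSS v3.1 base score when Scope is Unchanged."""
    cia = WEIGHTS["CIA"]
    iss = 1 - (1 - cia[c]) * (1 - cia[i]) * (1 - cia[a])
    impact = 6.42 * iss
    exploitability = (
        8.22 * WEIGHTS["AV"][av] * WEIGHTS["AC"][ac]
        * WEIGHTS["PR"][pr] * WEIGHTS["UI"][ui]
    )
    if impact <= 0:
        return 0.0
    return roundup(min(impact + exploitability, 10))


# Metrics as listed in this entry.
score = base_score_scope_unchanged(
    av="Network", ac="High", pr="None", ui="Required",
    c="High", i="High", a="High",
)
print(score)  # 7.5
```

A score of 7.5 falls in the High severity band (7.0 to 8.9) of the CVSS v3.1 qualitative rating scale.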

Affected products

vllm-project / vllm: versions < 0.7.0

References

VENDOR_ADVISORY: https://github.com/vllm-project/vllm/security/advisories/GHSA-rh4j-5rhw-hr54
PATCH: https://github.com/vllm-project/vllm/pull/12366
PATCH: https://github.com/vllm-project/vllm/commit/d3d6bb13fb62da3234addf6574922a4ec0513d04
MISC: https://pytorch.org/docs/stable/generated/torch.load.html