CVE-2026-54233

Description

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, vLLM's /v1/audio/transcriptions endpoint limits compressed upload size but not decoded PCM output. A 25MB OPUS file expands to ~14.9GB of float32 PCM at decode time. This vulnerability is fixed in 0.23.1rc0.

CVSS breakdown

CVSS 3.1

Attack Vector

Network

Attack Complexity

Low

Privileges Required

Low

User Interaction

None

Scope

Unchanged

Confidentiality

None

Integrity

None

Availability

High

Affected products

vllm-project / vllm< 0.23.1rc0 – < 0.23.1rc0

References

VENDOR_ADVISORYhttps://github.com/vllm-project/vllm/security/advisories/GHSA-6pr9-rp53-2pmc
PATCHhttps://github.com/vllm-project/vllm/pull/44970