Skip to content

Issue/1030: Nvidia 支持w4a16推理#1040

Open
qinyiqun wants to merge 2 commits intomainfrom
Issue/1030
Open

Issue/1030: Nvidia 支持w4a16推理#1040
qinyiqun wants to merge 2 commits intomainfrom
Issue/1030

Conversation

@qinyiqun
Copy link
Collaborator

@qinyiqun qinyiqun commented Mar 2, 2026

Support w4a16 fp16 inference.

@qinyiqun qinyiqun requested a review from a team March 2, 2026 08:07
@qinyiqun qinyiqun requested review from whjthu and wooway777 March 2, 2026 08:08
@qinyiqun qinyiqun linked an issue Mar 2, 2026 that may be closed by this pull request
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[DEV] Nvidia 支持 w4a16推理

1 participant