[AMD] Update Quark Quantization Pass for Quark 0.11 and VitisAI LLM Fusion Model Support by poganesh · Pull Request #2364 · microsoft/Olive

poganesh · 2026-03-22T02:06:16Z

Describe your changes

Updates the QuarkQuantization (torch) pass for Quark 0.11 API (from 0.10)
Adds full fusion optimization for LLM models where supported
Adds token fusion support for models where full fusion is not yet available
Adds GPT-OSS pre-quantized model support
Aligned with MS-AMD 3D release (2/17/26)

… Fusion Support

poganesh · 2026-03-23T17:44:27Z

@devang-ml, @xieofxie could you please help review this PR.

poganesh added 2 commits March 21, 2026 19:40

Update Quark Quantization (torch) Pass for Quark 0.11 and VitisAI LLM…

496873e

… Fusion Support

Update Quark Quantization (torch) Pass for Quark 0.11 and VitisAI LLM…

fd03688

… Fusion Support

poganesh changed the title ~~[AMD] Update QuarkQuantization Pass (torch) for Quark 0.11 and VitisAI LLM Fusion Model Support~~ [AMD] Update Quark Quantization Pass for Quark 0.11 and VitisAI LLM Fusion Model Support Mar 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMD] Update Quark Quantization Pass for Quark 0.11 and VitisAI LLM Fusion Model Support#2364

[AMD] Update Quark Quantization Pass for Quark 0.11 and VitisAI LLM Fusion Model Support#2364
poganesh wants to merge 2 commits intomicrosoft:mainfrom
poganesh:npu_fusion_use_ep_v2

poganesh commented Mar 22, 2026

Uh oh!

poganesh commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

poganesh commented Mar 22, 2026

Describe your changes

Uh oh!

poganesh commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant