Support weight sharing in QNN GPU by vjatoth-qti · Pull Request #2325 · microsoft/Olive

vjatoth-qti · 2026-02-08T18:17:55Z

Describe your changes

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

(Optional) Issue link

jambayk · 2026-04-09T16:45:45Z

changing to draft since there has been no update on this PR since feb

Co-authored-by: qti-mattsinc <mattsinc@qti.qualcomm.com>

qti-mattsinc · 2026-06-30T20:58:30Z

+                # filename matches what genai_config.json references.
                actual_output_dir = output_dir
-                model_file_name = "model"
+                model_file_name = Path(onnx_file_name).stem if has_additional_files and onnx_file_name else "model"


Have we tried using resave_model in update_llm_pipeline_genai_config_gpu analogously to how it's used in the NPU path, instead of changing this file?

github-advanced-security AI found potential problems Feb 10, 2026

View reviewed changes

Comment thread olive/passes/onnx/static_llm.py Fixed

Comment thread olive/passes/onnx/static_llm.py Fixed

jambayk reviewed Feb 11, 2026

View reviewed changes

Comment thread olive/passes/onnx/static_llm.py

vjatoth-qti force-pushed the dev/vjatoth-qti/qnn-gpu-weight-sharing branch from 0c0236a to 24f1374 Compare March 1, 2026 09:11

qti-mattsinc reviewed Mar 9, 2026

View reviewed changes

Comment thread olive/passes/onnx/static_llm.py

qti-mattsinc reviewed Mar 9, 2026

View reviewed changes

Comment thread olive/passes/onnx/context_binary.py Outdated

jambayk marked this pull request as draft April 9, 2026 16:45

unnim-qti force-pushed the dev/vjatoth-qti/qnn-gpu-weight-sharing branch from 24f1374 to b4fc7bb Compare April 12, 2026 22:14

vjatoth-qti marked this pull request as ready for review May 29, 2026 18:01

Copilot AI review requested due to automatic review settings May 29, 2026 18:01

qti-mattsinc reviewed Jun 1, 2026

View reviewed changes

Comment thread olive/passes/onnx/static_llm.py Outdated

Comment thread olive/passes/onnx/static_llm.py Outdated

unnim-qti force-pushed the dev/vjatoth-qti/qnn-gpu-weight-sharing branch from 9b2ce4d to e36e46b Compare June 5, 2026 10:08

Support weight sharing in QNN GPU

8c99b76

unnim-qti force-pushed the dev/vjatoth-qti/qnn-gpu-weight-sharing branch from e36e46b to 8c99b76 Compare June 6, 2026 21:23

unnim-qti added 2 commits June 7, 2026 23:49

Merge branch 'main' into dev/vjatoth-qti/qnn-gpu-weight-sharing

c7fcdf3

Merge branch 'main' into dev/vjatoth-qti/qnn-gpu-weight-sharing

b88fce0

qti-mattsinc reviewed Jun 26, 2026

View reviewed changes

Comment thread olive/passes/onnx/common.py Outdated

Add Dx12 shared memory allocator flag in genai config

904e9ec

Co-authored-by: qti-mattsinc <mattsinc@qti.qualcomm.com>

vjatoth-qti requested review from jambayk and qti-mattsinc June 30, 2026 13:46

qti-mattsinc reviewed Jun 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support weight sharing in QNN GPU#2325

Support weight sharing in QNN GPU#2325
vjatoth-qti wants to merge 4 commits into
microsoft:mainfrom
CodeLinaro:dev/vjatoth-qti/qnn-gpu-weight-sharing

vjatoth-qti commented Feb 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jambayk commented Apr 9, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qti-mattsinc Jun 30, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

vjatoth-qti commented Feb 8, 2026

Describe your changes

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jambayk commented Apr 9, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qti-mattsinc Jun 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

qti-mattsinc Jun 30, 2026 •

edited

Loading