[QNN-EP] Implement file mapped weights feature #26952

quic-calvnguy · 2026-01-09T02:13:54Z

Description
Enables file mapping of the weights as well as of the overall context binary (context bin). This feature is currently enabled only on Windows ARM64 devices.

Motivation and Context
Currently, when reading the context bin, ORT allocates a large buffer on the heap. When several ORT sessions use the same model, each session allocates its own buffer for the context bin, which wastes memory when the model is large. Windows file mapping can be used instead to map the context bin once. Whenever a context needs to be created from the context bin, the pointer to the mapped view is retrieved and used in place of a pre-allocated buffer, which makes QNN EP more memory-efficient. With multiple ORT sessions, the context bin is loaded only once for all sessions, reducing memory use and improving overall initialization performance. This is particularly useful for LLMs going forward.
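The sharing idea above can be sketched in a few lines. This is only an illustration using Python's standard `mmap` module, not the C++ code in this PR; `MappedFileCache` and `demo` are hypothetical names, and the temporary file stands in for a real context bin.

```python
import mmap
import os
import tempfile


class MappedFileCache:
    """Hypothetical cache: one read-only mapping per file, shared by all callers."""

    def __init__(self):
        self._mappings = {}  # absolute path -> (file object, mmap object)

    def get(self, path):
        key = os.path.abspath(path)
        if key not in self._mappings:
            f = open(key, "rb")
            # Length 0 maps the whole file; ACCESS_READ keeps the view read-only.
            view = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
            self._mappings[key] = (f, view)
        return self._mappings[key][1]

    def close(self):
        # Release every view before its backing file handle.
        for f, view in self._mappings.values():
            view.close()
            f.close()
        self._mappings.clear()


def demo():
    # Stand-in for a context bin; a real one would come from the QNN tooling.
    with tempfile.NamedTemporaryFile(delete=False, suffix=".bin") as tmp:
        tmp.write(b"fake-context-binary")
        path = tmp.name
    cache = MappedFileCache()
    try:
        first = cache.get(path)   # first "session" creates the mapping
        second = cache.get(path)  # second "session" reuses the same view
        return first is second, bytes(first[:4])
    finally:
        cache.close()
        os.remove(path)
```

Both lookups return the same mapped view, so no second copy of the file contents is allocated on the heap. The operating system pages the data in on demand.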

- Create file mapping callback interface class
- Android expected to have support in the future
- Implement Windows callbacks in `WindowsFileMapper`
- New option `disable_file_mapped_weights`
- Feature is enabled by default with retry logic
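The list above does not spell out the retry logic. One plausible reading is that a failed file-mapped load is retried with a regular heap buffer. The sketch below illustrates that pattern only, under that assumption. `load_context_binary` and `demo_fallback` are hypothetical helpers, and the `disable_file_mapped_weights` parameter merely mirrors the option name; how the option is actually passed to QNN EP is not shown in this description.

```python
import mmap
import os
import tempfile


def load_context_binary(path, disable_file_mapped_weights=False):
    """Return (buffer, mode), where mode is 'mapped' or 'heap'.

    Hypothetical helper: tries a read-only file mapping first and falls
    back to reading the whole file into memory if mapping fails.
    """
    if not disable_file_mapped_weights:
        try:
            with open(path, "rb") as f:
                # The mapping stays valid after f is closed.
                return mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ), "mapped"
        except (OSError, ValueError):
            pass  # Assumed behavior: retry without file mapping.
    with open(path, "rb") as f:
        return f.read(), "heap"


def demo_fallback():
    results = []
    # An empty file cannot be mapped, so the third case exercises the retry path.
    for payload, disabled in ((b"weights", False), (b"weights", True), (b"", False)):
        with tempfile.NamedTemporaryFile(delete=False) as tmp:
            tmp.write(payload)
            path = tmp.name
        buf, mode = load_context_binary(path, disabled)
        results.append((mode, bytes(buf[:])))
        if mode == "mapped":
            buf.close()  # Release the view before deleting the file.
        os.remove(path)
    return results
```

Callers receive a readable buffer either way, so the rest of the loading path does not need to know which mode was used.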

[QNN-EP] Implement file mapped weights feature

d57d067

