Support for bounded dynamic model #701
Force-pushed from 624ba6f to 010200c: Add support for output copy post ov infer to ORT for NPU with bounded dims
```cpp
ov_tensor_data.tensor_ptr = std::make_shared<ov::Tensor>(input_info.type, input_info.ov_shape.get_shape(),
                                                         const_cast<void*>(tensor.GetTensorRawData()));
if (is_cpu) {
  tensor_ptr = std::make_shared<ov::Tensor>(input_info.type, input_tensor_shape, (void*)tensor_data);
```
Fix this using a C++-style cast.
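The change the reviewer asks for can be sketched with a hypothetical helper (`AsMutableRaw` is illustrative, not an EP function). A `const_cast` states the intent of removing const-ness explicitly and is greppable, where the C-style `(void*)` cast hides it:

```cpp
// Hypothetical helper mirroring the reviewed line: wrap an ORT tensor's raw
// data for OpenVINO without a C-style cast. const_cast is still required
// because the source pointer is const, but the C++-style cast makes that
// explicit at the call site.
inline void* AsMutableRaw(const void* ort_raw_data) {
  return const_cast<void*>(ort_raw_data);
}
```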
Force-pushed from 010200c to 032d6db.
Pull Request Overview
This PR adds support for bounded dynamic models by introducing a new user option (reshape_input) to convert fully dynamic models into bounded dynamic ones for better efficiency. Key changes include:
- Enhancements to the OpenVINO provider factory and parser utilities to parse and validate the new reshape_input configuration.
- Updates to backends and session tests to handle dynamic input shape validation and reshaping.
- Minor formatting and logging improvements in various OpenVINO-related components.
Reviewed Changes
Copilot reviewed 16 out of 16 changed files in this pull request and generated 2 comments.
| File | Description |
| --- | --- |
| onnxruntime/test/providers/openvino/openvino_ep_context_test.cc | Validates error handling when a folder path is provided for ep_context_file_path. |
| onnxruntime/test/perftest/ort_test_session.cc | Adds support for the new "reshape_input" configuration in performance tests. |
| onnxruntime/python/tools/quantization/matmul_nbits_quantizer.py | Reformats conditions for readability without functional changes. |
| onnxruntime/core/providers/openvino/ov_interface.h | Adds necessary includes for dynamic shapes. |
| onnxruntime/core/providers/openvino/openvino_provider_factory.cc | Incorporates parsing logic for the new reshape_input option. |
| onnxruntime/core/providers/openvino/openvino_parser_utils.{h,cc} | Implements parsing of reshape_input using regular expressions and helper functions. |
| onnxruntime/core/providers/openvino/contexts.h | Introduces the reshape_t type to represent input reshaping settings. |
| onnxruntime/core/providers/openvino/backends/basic_backend.{h,cc} | Uses dynamic_flags to track static, fully dynamic, and bounded dynamic states and introduces dimension validation. |
| onnxruntime/core/providers/openvino/backend_utils.cc | Applies the reshape configuration to the OV model. |
| onnxruntime/core/providers/openvino/backend_manager.{h,cc} | Implements dynamic input validation and ensures reshape_input covers all dynamic inputs. |
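As a rough illustration of the regex-based parsing mentioned for openvino_parser_utils, here is a self-contained sketch. The bracketed `name[lo..hi,dim,...]` syntax, the `DimRange` type, and `ParseReshapeInput` are all assumptions for illustration, not the EP's actual grammar or helpers:

```cpp
#include <cstdint>
#include <map>
#include <regex>
#include <string>
#include <utility>
#include <vector>

// {min, max} bounds for one dimension; equal values denote a static dim.
using DimRange = std::pair<int64_t, int64_t>;

// Parse a spec like "data[1..8,3,224,224]" into per-input dimension bounds.
std::map<std::string, std::vector<DimRange>> ParseReshapeInput(const std::string& spec) {
  std::map<std::string, std::vector<DimRange>> result;
  static const std::regex entry_re(R"((\w+)\[([^\]]+)\])");   // name[dims]
  static const std::regex dim_re(R"((\d+)(?:\.\.(\d+))?)");   // "lo..hi" or "n"
  for (std::sregex_iterator it(spec.begin(), spec.end(), entry_re), end; it != end; ++it) {
    const std::string dims_str = (*it)[2].str();
    std::vector<DimRange> dims;
    for (std::sregex_iterator d(dims_str.begin(), dims_str.end(), dim_re), dend; d != dend; ++d) {
      int64_t lo = std::stoll((*d)[1].str());
      int64_t hi = (*d)[2].matched ? std::stoll((*d)[2].str()) : lo;
      dims.emplace_back(lo, hi);
    }
    result[(*it)[1].str()] = std::move(dims);
  }
  return result;
}
```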
Comments suppressed due to low confidence (1):

onnxruntime/test/perftest/ort_test_session.cc:826 — Consider adding test cases to verify that the new 'reshape_input' key is correctly handled and its effects are appropriately validated in performance tests.

```cpp
ov_options[key] = value;
```
```cpp
int64_t min_dim = ov_dim.get_min_length();
int64_t max_dim = ov_dim.get_max_length();
if (ort_dim < min_dim || ort_dim > max_dim) {
  ORT_THROW(" ORT Dimension is out of range");
```
Enhance the error message in ValidateOrtDimsAgainstPartialShape to include additional context (such as the dimension index and expected range) to aid debugging.
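A minimal sketch of the richer message the suggestion describes, assuming the caller can pass the dimension index. `CheckDimInRange` and the use of `std::runtime_error` are illustrative stand-ins; the real code throws via `ORT_THROW`:

```cpp
#include <cstdint>
#include <sstream>
#include <stdexcept>

// Validate one ORT dimension against the OV partial-shape bounds, reporting
// the dimension index, the offending value, and the expected range.
void CheckDimInRange(size_t dim_index, int64_t ort_dim, int64_t min_dim, int64_t max_dim) {
  if (ort_dim < min_dim || ort_dim > max_dim) {
    std::ostringstream oss;
    oss << "ORT dimension " << dim_index << " (value " << ort_dim
        << ") is out of range [" << min_dim << ", " << max_dim << "]";
    throw std::runtime_error(oss.str());
  }
}
```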
```diff
@@ -146,6 +146,10 @@ CreateOVModel(std::string&& model,
   try {
     auto ov_model = OVCore::Get()->ReadModel(std::move(model), session_context.onnx_model_path_name.string());

+    if (!session_context.reshape.empty()) {
+      LOGS_DEFAULT(INFO) << log_tag << "Reshaping the ov tensor to specified shape";
```
[nitpick] Consider logging the specific reshape details (e.g., target shape values) to provide clearer insight during debugging and validation of the reshaping process.
```diff
- LOGS_DEFAULT(INFO) << log_tag << "Reshaping the ov tensor to specified shape";
+ LOGS_DEFAULT(INFO) << log_tag << "Reshaping the ov tensor to specified shape: " << session_context.reshape;
```
Approved
@preetha-intel @jatinwadhwa921 please fix the Internal CI.
@vthaniel please approve from your end.
@preetha-intel @sfatimar @jatinwadhwa921 All unit tests pass. Internal CI building fine.
* Refactored the code for the reshape feature
* Refactor the inference logic accommodating bounded dimensions
* Fix lint issues
* Refactor OV shapes classification to be a part of the bindings struct
* Refactor the provider options key verification for the python interface
* Restrict removal of model proto when CPU fallback is enabled and fix unit test failures

Co-authored-by: jatinwadhwa921 <[email protected]>
```diff
  // Unified OV compile_model is efficient when ov model caching is enabled
  // Unified OV compile_model API is supported with AUTO from version 2024.3 and above
  // Inputs with static dimensions
  // Not enabled for models with external weights and when ep context is set.
  const std::string model = model_proto->SerializeAsString();
- // we have the serialized string, so we can release model proto to lower the peak memory consumption
- model_proto.reset();
+ if (disable_cpu_fallback) model_proto.reset();
```
@preetha-intel Wouldn't this introduce a compile-time memory regression when CPU fallback is not disabled and the model fully compiles on NPU?
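The trade-off behind this question can be sketched as follows; `FakeModelProto` and `SerializeAndMaybeRelease` are simplified stand-ins for illustration, not the EP's actual types. Because `SerializeAsString()` copies the model bytes, keeping the proto alive (fallback enabled) means both copies coexist and peak memory roughly doubles:

```cpp
#include <memory>
#include <string>

// Simplified stand-in for the protobuf ModelProto used in the diff above.
struct FakeModelProto {
  std::string bytes;
  std::string SerializeAsString() const { return bytes; }  // copies the model
};

// Serialize the model; release the proto only when CPU fallback cannot need
// it again. When fallback stays enabled, the proto and the serialized string
// coexist, which is the peak-memory concern raised in the review.
std::string SerializeAndMaybeRelease(std::unique_ptr<FakeModelProto>& model_proto,
                                     bool disable_cpu_fallback) {
  std::string model = model_proto->SerializeAsString();
  if (disable_cpu_fallback) model_proto.reset();
  return model;
}
```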
Description
Introduce a user option to convert a fully dynamic model into a bounded dynamic model for efficiency.