[TorchFX] Use torchao for quantize_pt2e API when possible #3588
base: develop
Conversation
return PassResult(graph_module, True)


def get_device(module: torch.nn.Module) -> torch.device:
Please reuse the existing helper at line 416 in cc935e4:

def get_model_device(model: torch.nn.Module) -> torch.device:
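For reference, a minimal sketch of what this helper plausibly does (an assumption; the actual nncf implementation may differ, e.g. in how parameter-less models are handled):

def get_model_device(model: torch.nn.Module) -> torch.device:
    # Use the device of the first parameter; assume CPU for
    # parameter-less models.
    try:
        return next(model.parameters()).device
    except StopIteration:
        return torch.device("cpu")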
:param quant_min: Minimum quant value.
:type quant_min: int
:param quant_max: Maximum quant value.
:type quant_max: int
:param scale: Defines the scale factor used for quantization.
:type scale: torch.Tensor
:param zero_point: Specifies the quantized value to which 0 in floating point maps to.
:type zero_point: torch.Tensor
:param is_per_channel: Whether quantization is applied per channel.
:type is_per_channel: bool
:param ch_axis: Channel axis used for per-channel quantization.
:type ch_axis: int
Suggested change (drop the :type fields):

:param quant_min: Minimum quant value.
:param quant_max: Maximum quant value.
:param scale: Defines the scale factor used for quantization.
:param zero_point: Specifies the quantized value to which 0 in floating point maps to.
:param is_per_channel: Whether quantization is applied per channel.
:param ch_axis: Channel axis used for per-channel quantization.
:type in docstrings is used only for API objects.
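These parameters match the TorchQDQParameters structure this PR introduces. A minimal sketch, assuming a plain dataclass (the actual definition in src/nncf/experimental/torch/fx/quantization/qdq_parameters.py may differ), which also illustrates the :param-only docstring convention for internal objects:

from dataclasses import dataclass

import torch


@dataclass
class TorchQDQParameters:
    """
    Parameters of a quantize-dequantize node pair.

    :param quant_min: Minimum quant value.
    :param quant_max: Maximum quant value.
    :param scale: Defines the scale factor used for quantization.
    :param zero_point: Specifies the quantized value to which 0 in floating point maps to.
    :param is_per_channel: Whether quantization is applied per channel.
    :param ch_axis: Channel axis used for per-channel quantization.
    """

    quant_min: int
    quant_max: int
    scale: torch.Tensor
    zero_point: torch.Tensor
    is_per_channel: bool
    ch_axis: int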
return named_param.device


def create_getattr_from_value(module: torch.nn.Module, graph: torch.fx.Graph, prefix: str, value: Any) -> torch.fx.Node:
I could not find a case where value is not a torch.Tensor; is Any really needed here?
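If that observation holds, the annotation could be narrowed to torch.Tensor, for example (a sketch; only the annotation changes):

def create_getattr_from_value(
    module: torch.nn.Module, graph: torch.fx.Graph, prefix: str, value: torch.Tensor
) -> torch.fx.Node: ...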
""" | ||
|
||
def get_new_attr_name(module: torch.nn.Module, prefix: str): | ||
def get_attr_name(i: int): |
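A helper like this typically probes prefix0, prefix1, ... until a free attribute name is found. A sketch of the likely body, assuming it mirrors the torch.ao get_new_attr_name_with_prefix helper it appears to be adapted from:

def get_new_attr_name(module: torch.nn.Module, prefix: str):
    def get_attr_name(i: int):
        return prefix + str(i)

    # Increment the suffix until the name is not taken on the module.
    i = 0
    attr_name = get_attr_name(i)
    while hasattr(module, attr_name):
        i += 1
        attr_name = get_attr_name(i)
    return attr_name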
Context

The torch.ao directory is being moved to a separate repo, torchao, and the legacy torch.ao implementation was deprecated in the latest release of PyTorch (see details here). The solution on our side is to:

- deprecate OpenVINOQuantizer in nncf, leaving only the ExecuTorch implementation
- deprecate the nncf.quantize TorchFX backend
- introduce a torchao dependency for the quantize_pt2e API, or remove all dependencies on torch.ao from quantize_pt2e and torch_ao_adapter as well

This PR does not achieve that goal, but it makes the necessary first steps towards it.
Changes

- OpenVINOQuantizer, TorchAOQuantizerAdapter and quantize_pt2e now use torchao classes whenever possible, via a conditional import (see the sketch after this list)
- torch_fx_MinMaxBackend and the TorchFX transformations no longer use the torch.ao FakeQuantize class. Instead, a TorchQDQParameters structure is introduced in src/nncf/experimental/torch/fx/quantization/qdq_parameters.py
- The transformations.py dependency on torch.ao is resolved by moving the _fuse_conv_bn_ import to other files and moving the create_getattr_from_value function into the nncf transformations.py file
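A sketch of the conditional-import pattern described above; the exact module paths are assumptions, since the torchao quantizer location has changed between releases:

try:
    # Prefer the torchao implementation when it is available.
    from torchao.quantization.pt2e.quantizer import Quantizer
except ImportError:
    # Fall back to the deprecated torch.ao implementation.
    from torch.ao.quantization.quantizer import Quantizer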
Reason for changes

- To enable OpenVINOQuantizer from ExecuTorch in quantize_pt2e (see the usage sketch after this list)
- To remove the dependency on torch.ao from transformations.py
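For illustration, a sketch of how quantize_pt2e is typically called with OpenVINOQuantizer; the import paths follow nncf's experimental TorchFX API, and the toy model, example input, and dataset are assumptions:

import torch

import nncf
from nncf.experimental.torch.fx import OpenVINOQuantizer, quantize_pt2e

# Capture the model with torch.export, then quantize with the OpenVINO quantizer.
model = torch.nn.Sequential(torch.nn.Linear(4, 4), torch.nn.ReLU())
example_input = torch.ones(1, 4)
captured = torch.export.export(model, (example_input,)).module()

calibration_dataset = nncf.Dataset([example_input])
quantized = quantize_pt2e(captured, OpenVINOQuantizer(), calibration_dataset)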
Related tickets

- 170072
Tests

- test_openvino_quantizer_with_torch_ao_convert_pt2e is enabled only for the torchao implementation