Don't use RawReducer for activation shape collection in Fast Bias Correction #3642

nikita-savelyevv · 2025-08-28T11:30:49Z

Changes

As in the title.

Reason for changes

This PR reduces the memory footprint of the Fast Bias Correction algorithm. Collecting raw activations is not required to obtain their shapes, so avoiding raw reducers saves the memory that would otherwise be allocated for the activations.

Example quantization run on vision encoder from OpenGVLab/InternVL2-1B with 4 calibration data samples:

[Before / After comparison images]

Since there is no need to allocate so much memory, statistics collection time also improves.
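The reasoning above can be illustrated with a minimal, standard-library-only sketch. It does not use NNCF's reducer classes: `Activation`, `raw_reduce`, `shape_reduce`, `collect`, and `peak_bytes` are hypothetical names introduced here, and the tensor sizes are arbitrary. The sketch assumes that a "raw" reducer hands the whole activation to the collector, while a shape-only reducer hands over just a tuple.

```python
import tracemalloc
from dataclasses import dataclass
from math import prod


@dataclass
class Activation:
    """Hypothetical stand-in for a framework tensor: a shape plus its raw buffer."""
    shape: tuple
    data: bytearray


def make_activation(shape):
    # Assume 4 bytes per element (fp32) for the illustrative buffer.
    return Activation(tuple(shape), bytearray(4 * prod(shape)))


def raw_reduce(x):
    # Returns the activation itself, so the collector keeps every buffer alive.
    return x


def shape_reduce(x):
    # Returns only the shape; the buffer can be freed once this call returns.
    return x.shape


def collect(reducer, shapes):
    # One activation per calibration sample; only the reducer output is stored.
    return [reducer(make_activation(s)) for s in shapes]


def peak_bytes(reducer, shapes):
    # Peak traced allocation while collecting, as reported by tracemalloc.
    tracemalloc.start()
    try:
        collected = collect(reducer, shapes)
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return collected, peak


SHAPES = [(1, 3, 64, 64)] * 4  # four calibration samples, arbitrary sizes

raw_stats, raw_peak = peak_bytes(raw_reduce, SHAPES)
shape_stats, shape_peak = peak_bytes(shape_reduce, SHAPES)

print("shapes match:", [a.shape for a in raw_stats] == shape_stats)
print("raw peak bytes:", raw_peak, "shape-only peak bytes:", shape_peak)
```

Both variants recover the same shapes, but the raw variant keeps all sample buffers in memory at once, so its peak grows with the number of calibration samples, while the shape-only variant needs roughly one buffer at a time.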

Related tickets

172800

Tests

Existing tests cover the new changes.

https://ci-adas-icv.iotg.sclab.intel.com/view/all/job/NNCF/job/manual/job/post_training_quantization/714/artifact/results.html

src/nncf/experimental/common/tensor_statistics/statistics.py

src/nncf/version.py

tests/common/experimental/test_reducers_and_aggregators.py

tests/common/experimental/test_statistic_collector.py

src/nncf/experimental/common/tensor_statistics/collectors.py

Co-authored-by: Daniil Lyakhov <[email protected]>

nikita-savelyevv added 3 commits August 28, 2025 13:30

Initial commit

04a05b5

Fix condition

2664489

Make out of place reducer return tensor

3d319cf

github-actions bot added the NNCF Common label Aug 28, 2025

nikita-savelyevv added 2 commits August 28, 2025 16:37

Fix test

4a87253

Patch on an object level

9ae9cb2

nikita-savelyevv marked this pull request as ready for review August 28, 2025 15:28

nikita-savelyevv requested a review from a team as a code owner August 28, 2025 15:28

nikita-savelyevv requested a review from daniil-lyakhov August 28, 2025 15:28

Apply suggested changes

9fc3caa

daniil-lyakhov reviewed Sep 1, 2025

nikita-savelyevv and others added 8 commits September 1, 2025 17:59

Fix

197d5e7

Apply changes

ebab265

Another simplification

8695fd1

Use tolist instead of to numpy + to list

5a29e07

[Numerics] tolist tensor method

b9ae8a9

Merge pull request #2 from daniil-lyakhov/dl/tolist

ec28e37

[Numerics] tolist tensor method

Expand docstring

fd8dcdf

Add test cases

76a3888

daniil-lyakhov reviewed Sep 2, 2025

src/nncf/experimental/common/tensor_statistics/collectors.py (outdated, resolved)

daniil-lyakhov approved these changes Sep 2, 2025

Update src/nncf/experimental/common/tensor_statistics/collectors.py

6960eec

Co-authored-by: Daniil Lyakhov <[email protected]>

nikita-savelyevv requested a review from AlexanderDokuchaev September 2, 2025 13:49

nikita-savelyevv mentioned this pull request Sep 3, 2025

[OV] Fix high memory consumption during vision encoder quantization huggingface/optimum-intel#1440

Merged

3 tasks
