Use `find` for `search_n` when n=1 #5346

AlexGuteniev · 2025-03-18T19:18:45Z

📜 The optimization

There are two implementations of search_n — in std and in std::ranges. For bidirectional iterators, both implementations take advantage of the contiguous range to search for. They jump forward by the value of n and try to match from the end. This allows skipping some comparisons. When there are more mismatches than matches, it ends up in fast pass over the range and few comparisons.

This means than for large values of n and non-pathological input, the algorithm is not even likely to benefit from vectorization.

For small values of n, however, the algorithm performs worse.

The worst case is n=1, where the algortihm is just find with extra steps. The PR forwards this case directly to find, where it may pick the vectorization or memchr, and even if it doesn't, it would still stop looking into doing extra steps.

⚖️ Predicate check

Unlike many other algorithms, such as find, the search_n algorithm takes both value and predicate. We want to forward to predicate-less find, as we're trying to engage vectorization, so we can do this when seeing the default equal_to predicate. Binding the value and the predicate into a bigger predicate and passing that to find_if would work for more cases, but would not be (manually) vectorized.

Since the value type and iterator type are unrelated, the comparison is potentially heterogenous, so it is hard to verify if non-void specialization of std::equal_to<T> does the same as default comparison, or not. We'll skip that, and check just for std::equal_to<void> and ranges::equal_to.

✅ Test coverage

There's no attempt of comprehensive coverage of std::search_n 🙀. Just some ad-hoc tests, mostly negative. Creating one seems out of scope for this PR. The n=1 case seems to be covered indirectly via P0024R2_parallel_algorithms_search_n test, along with many other cases.

For ranges::search_n there's a pre-existing test that does at least some minimum coverage, expanded that with n=1 case.

⏱️Benchmark results

Benchmark	Before	After
bm<uint8_t, AlgType::Std>/3000	525 ns	17.5 ns
bm<uint8_t, AlgType::Rng>/3000	995 ns	17.5 ns
bm<uint16_t, AlgType::Std>/3000	587 ns	40.0 ns
bm<uint16_t, AlgType::Rng>/3000	1506 ns	38.8 ns
bm<uint32_t, AlgType::Std>/3000	582 ns	67.8 ns
bm<uint32_t, AlgType::Rng>/3000	1500 ns	68.5 ns
bm<uint64_t, AlgType::Std>/3000	571 ns	146 ns
bm<uint64_t, AlgType::Rng>/3000	1466 ns	147 ns

benchmarks/src/search_n.cpp

StephanTLavavej · 2025-03-20T00:42:31Z

Thanks for the detailed PR description and significant optimization! 💚 I pushed very minor nitpicks to the benchmark.

StephanTLavavej · 2025-03-21T14:19:42Z

I'm mirroring this to the MSVC-internal repo - please notify me if any further changes are pushed.

StephanTLavavej · 2025-03-24T23:38:48Z

Thanks for finding this optimization opportunity! 😹 🕵️ 🎉

Use find for search_n when n=1

9043ffe

AlexGuteniev requested a review from a team as a code owner March 18, 2025 19:18

github-project-automation bot added this to STL Code Reviews Mar 18, 2025

github-project-automation bot moved this to Initial Review in STL Code Reviews Mar 18, 2025

StephanTLavavej added the performance Must go faster label Mar 18, 2025

Actually test predicate-less unit needle

7af139a

StephanTLavavej self-assigned this Mar 18, 2025

StephanTLavavej added 3 commits March 19, 2025 17:20

Fix comment typos.

74116a2

Avoid shadowing: count => N

d3766ea

Remove unused <limits>.

2cb7d1c

StephanTLavavej reviewed Mar 20, 2025

View reviewed changes

benchmarks/src/search_n.cpp Outdated Show resolved Hide resolved

benchmarks/src/search_n.cpp Outdated Show resolved Hide resolved

benchmarks/src/search_n.cpp Outdated Show resolved Hide resolved

StephanTLavavej approved these changes Mar 20, 2025

View reviewed changes

StephanTLavavej removed their assignment Mar 20, 2025

StephanTLavavej moved this from Initial Review to Ready To Merge in STL Code Reviews Mar 20, 2025

StephanTLavavej mentioned this pull request Mar 20, 2025

Maintainer priorities #4700

Open

StephanTLavavej moved this from Ready To Merge to Merging in STL Code Reviews Mar 21, 2025

StephanTLavavej self-assigned this Mar 21, 2025

StephanTLavavej added a commit to StephanTLavavej/STL that referenced this pull request Mar 21, 2025

microsoftGH-5346

5fc17e0

AlexGuteniev mentioned this pull request Mar 22, 2025

Vectorize search_n for small values of n #5352

Merged

StephanTLavavej merged commit 0a0514c into microsoft:main Mar 24, 2025
39 checks passed

github-project-automation bot moved this from Merging to Done in STL Code Reviews Mar 24, 2025

AlexGuteniev deleted the n-equals-one branch March 25, 2025 05:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use `find` for `search_n` when n=1 #5346

Use `find` for `search_n` when n=1 #5346

AlexGuteniev commented Mar 18, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

StephanTLavavej commented Mar 20, 2025

Uh oh!

StephanTLavavej commented Mar 21, 2025

Uh oh!

Uh oh!

StephanTLavavej commented Mar 24, 2025

Uh oh!

Uh oh!

Use find for search_n when n=1 #5346

Use find for search_n when n=1 #5346

Conversation

AlexGuteniev commented Mar 18, 2025

📜 The optimization

⚖️ Predicate check

✅ Test coverage

⏱️Benchmark results

Uh oh!

Uh oh!

Uh oh!

Uh oh!

StephanTLavavej commented Mar 20, 2025

Uh oh!

StephanTLavavej commented Mar 21, 2025

Uh oh!

Uh oh!

StephanTLavavej commented Mar 24, 2025

Uh oh!

Uh oh!

Use `find` for `search_n` when n=1 #5346

Use `find` for `search_n` when n=1 #5346