Improve quantile performance v2 #91

bkamins · 2021-10-17T10:18:09Z

This is an alternative implementation to #86.

Here, following #86 (comment), I perform the partial sorting incrementally.

I am making this a separate PR so that the two implementations can be compared easily. Only one of them should be merged.

src/Statistics.jl

codecov · 2021-10-17T10:37:48Z

Codecov Report

Merging #91 (fb4c8d8) into master (74897fe) will increase coverage by 0.02%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master      #91      +/-   ##
==========================================
+ Coverage   96.89%   96.91%   +0.02%     
==========================================
  Files           1        1              
  Lines         419      422       +3     
==========================================
+ Hits          406      409       +3     
  Misses         13       13

Impacted Files	Coverage Δ
src/Statistics.jl	`96.91% <100.00%> (+0.02%)`	⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 74897fe...fb4c8d8.

bkamins · 2021-10-17T11:06:40Z

The only drawback of this approach arises when very many quantiles are requested, since we sort p and then perform many partial sorts. Maybe we should put a threshold on the length of p and fall back to the old algorithm when p is long?

nalimilan · 2021-10-17T13:32:20Z

Thanks. Is there any reason to think that a series of partial sorts of nested subsets of the data would be significantly slower than a single full sort? Have you tried benchmarking this?

bkamins · 2021-10-17T17:19:07Z

> Is there any reason to think that a series of partial sorts of nested subsets of the data would be significantly slower than a single full sort?

We have to sort p, so if its length is comparable to the length of the vector we are analyzing, we essentially have to perform the sorting twice. I will try to benchmark this and post the results.

bkamins · 2021-10-17T17:25:06Z

Here are the benchmarks:

julia> function f1(n)
       x = rand(10^6); x2 = copy(x)
       p = rand(n)
       @time Statistics._quantilesort!(x, false, extrema(p)...)
       @time new_quantilesort!(x, false, p)
       nothing
       end
f1 (generic function with 1 method)

julia> f1(5)
  0.104513 seconds
  0.018321 seconds (1 allocation: 128 bytes)

julia> f1(5)
  0.075638 seconds
  0.015375 seconds (1 allocation: 128 bytes)

julia> f1(5)
  0.100432 seconds
  0.018790 seconds (1 allocation: 128 bytes)

julia> f1(50)
  0.099497 seconds
  0.130415 seconds (1 allocation: 496 bytes)

julia> f1(50)
  0.101801 seconds
  0.106628 seconds (1 allocation: 496 bytes)

julia> f1(50)
  0.109406 seconds
  0.105568 seconds (1 allocation: 496 bytes)

julia> f1(500)
  0.112124 seconds
  1.050222 seconds (1 allocation: 4.062 KiB)

julia> f1(500)
  0.106582 seconds
  1.027544 seconds (1 allocation: 4.062 KiB)

So, as you can see, the new implementation deteriorates much faster as the number of requested quantiles grows.

(As usual, it would not hurt if you double-checked this when you have time, as I might have made an error here.)

nalimilan

I realize I had two comments pending...

nalimilan · 2021-10-17T13:27:36Z

src/Statistics.jl

-        sort!(v, 1, lv, Base.Sort.PartialQuickSort(lo:hi), Base.Sort.Forward)
+        start = 1
+        for pv in sort(p)
+            lv = length(v)


Move this out of the loop? BTW, better use lastindex even if we call require_one_based_indexing(v).

nalimilan · 2021-10-17T13:31:13Z

src/Statistics.jl

+            lo = floor(Int,pv*(lv))
+            hi = ceil(Int,1+pv*(lv))
+            sort!(v, start, lv, Base.Sort.PartialQuickSort(lo:hi), Base.Sort.Forward)
+            start = hi + 1


Are you completely sure of the +1? Is that still correct if p contains duplicates? That would be worth testing.
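
To make the duplicate question concrete, here is a minimal sketch of an incremental partial sort that skips windows already covered by an earlier quantile. It is not the PR's `new_quantilesort!`: the name `incremental_partialsort!` is hypothetical, and it uses the public `partialsort!` on a view in place of the internal `Base.Sort.PartialQuickSort` call from the diff. It also assumes that a partial sort leaves all smaller elements to the left of the sorted window, which holds for quickselect-style algorithms but is an assumption here.

```julia
# Hypothetical helper, not the PR's `new_quantilesort!`.
# Assumes `v` uses one-based indexing and every probability lies in [0, 1].
function incremental_partialsort!(v::AbstractVector, p)
    firstindex(v) == 1 || throw(ArgumentError("one-based indexing required"))
    lv = lastindex(v)          # computed once, outside the loop
    start = 1                  # assumption: v[1:start-1] holds the smallest start-1 values
    for pv in sort!(collect(p))
        # Clamp the window: never below `start`, never past the end.
        lo = max(floor(Int, pv * lv), start)
        hi = min(ceil(Int, 1 + pv * lv), lv)
        lo > hi && continue    # window already covered, e.g. a duplicate probability
        # Sort only the window, with indices relative to the remaining tail.
        partialsort!(view(v, start:lv), (lo - start + 1):(hi - start + 1))
        start = hi + 1
    end
    return v
end

x = rand(1_000)
p = [0.5, 0.1, 0.5, 0.9, 0.0, 1.0, 0.1]   # unsorted, with duplicates and both endpoints
v = incremental_partialsort!(copy(x), p)
```

Comparing `v[lo:hi]` with the same slice of `sort(x)` for each probability is one way to test the duplicate and endpoint cases.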

nalimilan · 2024-09-06T17:13:00Z

I'm not sure it's worth worrying about performance when the number of quantiles is large compared to the data. Quantiles don't make a lot of sense in that case.

Maybe a simple optimization is to do sort_p = issorted(p) ? p : sort(p) to avoid making a copy if possible (as that will be the case almost all the time).
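
A small sketch of that suggestion, wrapped in a hypothetical helper named `sorted_probs` (the name is an assumption, not part of the package). When `p` is already sorted, the same array is returned and no copy is made:

```julia
# Hypothetical helper: return `p` itself when it is already sorted,
# otherwise a sorted copy. The caller's `p` is never mutated.
sorted_probs(p::AbstractVector) = issorted(p) ? p : sort(p)

p_sorted   = [0.25, 0.5, 0.75]
p_unsorted = [0.75, 0.25, 0.5]

a = sorted_probs(p_sorted)     # same object as p_sorted
b = sorted_probs(p_unsorted)   # new, sorted vector
```

Because the sorted case returns the input itself, the result must be treated as read-only by the code that consumes it.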

nalimilan · 2025-02-22T12:32:59Z

Actually, with the (semi-)recent improvements to sorting performance, partial sort no longer seems to be a good idea: it uses quicksort, while e.g. radix sort is much faster for standard numeric types. Even for BigFloat (f2 below), sorting everything seems much faster.

Maybe we should just switch to a full sort?
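
For illustration, a minimal sketch of what the full-sort route could look like from the outside, using only the documented `sorted=true` keyword of `quantile`. The wrapper name `quantile_fullsort` is hypothetical, and this says nothing about how the package would implement it internally.

```julia
using Statistics

# Hypothetical wrapper: fully sort a copy of the data, then let `quantile`
# skip its own sorting step via `sorted=true`.
function quantile_fullsort(x::AbstractVector, p)
    v = sort(x)                          # full sort; `x` itself is left untouched
    return quantile(v, p; sorted=true)
end

x = rand(1_000)
p = [0.1, 0.5, 0.9]
q = quantile_fullsort(x, p)
```

The result should agree with `quantile(x, p)`, since both read the same order statistics.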

I'm a bit surprised that the implementation from this PR is so slow in my benchmark. It would be good to double-check the result. (I think yours had a bug because it reused x after sorting it.)

julia> function f1(n)
           @btime Statistics._quantilesort!($(rand(10^6)), false, extrema($(rand(n)))...)
           @btime new_quantilesort!($(rand(10^6)), false, $(rand(n)))
           @btime sort!($(rand(10^6)))
           nothing
       end
f1 (generic function with 1 method)

julia> f1(5)
  8.901 ms (0 allocations: 0 bytes)
  6.912 ms (2 allocations: 96 bytes)
  2.568 ms (0 allocations: 0 bytes)

julia> f1(50)
  18.857 ms (0 allocations: 0 bytes)
  49.713 ms (2 allocations: 480 bytes)
  1.840 ms (0 allocations: 0 bytes)

julia> f1(500)
  19.029 ms (0 allocations: 0 bytes)
  520.530 ms (8 allocations: 9.09 KiB)
  2.607 ms (0 allocations: 0 bytes)

julia> function f2(n)
           @btime Statistics._quantilesort!($(big.(rand(10^6))), false, extrema($(rand(n)))...)
           @btime new_quantilesort!($(big.(rand(10^6))), false, $(rand(n)))
           @btime sort!($(big.(rand(10^6))))
           nothing
       end
f2 (generic function with 1 method)

julia> f2(5)
  538.816 ms (0 allocations: 0 bytes)
  351.436 ms (2 allocations: 96 bytes)
  77.425 ms (0 allocations: 0 bytes)

julia> f2(50)
  861.641 ms (0 allocations: 0 bytes)
  8.368 s (2 allocations: 480 bytes)
  68.657 ms (0 allocations: 0 bytes)

julia> f2(500)
  776.346 ms (0 allocations: 0 bytes)
  51.798 s (8 allocations: 9.09 KiB)
  84.172 ms (0 allocations: 0 bytes)
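
On the input-reuse concern raised above: with in-place functions, any run after the first sees data that an earlier run has already reordered, unless each run gets a fresh copy. Below is a rough sketch using only Base; the helper name `time_on_fresh_copies` is hypothetical, and taking the minimum over a few repetitions is just a crude way to discount compilation time. With BenchmarkTools, the equivalent idea is to pass `setup=(y = copy($x))` together with `evals=1`.

```julia
# Hypothetical timing helper: run `f!` on a fresh copy of `x` each time,
# so no run sees data already reordered by an earlier one.
function time_on_fresh_copies(f!, x::AbstractVector; reps::Integer = 3)
    best = Inf
    for _ in 1:reps
        y = copy(x)              # the copy is made outside the timed region
        t = @elapsed f!(y)
        best = min(best, t)      # the first run may include compilation time
    end
    return best
end

x = rand(10^5)
t_full = time_on_fresh_copies(sort!, x)
t_part = time_on_fresh_copies(v -> partialsort!(v, 1:100), x)
```

The original `x` is never mutated, so both timings start from the same unsorted input.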

Improve quantile performance v2

b6ceb83

bkamins mentioned this pull request Oct 17, 2021

Improve quantile performance #86

Open

Update src/Statistics.jl

7943e97

Update src/Statistics.jl

af561f2

Update src/Statistics.jl

6e233de

bkamins added 2 commits October 17, 2021 12:51

Update runtests.jl

d84deda

Update runtests.jl

fb4c8d8

nalimilan mentioned this pull request Dec 10, 2023

Median (and presumably all quantile computation) could be much faster for large inputs #154

Open

nalimilan mentioned this pull request Sep 6, 2024

performance of median and iqr compared to python libraries JuliaData/DataFrames.jl#3462

Open
