
The logpdf from ExponentialFamily.jl is slower than from Distributions.jl #146

Closed
Nimrais opened this issue Nov 17, 2023 · 6 comments

Comments

@Nimrais
Member

Nimrais commented Nov 17, 2023

Here is an example that evaluates the multivariate normal log probability density function (logpdf):

using ExponentialFamily
using StableRNGs
using LinearAlgebra
using Distributions
using BenchmarkTools

rng = StableRNG(42)
n = 4
golden_μ = randn(rng, n)
L = randn(rng, n, n)
golden_Σ = L * L' + Matrix{Float64}(I, n, n)

golden_normal = Distributions.MvNormal(golden_μ, golden_Σ);
ef_mv_normal = ExponentialFamily.MvNormalMeanCovariance(golden_μ, golden_Σ)
ef = convert(ExponentialFamily.ExponentialFamilyDistribution, ef_mv_normal)

samples = rand(golden_normal, 1000);

@benchmark logpdf($golden_normal, $samples)
@benchmark logpdf($ef_mv_normal, $samples)

hybridlinearize(matrix::AbstractMatrix) = eachcol(matrix)

@benchmark map((x) -> logpdf($(ef), x), $(hybridlinearize(samples)))

The results:

# logpdf(golden_normal, samples), Distributions.MvNormal:
BenchmarkTools.Trial: 10000 samples with 1 evaluation.
 Range (min … max):  125.000 μs …  1.147 ms  ┊ GC (min … max): 0.00% … 84.03%
 Time  (median):     140.000 μs              ┊ GC (median):    0.00%
 Time  (mean ± σ):   146.581 μs ± 62.614 μs  ┊ GC (mean ± σ):  2.80% ±  5.73%
 Memory estimate: 211.44 KiB, allocs estimate: 3003.

# logpdf(ef_mv_normal, samples), ExponentialFamily.MvNormalMeanCovariance:
BenchmarkTools.Trial: 10000 samples with 1 evaluation.
 Range (min … max):  287.125 μs … 945.667 μs  ┊ GC (min … max): 0.00% … 61.13%
 Time  (median):     299.459 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   306.335 μs ±  57.800 μs  ┊ GC (mean ± σ):  1.88% ±  6.45%

# map over samples, ExponentialFamily.ExponentialFamilyDistribution:
BenchmarkTools.Trial: 2539 samples with 1 evaluation.
 Range (min … max):  1.793 ms …   4.692 ms  ┊ GC (min … max): 0.00% … 59.40%
 Time  (median):     1.833 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   1.968 ms ± 566.244 μs  ┊ GC (mean ± σ):  6.32% ± 12.40%

Clearly, the slow result for ExponentialFamily.ExponentialFamilyDistribution comes from re-evaluating the log partition for each sample; shouldn't that be avoided? But even plain ExponentialFamily.MvNormalMeanCovariance is considerably slower than Distributions.MvNormal.

@bvdmitri bvdmitri changed the title The logpdf inside ExponentialFamily.jl is slover then inside Distributions.jl The logpdf from ExponentialFamily.jl is slower then from Distributions.jl Nov 17, 2023
@bvdmitri
Member

This is expected behaviour. Different parametrizations have different speed characteristics when evaluating logpdf or rand. The natural-parameter space is far from ideal for logpdf evaluation, so it is indeed slower. The fact that MvNormalMeanCovariance is slower is also expected, because it requires extra matrix inverses.
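For intuition about why the parametrization matters: with a mean-covariance parametrization, every logpdf call has to solve a system against Σ, whereas factorizing Σ once up front reduces each subsequent call to a triangular solve and a dot product. A minimal sketch in plain Julia (this is not the ExponentialFamily.jl or BayesBase.jl API; `make_logpdf` is a hypothetical helper):

```julia
using LinearAlgebra

# Precompute the Cholesky factor and the normalization constant once;
# each returned logpdf evaluation is then only a triangular solve plus
# a dot product, with no matrix inverse or re-factorization per call.
function make_logpdf(μ::Vector{Float64}, Σ::Matrix{Float64})
    C = cholesky(Symmetric(Σ))
    d = length(μ)
    constant = -0.5 * (d * log(2π) + logdet(C))
    return x -> begin
        z = C.L \ (x - μ)           # one triangular solve per sample
        constant - 0.5 * dot(z, z)  # quadratic form (x-μ)' Σ⁻¹ (x-μ) via z
    end
end
```

Precomputing the factorization once is essentially what a mean-precision (or square-root) parametrization buys for repeated evaluation.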

The idea is to use the logpdf_optimized function defined in BayesBase.jl, which returns an optimal parametrization for repeated logpdf evaluation. However, we have not had time to implement it properly for the ExponentialFamilyDistribution structure; I guess for now it works only for Distribution structures.

julia> optimised = logpdf_optimized(golden_normal); # should return MvNormalMeanPrecision

julia> @benchmark logpdf($golden_normal, $samples)
BenchmarkTools.Trial: 10000 samples with 1 evaluation.
 Range (min … max):  229.023 μs … 838.760 μs  ┊ GC (min … max): 0.00% … 65.07%
 Time  (median):     232.834 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   242.893 μs ±  42.271 μs  ┊ GC (mean ± σ):  1.04% ±  4.62%

  ▂█▅▃▄▃ ▃▂▁▅▃ ▁▂▁    ▁  ▁                                      ▂
  █████████████████▇▇██████▆▄▇▆▄▄▆▅▅▃▆▆▅▄▆▆▆▆▆▆▅▅▅▄▆▇▃▄▁▃▄▃▄▃▄▃ █
  229 μs        Histogram: log(frequency) by time        352 μs <

 Memory estimate: 211.44 KiB, allocs estimate: 3003.
 
 julia> @benchmark logpdf($optimised, $samples)
BenchmarkTools.Trial: 10000 samples with 1 evaluation.
 Range (min … max):  147.888 μs …  1.122 ms  ┊ GC (min … max): 0.00% … 83.80%
 Time  (median):     152.977 μs              ┊ GC (median):    0.00%
 Time  (mean ± σ):   158.866 μs ± 43.102 μs  ┊ GC (mean ± σ):  1.91% ±  5.80%

  ▇▇█▇▃      ▁                                                 ▂
  ██████▆▅▅▇██▆▇▅▄▂▃▄███▇▆▇▆▄▆▆▅█▆▄▆▅▅▅▆▆▆▇▇▅▄▅▄▅▄▄▅▄▂▄▃▂▄▄▄▅▄ █
  148 μs        Histogram: log(frequency) by time       265 μs <

 Memory estimate: 289.19 KiB, allocs estimate: 2001.

That being said, there is still a lot of room for improvement. We can keep this issue open for future reference.

@Nimrais
Member Author

Nimrais commented Nov 22, 2023

Yes, you are absolutely correct; different parametrizations exhibit varied speed characteristics when evaluating logpdf.

However, I don't understand why we can't adopt a seemingly simple approach like the following: when evaluating a container of samples against the same distribution, reuse the log partition (which appears to be the primary cost).

For instance, something like this:

# The setup (packages, rng, golden_μ, golden_Σ, golden_normal, ef_mv_normal,
# ef, samples, hybridlinearize and the three benchmarks) is identical to the
# snippet in the opening comment.

function _logpdf(ef, x::AbstractMatrix)
	# Inner product of the natural parameters with the sufficient statistics of each sample
	stats_part = map(
		(s) -> begin
			stats = ExponentialFamily.sufficientstatistics(ef, s)
			sum(ExponentialFamily.pack_parameters(stats) .* ExponentialFamily.getnaturalparameters(ef))
		end,
		hybridlinearize(x)
	)

	# Log base measure of each sample
	basemeasure_part = map(
		(s) -> log(ExponentialFamily.basemeasure(ef, s)),
		hybridlinearize(x)
	)

	# The log partition depends only on the distribution, so evaluate it once
	logpartition_part = logpartition(ef)

	return stats_part .+ basemeasure_part .- logpartition_part
end

@benchmark _logpdf($(ef), $(samples))

For example, for normals it is efficient enough:

BenchmarkTools.Trial: 10000 samples with 1 evaluation.
 Range (min … max):  126.709 μs …   1.958 ms  ┊ GC (min … max):  0.00% … 91.33%
 Time  (median):     130.125 μs               ┊ GC (median):     0.00%
 Time  (mean ± σ):   155.554 μs ± 187.075 μs  ┊ GC (mean ± σ):  14.23% ± 10.78%

  █▂                                                            ▁
  ██▇▆▄▅▃▃▄▁▁▁▁▃▁▁▁▁▁▁▁▁▁▃▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▅▇ █
  127 μs        Histogram: log(frequency) by time       1.65 ms <

 Memory estimate: 649.19 KiB, allocs estimate: 3005.

@bvdmitri
Member

I think you are right, we definitely can (and should) do it. Would you mind opening a PR for that?

@Nimrais
Member Author

Nimrais commented Nov 22, 2023

yes, I can 👍

@bvdmitri
Member

bvdmitri commented Dec 4, 2023

Can this be closed?

@Nimrais
Member Author

Nimrais commented Dec 4, 2023

Yes, I think so. We have a separate issue for MvNormalWishart (#148); for all other distributions the approach is working.

@bvdmitri bvdmitri closed this as completed Dec 4, 2023