PyMC/PyTensor Implementation of Pathfinder VI #387

aphc14 · 2024-10-31T10:42:57Z

Another version to draft PR #386 which uses more of PyTensor's symbolic variables and compiling functions.

Questions for Review

Which implementations should I continue for future improvements?
Are there additional PyTensor optimisations we could leverage?

…sion

`fit_pathfinder` - Edited `fit_pathfinder` to produce `pathfinder_state`, `pathfinder_info`, `pathfinder_samples` and `pathfinder_idata` for closer examination of the outputs. - Changed the `num_samples` argument name to `num_draws` to avoid `TypeError` got multiple values for keyword argument 'num_samples'. - Initial points are automatically set to jitter as jitter is required for pathfinder. Extras - New function 'get_jaxified_logp_ravel_inputs' to simplify previous code structure in fit_pathfinder. Tests - Added extra test for pathfinder to test pathfinder_info variables and pathfinder_idata are consistent for a given random seed.

Add a new PyMC-based implementation of Pathfinder VI that uses PyTensor operations which provides support for both PyMC and BlackJAX backends in fit_pathfinder.

- Implemented in to support running multiple Pathfinder instances in parallel. - Implemented function in for Pareto Smoothed Importance Resampling (PSIR). - Moved relevant pathfinder files into the directory. - Updated tests to reflect changes in the Pathfinder implementation and added tests for new functionalities.

aphc14 · 2024-11-04T19:31:18Z

Suppose the preferred approach is to stick with symbolic variables in PyTensor than the other non-symbolic approach in #386. In that case, I'd be happy to refactor the Multipath Pathfinder implementation in #386 to use symbolic variables and pytensor.function.

…nd .

…race data to InferenceData

… for bfgs_sample

aphc14 · 2024-11-07T18:15:31Z

This version runs much faster than #386, but the codes are messier due to the numerous pytensor symbolic variables created for the compiled pytensor functions (see the lines of code between def compute_logp and def single_pathfinder). Any suggestions for a cleaner setup would be appreciated

tests/test_pathfinder.py

pymc_experimental/inference/pathfinder/pathfinder.py

fonnesbeck · 2024-11-08T02:42:08Z

pymc_experimental/inference/pathfinder/lbfgs.py

+    g: np.ndarray
+
+
+class LBFGSHistoryManager:


Cleaner to use a data class? Don't know.

yep, I agree. dataclass now added

pymc_experimental/inference/pathfinder/importance_sampling.py

Summaryh of changes: - Remove multiprocessing code in favour of reusing compiled for each path - takes only random_seed as argument for each path - Compute graph significantly smaller by using pure pytensor op and symoblic variables - Added LBFGSOp to compile with pytensor.function - Cleaned up codes using pytensor variables

…and . - Corrected the dimensions in comments for matrices Q and R in the function. - Uumerical stability in the calculation by changing from to .

fonnesbeck · 2024-11-17T19:40:34Z

pymc_experimental/inference/fit.py

@@ -31,11 +31,13 @@ def fit(method, **kwargs):
    arviz.InferenceData
    """
    if method == "pathfinder":
+        # TODO: Remove this once we have a pure PyMC implementation


This PR will provide that, no?

the latest commit addresses this

Fixed incorrect and inconsistent posterior approximations in the Pathfinder VI algorithm by: 1. Adding missing parentheses in the phi calculation to ensure proper order of operations in matrix multiplications 2. Changing the sign in mu calculation from 'x +' to 'x -' to match Stan's implementation (which differs from the original paper) The resulting changes now make the posterior approximations more reliable.

Implements both sparse and dense BFGS sampling approaches for Pathfinder VI: - Adds bfgs_sample_dense for cases where 2*maxcor >= num_params. - Moved existing and computations to bfgs_sample_sparse, making the sparse use cases more explicit. Other changes: - Sets default maxcor=5 instead of dynamic sizing based on parameters Dense approximations are recommended when the target distribution has higher dependencies among the parameters.

Bigger changes: - Made pmx.fit compatible with method='pathfinder' - Remove JAX dependency when inference_backend='pymc' to support Windows users - Improve runtime performance by setting trust_input=True for compiled functions Minor changes: - Change default num_paths from 1 to 4 for stable and reliable approximations - Change LBFGS code using dataclasses - Update tests to handle both PyMC and BlackJAX backends

- Add LBFGSInitFailed exception for failed LBFGS initialisation - Skip failed paths in multipath_pathfinder and track number of failures - Handle NaN values from Cholesky decompsition in bfgs_sample - Add checks for numericl stabilty in matrix operations Slight performance improvements: - Set allow_gc=False in scan ops - Use FAST_RUN mode consistently

Major: - Added progress bar support. Minor - Added exception for non-finite log prob values - Removed . - Allowed maxcor argument to be None, and dynamically set based on the number of model parameters. - Improved logging to inform users about failed paths and lbfgs initialisation.

aphc14 · 2024-11-30T16:49:17Z

Need to make an important change related to how important sampling is done. Based on some tests, for trickier posteriors, psir (Pareto smoothed importance resampling) tends to cause many large peaks. In contrast to the reference posterior (what you’d get using NUTS), it doesn’t have such peaks.

Turning off resampling, you'd get psis instead, and the final posterior better resembles NUTS, so you don't get the weird peaks behaviour. But this would differ from the original paper, which uses psir.

Since the choice of importance sampling (IS) can have a big impact on the final posterior, and there are several IS methods, I plan to use a class variable that controls how IS is done based on the user inputs. I'm thinking of making psis (and not psir) the default IS behaviour as the safest and most generally reliable option.

Shouldn't take long to fix.

aphc14 added 7 commits October 19, 2024 23:48

renamed samples argument name and pathfinder variables to avoid confu…

4540b84

…sion

extract additional pathfinder objects from high level API for debugging

8835cd5

changed pathfinder samples argument to num_draws

663a60a

Merge branch 'replicate_pathfinder_w_pytensor' into scipy_lbfgs

05aeeaf

feat(pathfinder): add PyMC-based Pathfinder VI implementation

0db91fe

Add a new PyMC-based implementation of Pathfinder VI that uses PyTensor operations which provides support for both PyMC and BlackJAX backends in fit_pathfinder.

aphc14 added 4 commits November 7, 2024 20:40

Added type hints and epsilon parameter to fit_pathfinder

2efb511

Removed initial point values (l=0) to reduce iterations. Simplified a…

fdc3f38

…nd .

Added placeholder/reminder to remove jax dependency when converting t…

1fd7a11

…race data to InferenceData

Sync updates with draft PR pymc-devs#386. \n- Added pytensor.function…

ef2956f

… for bfgs_sample

aphc14 force-pushed the pathfinder_w_pytensor_symbolic branch from 9bfc48c to ef2956f Compare November 7, 2024 18:04

aphc14 changed the title ~~Pathfinder w pytensor symbolic~~ PyMC/PyTensor Implementation of Pathfinder VI Nov 7, 2024

fonnesbeck reviewed Nov 8, 2024

View reviewed changes

tests/test_pathfinder.py Show resolved Hide resolved

fonnesbeck reviewed Nov 8, 2024

View reviewed changes

pymc_experimental/inference/pathfinder/pathfinder.py Outdated Show resolved Hide resolved

fonnesbeck reviewed Nov 8, 2024

View reviewed changes

pymc_experimental/inference/pathfinder/importance_sampling.py Outdated Show resolved Hide resolved

aphc14 mentioned this pull request Nov 11, 2024

PyMC Implementation of Pathfinder VI #386

Closed

aphc14 marked this pull request as ready for review November 11, 2024 17:52

aphc14 marked this pull request as draft November 11, 2024 17:53

- Added TODO comments for implementing Taylor approximation methods: …

6484b3d

…and . - Corrected the dimensions in comments for matrices Q and R in the function. - Uumerical stability in the calculation by changing from to .

fonnesbeck reviewed Nov 17, 2024

View reviewed changes

aphc14 added 5 commits November 21, 2024 18:37

aphc14 marked this pull request as ready for review November 27, 2024 17:04

set maxcor to max(5, floor(N / 1.9)). max=1 will cause error

9faaa72

aphc14 marked this pull request as draft November 30, 2024 16:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PyMC/PyTensor Implementation of Pathfinder VI #387

PyMC/PyTensor Implementation of Pathfinder VI #387

aphc14 commented Oct 31, 2024

aphc14 commented Nov 4, 2024

aphc14 commented Nov 7, 2024 •

edited

Loading

fonnesbeck Nov 8, 2024

aphc14 Nov 25, 2024

fonnesbeck Nov 17, 2024

aphc14 Nov 25, 2024

aphc14 commented Nov 30, 2024

PyMC/PyTensor Implementation of Pathfinder VI #387

Are you sure you want to change the base?

PyMC/PyTensor Implementation of Pathfinder VI #387

Conversation

aphc14 commented Oct 31, 2024

aphc14 commented Nov 4, 2024

aphc14 commented Nov 7, 2024 • edited Loading

fonnesbeck Nov 8, 2024

Choose a reason for hiding this comment

aphc14 Nov 25, 2024

Choose a reason for hiding this comment

fonnesbeck Nov 17, 2024

Choose a reason for hiding this comment

aphc14 Nov 25, 2024

Choose a reason for hiding this comment

aphc14 commented Nov 30, 2024

aphc14 commented Nov 7, 2024 •

edited

Loading