Code to reproduce the experiments of the paper "On the Hardness of Probabilistic Neurosymbolic Learning" (ICML 2024).
Install d4 and EvalMaxSAT, for exact compilation and MaxSAT solving, respectively. Furthermore, make sure the following Python libraries are installed.
pip install numpy torch pycmsgen problog pysdd tqdm matplotlib scienceplots
The MCC benchmarks can be downloaded in DIMACS format at:
Note that we only use the weighted model counting track (track2). The different benchmarks should be saved in the data folder.
The Road-R benchmark is already included.
Note that Windows is not supported.

The random weights of the MCC benchmarks are generated using
python scripts/reweight_instances.py
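The exact scheme lives in `scripts/reweight_instances.py`. As a rough illustration (an assumed sketch, not necessarily the paper's actual distribution or file format), assigning a random weight to each variable of a DIMACS CNF could look like:

```python
import random

def random_weights(cnf_text, seed=0):
    """Assign a random weight in (0, 1) to each variable of a DIMACS CNF.

    Hypothetical sketch: the repository's reweighting script may use a
    different distribution or output format.
    """
    rng = random.Random(seed)
    n_vars = None
    for line in cnf_text.splitlines():
        if line.startswith("p cnf"):
            n_vars = int(line.split()[2])  # number of variables
            break
    if n_vars is None:
        raise ValueError("missing DIMACS problem line")
    # Weight of the positive literal; the negative literal gets 1 - w.
    return {v: rng.random() for v in range(1, n_vars + 1)}

cnf = "p cnf 3 2\n1 -2 0\n2 3 0\n"
weights = random_weights(cnf)
```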
First, generate the d-DNNF circuits with d4. (This will take a while.)
python scripts/run_d4.py
Next, we can evaluate these circuits in d4 to get the exact gradients. (This will take a while. For the largest circuits, 256GB RAM is necessary.)
python scripts/nnf_to_grads.py
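The key idea behind this step is that a d-DNNF circuit computes the weighted model count as a polynomial in the literal weights, so exact gradients fall out of reverse-mode differentiation. A miniature example (a hypothetical hand-built circuit for (a AND b) OR (NOT a AND c), not the script's actual pipeline) using torch autograd:

```python
import torch

# Variable probabilities P(a), P(b), P(c) as differentiable weights.
w = torch.tensor([0.8, 0.6, 0.3], requires_grad=True)
a, b, c = w[0], w[1], w[2]

# In a d-DNNF, AND nodes multiply and (deterministic) OR nodes add,
# so the circuit for (a AND b) OR (NOT a AND c) evaluates to:
wmc = a * b + (1 - a) * c

wmc.backward()  # exact gradient dWMC/dw via reverse-mode autodiff
```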
Finally, we can run one of the approximate methods.
python scripts/run_approx_solver.py
You can select the specific method and hyperparameters at the bottom of the file. (To run all methods with multiple hyperparameters, it's recommended to parallelize this over multiple machines.)
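One of the simplest approximate baselines is Monte Carlo sampling. As a sketch (an assumed score-function estimator, not necessarily one of the methods implemented in `run_approx_solver.py`), the probability that a formula holds and its gradient can be estimated from samples:

```python
import torch

torch.manual_seed(0)

def sample_grad(probs, formula, n_samples=10000):
    """Estimate P(formula) and a surrogate whose gradient w.r.t. `probs`
    estimates dP/dprobs (REINFORCE / score-function estimator).
    Hypothetical sketch of a sampling baseline."""
    samples = torch.bernoulli(probs.detach().expand(n_samples, -1))
    sat = formula(samples).float()  # 1 if the sample satisfies the formula
    # Log-likelihood of each sample under the current probabilities.
    log_p = (samples * probs.log() + (1 - samples) * (1 - probs).log()).sum(-1)
    surrogate = (sat * log_p).mean()  # grad of this estimates grad of P(formula)
    return sat.mean(), surrogate

# Toy formula: (a AND b) OR (NOT a AND c), on 0/1-valued samples.
probs = torch.tensor([0.8, 0.6, 0.3], requires_grad=True)
formula = lambda x: (x[:, 0] * x[:, 1] + (1 - x[:, 0]) * x[:, 2]) > 0
p_hat, surrogate = sample_grad(probs, formula)
surrogate.backward()  # probs.grad now holds the gradient estimate
```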
Similarly, the training experiments can be run using
python scripts/run_training.py
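Conceptually, each training step pushes a network's predicted probabilities toward satisfying a logical constraint by maximizing the weighted model count. A minimal sketch of this loop (assumed setup with a dummy linear model and a tiny hand-coded circuit, not the paper's actual models or benchmarks):

```python
import torch

torch.manual_seed(0)

# Hypothetical feature -> probability model and dummy inputs.
net = torch.nn.Linear(4, 3)
opt = torch.optim.Adam(net.parameters(), lr=0.1)
x = torch.randn(16, 4)

for _ in range(100):
    p = torch.sigmoid(net(x))  # P(a), P(b), P(c) per example
    # Exact WMC of the constraint (a AND b) OR (NOT a AND c).
    wmc = p[:, 0] * p[:, 1] + (1 - p[:, 0]) * p[:, 2]
    loss = -wmc.log().mean()   # semantic loss: the constraint should hold
    opt.zero_grad()
    loss.backward()
    opt.step()
```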
Once all experiments have finished running, the figures and table of the paper can be generated using
python scripts/aggregate_results.py
@InProceedings{maene2024hardness,
title = {{O}n the {H}ardness of {P}robabilistic {N}eurosymbolic {L}earning},
author = {Maene, Jaron and Derkinderen, Vincent and De Raedt, Luc},
booktitle = {Proceedings of the 41st International Conference on Machine Learning},
pages = {34203--34218},
year = {2024},
editor = {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
volume = {235},
series = {Proceedings of Machine Learning Research},
month = {21--27 Jul},
publisher = {PMLR},
}