Skip to content

Latest commit

 

History

History
50 lines (50 loc) · 1.9 KB

2021-07-01-aitchison21a.md

File metadata and controls

50 lines (50 loc) · 1.9 KB
title abstract layout series publisher issn id month tex_title firstpage lastpage page order cycles bibtex_author author date address container-title volume genre issued pdf extras
Deep kernel processes
We define deep kernel processes in which positive definite Gram matrices are progressively transformed by nonlinear kernel functions and by sampling from (inverse) Wishart distributions. Remarkably, we find that deep Gaussian processes (DGPs), Bayesian neural networks (BNNs), infinite BNNs, and infinite BNNs with bottlenecks can all be written as deep kernel processes. For DGPs the equivalence arises because the Gram matrix formed by the inner product of features is Wishart distributed, and as we show, standard isotropic kernels can be written entirely in terms of this Gram matrix — we do not need knowledge of the underlying features. We define a tractable deep kernel process, the deep inverse Wishart process, and give a doubly-stochastic inducing-point variational inference scheme that operates on the Gram matrices, not on the features, as in DGPs. We show that the deep inverse Wishart process gives superior performance to DGPs and infinite BNNs on fully-connected baselines.
inproceedings
Proceedings of Machine Learning Research
PMLR
2640-3498
aitchison21a
0
Deep Kernel Processes
130
140
130-140
130
false
Aitchison, Laurence and Yang, Adam and Ober, Sebastian W
given family
Laurence
Aitchison
given family
Adam
Yang
given family
Sebastian W.
Ober
2021-07-01
Proceedings of the 38th International Conference on Machine Learning
139
inproceedings
date-parts
2021
7
1