gpflux.sampling.sample
This module enables you to sample from (Deep) GPs using different approaches.
Module Contents
- compute_A_inv_b(A: gpflow.base.TensorType, b: gpflow.base.TensorType) → tf.Tensor [source]
  Computes \(A^{-1} b\) using the Cholesky of `A` instead of the explicit inverse, as this is often numerically more stable.
  - Parameters:
    - A – A positive-definite matrix with shape `[..., M, M]`. It can contain any leading dimensions (`...`) as long as they correspond to the leading dimensions in `b`.
    - b – A tensor with shape `[..., M, D]`. It can contain any leading dimensions (`...`) as long as they correspond to the leading dimensions in `A`.
  - Returns:
    A tensor with shape `[..., M, D]`. The leading dimensions originate from `A` and `b`.
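For illustration, the Cholesky-based solve can be reproduced with plain TensorFlow primitives. This is a minimal sketch of the idea, not the library's internal code:

```python
import tensorflow as tf

# A positive-definite A with shape [M, M] and a right-hand side b
# with shape [M, D]; leading batch dimensions work the same way.
A = tf.constant([[4.0, 1.0], [1.0, 3.0]], dtype=tf.float64)
b = tf.constant([[1.0], [2.0]], dtype=tf.float64)

L = tf.linalg.cholesky(A)  # A = L @ L^T with L lower-triangular
# Solve L y = b, then L^T x = y, so that x = A^{-1} b.
y = tf.linalg.triangular_solve(L, b, lower=True)
x = tf.linalg.triangular_solve(L, y, lower=True, adjoint=True)
```

Two triangular solves avoid forming \(A^{-1}\) explicitly, which is both cheaper and better conditioned than computing the inverse.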
- class KernelWithFeatureDecomposition(kernel: gpflow.kernels.Kernel | NoneType, feature_functions: gpflow.keras.tf_keras.layers.Layer, feature_coefficients: gpflow.base.TensorType) [source]
  Bases: `gpflow.kernels.Kernel`
This class represents a kernel together with its finite feature decomposition:
  \[k(x, x') = \sum_{i=0}^L \lambda_i \phi_i(x) \phi_i(x'),\]
  where \(\lambda_i\) and \(\phi_i(\cdot)\) are the coefficients and features, respectively.
  The decomposition can be derived from Mercer's theorem or Bochner's theorem. For example, feature-coefficient pairs could be eigenfunction-eigenvalue pairs (Mercer) or Fourier features with constant coefficients (Bochner).
  In some cases (e.g., [1] and [2]) the left-hand side (that is, the covariance function \(k(\cdot, \cdot)\)) is unknown and the kernel can only be approximated using its feature decomposition. In other cases (e.g., [3] and [4]), both the covariance function and the feature decomposition are available in closed form. A construction sketch is given after this class.
  - Parameters:
    - kernel – The kernel corresponding to the feature decomposition. If `None`, there is no analytical expression associated with the infinite sum and we approximate the kernel based on the feature decomposition.
      Note: In certain cases, the analytical expression for the kernel is not available. In this case, passing `None` is allowed, and `K()` and `K_diag()` will be computed using the approximation provided by the feature decomposition.
    - feature_functions – A Keras layer whose call evaluates the `L` features of the kernel, \(\phi_i(\cdot)\). For `X` with the shape `[N, D]`, `feature_functions(X)` returns a tensor with the shape `[N, L]`.
    - feature_coefficients – A tensor with the shape `[L, 1]` with the coefficients associated with the features, \(\lambda_i\).
  - property feature_functions: gpflow.keras.tf_keras.layers.Layer
Return the kernel’s features \(\phi_i(\cdot)\).
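The construction sketch below uses random Fourier features as a Bochner-style decomposition, following the pattern of the gpflux efficient-sampling tutorial. The `RandomFourierFeatures` import path, its signature, and the unit feature coefficients are assumptions that may differ between gpflux versions:

```python
import numpy as np
import gpflow
from gpflux.layers.basis_functions.fourier_features import RandomFourierFeatures
from gpflux.sampling import KernelWithFeatureDecomposition

kernel = gpflow.kernels.SquaredExponential()
num_features = 1000  # L

# Fourier features with constant coefficients (Bochner); in gpflux the
# appropriate scaling is carried by the feature functions themselves.
feature_functions = RandomFourierFeatures(kernel, num_features, dtype=np.float64)
feature_coefficients = np.ones((num_features, 1), dtype=np.float64)  # [L, 1]

approx_kernel = KernelWithFeatureDecomposition(
    kernel, feature_functions, feature_coefficients
)
# approx_kernel.K(X) and approx_kernel.K_diag(X) behave like the base
# kernel, while the decomposition enables efficient sampling.
```

Passing the analytical `kernel` alongside the features keeps `K()` exact; passing `None` instead would fall back to the feature-based approximation described above.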
- draw_conditional_sample(mean: gpflow.base.TensorType, cov: gpflow.base.TensorType, f_old: gpflow.base.TensorType) → tf.Tensor [source]
  Draw a sample \(\tilde{f}_\text{new}\) from the conditional multivariate Gaussian \(p(f_\text{new} \mid f_\text{old})\), where the parameters `mean` and `cov` are the mean and covariance matrix of the joint multivariate Gaussian over \([f_\text{old}, f_\text{new}]\).
  - Parameters:
    - mean – A tensor with the shape `[..., D, N+M]` with the mean of `[f_old, f_new]`. For each `[..., D]` this is a stacked vector of the form:
      \[\begin{split}\begin{pmatrix} \operatorname{mean}(f_\text{old}) \;[N] \\ \operatorname{mean}(f_\text{new}) \;[M] \end{pmatrix}\end{split}\]
    - cov – A tensor with the shape `[..., D, N+M, N+M]` with the covariance of `[f_old, f_new]`. For each `[..., D]`, there is a \(2 \times 2\) block matrix of the form:
      \[\begin{split}\begin{pmatrix} \operatorname{cov}(f_\text{old}, f_\text{old}) \;[N, N] & \operatorname{cov}(f_\text{old}, f_\text{new}) \;[N, M] \\ \operatorname{cov}(f_\text{new}, f_\text{old}) \;[M, N] & \operatorname{cov}(f_\text{new}, f_\text{new}) \;[M, M] \end{pmatrix}\end{split}\]
    - f_old – A tensor of observations with the shape `[..., D, N]`, drawn from a Normal distribution with mean \(\operatorname{mean}(f_\text{old}) \;[N]\) and covariance \(\operatorname{cov}(f_\text{old}, f_\text{old}) \;[N, N]\).
  - Returns:
    A sample \(\tilde{f}_\text{new}\) from the conditional normal \(p(f_\text{new} \mid f_\text{old})\) with the shape `[..., D, M]`.
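A toy sketch of the tensor layout this function expects, with the joint covariance built from a gpflow kernel; the import path for the function is an assumption:

```python
import numpy as np
import tensorflow as tf
import gpflow
from gpflux.sampling.utils import draw_conditional_sample  # path is an assumption

kernel = gpflow.kernels.SquaredExponential()
X_old = np.linspace(0.0, 1.0, 5)[:, None]   # N = 5 old inputs
X_new = np.linspace(1.0, 2.0, 3)[:, None]   # M = 3 new inputs
K = kernel(np.concatenate([X_old, X_new]))  # [N+M, N+M] joint covariance

mean = tf.zeros((1, 8), dtype=K.dtype)      # [D=1, N+M], zero mean
cov = K[None]                               # [D=1, N+M, N+M]

# Draw f_old from its N-dimensional marginal (zero mean, K[:N, :N]) ...
L_old = tf.linalg.cholesky(K[:5, :5] + 1e-6 * tf.eye(5, dtype=K.dtype))
f_old = tf.linalg.matvec(L_old, tf.random.normal([5], dtype=K.dtype))[None]

# ... then sample f_new | f_old, returned with shape [D=1, M=3].
f_new = draw_conditional_sample(mean, cov, f_old)
```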
- class Sample [source]
  Bases: `abc.ABC`
  This class represents a sample from a GP that you can evaluate at new locations within the support of the GP by using `__call__`.
  Importantly, the same function draw (sample) is evaluated when calling it multiple times. This property is called consistency. Achieving consistency for vanilla GPs is costly because it scales cubically with the number of evaluation points, but it works with any kernel. It is implemented in `_efficient_sample_conditional_gaussian()`. For `KernelWithFeatureDecomposition`, the more efficient approach following Wilson et al. [WBT+20] is implemented in `_efficient_sample_matheron_rule()`. A usage sketch follows this class.
  See the tutorial notebooks Efficient sampling and Weight Space Approximation with Random Fourier Features for an in-depth overview.
  - abstract __call__(X: gpflow.base.TensorType) → tf.Tensor [source]
    Return the evaluation of the GP sample \(f(X)\) for \(f \sim GP(0, k)\).
    - Parameters:
      X – The inputs, a tensor with the shape `[N, D]`, where `D` is the input dimensionality.
    - Returns:
      Function values, a tensor with the shape `[N, P]`, where `P` is the output dimensionality.
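To make the consistency property concrete, here is a hedged sketch using `efficient_sample`, the dispatcher that selects between the two implementations below based on the kernel type; the import path and the tolerance are assumptions:

```python
import numpy as np
import gpflow
from gpflux.sampling import efficient_sample  # path is an assumption

kernel = gpflow.kernels.SquaredExponential()
Z = np.linspace(0.0, 1.0, 10)[:, None]            # M = 10 inducing points
inducing_variable = gpflow.inducing_variables.InducingPoints(Z)
q_mu = np.zeros((10, 1))                          # [M, P]
q_sqrt = 0.1 * np.eye(10)[None]                   # [P, M, M]

f_sample = efficient_sample(inducing_variable, kernel, q_mu, q_sqrt=q_sqrt)

# Consistency: evaluating the same Sample twice at the same inputs
# returns the same function values (up to numerical jitter).
X = np.random.rand(5, 1)
np.testing.assert_allclose(f_sample(X), f_sample(X), atol=1e-2)
```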
- _efficient_sample_conditional_gaussian(inducing_variable: gpflow.inducing_variables.InducingVariables, kernel: gpflow.kernels.Kernel, q_mu: tf.Tensor, *, q_sqrt: gpflow.base.TensorType | None = None, whiten: bool = False) → Sample [source]
  The most costly implementation for obtaining a consistent GP sample; however, it can be used with any kernel.
- _efficient_sample_matheron_rule(inducing_variable: gpflow.inducing_variables.InducingVariables, kernel: gpflux.sampling.kernel_with_feature_decomposition.KernelWithFeatureDecomposition, q_mu: tf.Tensor, *, q_sqrt: gpflow.base.TensorType | None = None, whiten: bool = False) → Sample [source]
  Implements the efficient sampling rule from Wilson et al. [WBT+20] using the Matheron rule. To use this sampling scheme, the GP has to have a `kernel` of the `KernelWithFeatureDecomposition` type. A usage sketch follows the parameter list.
  - Parameters:
    - kernel – A kernel of the `KernelWithFeatureDecomposition` type, which holds the covariance function and the kernel's features and coefficients.
    - q_mu – A tensor with the shape `[M, P]`.
    - q_sqrt – A tensor with the shape `[P, M, M]`.
    - whiten – Determines the parameterisation of the inducing variables.
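Putting the pieces together, this sketch shows how passing a `KernelWithFeatureDecomposition` routes `efficient_sample` to the Matheron-rule implementation. Import paths and the `RandomFourierFeatures` signature are assumptions based on the gpflux tutorials:

```python
import numpy as np
import gpflow
from gpflux.layers.basis_functions.fourier_features import RandomFourierFeatures
from gpflux.sampling import KernelWithFeatureDecomposition, efficient_sample

base_kernel = gpflow.kernels.Matern52()
features = RandomFourierFeatures(base_kernel, 500, dtype=np.float64)  # L = 500
coefficients = np.ones((500, 1), dtype=np.float64)                   # [L, 1]
kernel = KernelWithFeatureDecomposition(base_kernel, features, coefficients)

Z = np.linspace(0.0, 1.0, 20)[:, None]
inducing_variable = gpflow.inducing_variables.InducingPoints(Z)
q_mu = np.zeros((20, 1))         # [M, P]
q_sqrt = 0.1 * np.eye(20)[None]  # [P, M, M]

# Dispatch selects the Matheron-rule sampler for this kernel type; the
# returned Sample is consistent and cheap to evaluate at new points.
f_sample = efficient_sample(inducing_variable, kernel, q_mu, q_sqrt=q_sqrt)
f_values = f_sample(np.random.rand(100, 1))  # [100, P]
```

Unlike the conditional-Gaussian fallback, evaluation cost here does not grow cubically with the number of previously queried points, which is what makes the Matheron-rule sampler attractive for (deep) GP layers.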