Gaussian Process Kernel Gradient

Hello,

I am looking at the gradient computation in Gaussian Process Kernel module. My understanding is that there we are trying to compute $$\partial{K(x,x')}{\partial \theta}$$, where $$\theta$$ is a hyperparameter. However, I am not sure that is what is computed in the code:

For example, the ConstantKernel has:
https://github.com/scikit-learn/scikit-learn/blob/f339609ab80ee79924cc739edc774182f4870c7c/sklearn/gaussian_process/kernels.py#L1013
While I think instead of filling in the constant value, we should fill in just 1.

Another example is the RBF kernel,
https://github.com/scikit-learn/scikit-learn/blob/f339609ab80ee79924cc739edc774182f4870c7c/sklearn/gaussian_process/kernels.py#L1232
I think the gradient should be further divided by the **length_scale**.

It seems to me that rather than computing the **gradient**, we are computing **gradient * hyperparameter** here. Am I missing something?

Thanks!
Junteng

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Gaussian Process Kernel Gradient #14206

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Gaussian Process Kernel Gradient #14206

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions