
Conversation

@sc336
Contributor

@sc336 sc336 commented Mar 28, 2023

PR type: new feature

Summary

Proposed changes
Adds a new categorical kernel that supports categorical variables by mapping each category to a value in a learned latent space.

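The core idea can be sketched in plain NumPy (illustrative only; the names `categorical_rbf` and `latent_values` are hypothetical and this is not the actual GPflow API): each integer label is replaced by a learned latent scalar, and an ordinary stationary kernel is then evaluated on those latent coordinates.

```python
import numpy as np

def categorical_rbf(labels_a, labels_b, latent_values, lengthscale=1.0):
    """Toy categorical kernel: map integer labels to latent scalars,
    then evaluate a squared-exponential kernel on the latent values."""
    za = latent_values[labels_a]           # (N,) latent coordinates
    zb = latent_values[labels_b]           # (M,)
    d2 = (za[:, None] - zb[None, :]) ** 2  # pairwise squared distances
    return np.exp(-0.5 * d2 / lengthscale ** 2)

latent = np.array([0.0, 0.3, 2.0])         # one latent value per category
K = categorical_rbf(np.array([0, 1, 2]), np.array([0, 1, 2]), latent)
```

Categories whose latent values end up close together are treated as similar by the kernel; the latent values themselves are trainable parameters.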
Minimal working example

# Put your example code in here

Fully backwards compatible: yes

PR checklist

  • New features: code is well-documented
    • detailed docstrings (API documentation)
    • notebook examples (usage demonstration)
  • The bug case / new feature is covered by unit tests
  • Code has type annotations
  • Build checks
    • I ran the black+isort formatter (make format)
    • I locally tested that the tests pass (make check-all)
  • Release management
    • RELEASE.md updated with entry for this change
    • New contributors: I've added myself to CONTRIBUTORS.md

@codecov

codecov bot commented Mar 29, 2023

Codecov Report

Patch coverage: 100.00% and project coverage change: +0.04% 🎉

Comparison is base (7c6beae) 98.01% compared to head (748352b) 98.06%.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #2055      +/-   ##
===========================================
+ Coverage    98.01%   98.06%   +0.04%     
===========================================
  Files           96       97       +1     
  Lines         5404     5436      +32     
===========================================
+ Hits          5297     5331      +34     
+ Misses         107      105       -2     
Impacted Files                   Coverage Δ
gpflow/kernels/__init__.py       100.00% <ø> (ø)
gpflow/kernels/categorical.py    100.00% <100.00%> (ø)

... and 1 file with indirect coverage changes


@sc336 sc336 requested a review from uri-granta March 30, 2023 08:18
Member

@uri-granta uri-granta left a comment


Various comments! Also probably worth checking with Victor regarding which essential tests are worth adding. Code looks good though!

from ..base import Parameter, TensorType
from . import Kernel

tfd = tfp.distributions
Member


More normal to write

Suggested change
tfd = tfp.distributions
from tensorflow_probability import distributions as tfd

from . import Kernel

tfd = tfp.distributions
st = tfp.bijectors.Softplus().inverse
Member


I don't think you're actually using st or tfd!

Suggested change
st = tfp.bijectors.Softplus().inverse

of degrees of freedom in optimisation. You may also find fixing the lengthscale to be useful here.
Note that Z is parameterised by the differences of latent space values for each category for the same reason.
Multiple categories can be included by wrapping multiple layers of CategoricalKernel.
:param wrapped_kernel_1: The non-categorical kernel.
Member


Surely clearer to call these arguments non_categorical_kernel and categorical_kernel?

# wrapped_kernel_2 is the one that trains the categorical variables
set_trainable(wrapped_kernel_2, False)
self.wrapped_kernel = wrapped_kernel_1 * wrapped_kernel_2
label_dim = 1
Member


I thought the categorical_kernel was supposed to specify the label dimension in its active_dims?

Contributor Author


We only support one label dimension for now; this could be extended at a later date.
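For reference, the `wrapped_kernel_1 * wrapped_kernel_2` product corresponds to an elementwise product of the two factors' Gram matrices. A NumPy sketch (illustrative only; `rbf` is a hypothetical helper, not GPflow code) with one continuous dimension and one latent categorical dimension:

```python
import numpy as np

def rbf(x, y, lengthscale=1.0):
    """Squared-exponential kernel on 1-D inputs."""
    d2 = (x[:, None] - y[None, :]) ** 2
    return np.exp(-0.5 * d2 / lengthscale ** 2)

x = np.linspace(0.0, 1.0, 4)          # continuous inputs
z = np.array([0.0, 0.0, 1.0, 1.0])    # latent value of each point's category

# Product kernel: the Gram matrix is the elementwise product of the factors'.
K = rbf(x, x) * rbf(z, z)
```

Two points are only highly correlated when both their continuous inputs and their latent category values are close.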

set_trainable(wrapped_kernel_2, False)
self.wrapped_kernel = wrapped_kernel_1 * wrapped_kernel_2
label_dim = 1
self._Z_deltas = Parameter(
Member


A comment might be useful?

# parametrise by the `num_labels - 1` differences of latent space values

"""
Z = tf.concat([tf.constant(0, shape=(1,), dtype=tf.float64), tf.squeeze(self._Z_deltas)], 0)
m = tf.linalg.band_part(tf.ones([tf.size(Z), tf.size(Z)], dtype=tf.float64), -1, 0)
return tf.expand_dims(tf.linalg.matvec(m, Z), -1)
Member


looks like voodoo to me but I believe you! (though a simple unit test might be useful)

Contributor Author


I'll add a comment to explain it.
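The triangular matvec is just a cumulative sum: with the first latent value pinned at zero, multiplying the delta vector by a lower-triangular matrix of ones recovers the latent values. A NumPy equivalent of the TensorFlow snippet above (variable names here are illustrative):

```python
import numpy as np

# First latent value is pinned at 0; the remaining ones are parameterised
# by their successive differences (deltas).
deltas = np.array([0.5, -0.2, 1.0])
z = np.concatenate([[0.0], deltas])        # [0, d1, d2, d3]

# A lower-triangular matrix of ones turns the vector into cumulative sums,
# mirroring tf.linalg.band_part(tf.ones(...), -1, 0) and tf.linalg.matvec.
m = np.tril(np.ones((z.size, z.size)))
latent = m @ z                             # latent values per category
```

This is why the first category's latent value is always 0: it removes one redundant degree of freedom from the optimisation.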

np.testing.assert_allclose(K, K1 + K2)


def test_concat_inputs_with_latents() -> None:
Member


I feel like there are definitely missing tests for the Categorical kernel (beyond broadcasting), but I'm probably not the person to spot them! Also don't we want a test that actually uses Categorical in a model?

@sc336 sc336 force-pushed the sc336/categorical_kernel branch from c00be2e to 748352b Compare April 3, 2023 12:02
Member

@uri-granta uri-granta left a comment


Looks good (though my OCD is still vexed by _concat_inputs_with_latents describing the same dimension as both D and x_dim!). Are you ok to create an issue to investigate writing more tests?

@sc336 sc336 merged commit 39bd130 into develop Apr 4, 2023
@sc336
Contributor Author

sc336 commented Apr 4, 2023

Looks good (though my OCD is still vexed by _concat_inputs_with_latents describing the same dimension as both D and x_dim!). Are you ok to create an issue to investigate writing more tests?

#2058

@khurram-ghani khurram-ghani mentioned this pull request May 3, 2023