resolve #4587 add inductive learning example #6478
chiragnagpal wants to merge 1 commit into scikit-learn:master from
Conversation
See #4587. (GitHub doesn't create links from PR headers.)
@jnothman this was your idea ;)
jnothman left a comment
Apologies for the very slow review. I'm not yet sure this is persuasive. It might be worth thinking about whether there's a use case that can motivate it clearly.
This should be in examples/cluster/plot_inductive_learning.py
Thanks.
@@ -0,0 +1,85 @@
"""
==============================================
Inductive Learning with Scikit Learn
I think you must mean 'inductive clustering'. "With scikit-learn" is redundant inside scikit-learn.
Clustering is expensive, especially when our dataset contains millions of
datapoints. Recomputing the clusters every time we receive some new data
is thus, in many cases, intractable. With more data, there is also the
"With more data, "... this comment only really makes sense if you say something more explicit about acquiring more data from a noisier source than was used to build a clustering... no?
One solution to this problem, is to first infer the target classes using
some unsupervised learning algorithm and then fit a classifier on the
inferred targets, treating it as a supervised problem. This is known as
Transductive learning.
No, this is certainly not transductive. Whether we're using "inductive" correctly is another matter.
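For context, the pattern being debated (cluster once, then learn a classifier on the inferred labels so new points can be labelled without re-clustering) can be sketched as follows. The estimator choices here are illustrative, not from the PR:

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.ensemble import RandomForestClassifier

# Cluster a batch of data (the expensive, one-off step).
X, _ = make_blobs(n_samples=500, centers=3, random_state=0)
clusterer = KMeans(n_clusters=3, random_state=0).fit(X)

# Fit a classifier on the inferred cluster labels, treating them
# as supervised targets.
clf = RandomForestClassifier(random_state=0).fit(X, clusterer.labels_)

# New data gets labels without re-running the clustering.
X_new, _ = make_blobs(n_samples=10, centers=3, random_state=1)
labels_new = clf.predict(X_new)
```

Whether "inductive" or "transductive" is the right word for the two halves is exactly what the review thread is arguing about.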
n_samples = 5000
colors = np.array([x for x in 'bgrcmykbgrcmykbgrcmykbgrcmyk'])
this is just np.array(list('bgrcmyk' * 4))?
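The reviewer's suggested spelling produces the same array as the original line, just more readably:

```python
import numpy as np

# Original spelling from the diff.
a = np.array([x for x in 'bgrcmykbgrcmykbgrcmykbgrcmyk'])
# Reviewer's suggestion: repeat the 7-colour cycle four times.
b = np.array(list('bgrcmyk' * 4))

assert (a == b).all()  # identical 28-element arrays of single characters
```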
plt.scatter(X[:, 0], X[:, 1], color="black", s=2)
plt.show()
from sklearn import svm
colors = np.array([x for x in 'bgrcmykbgrcmykbgrcmykbgrcmyk'])
blobs = datasets.make_blobs(n_samples=3*n_samples, random_state=8)
# Inferring class on a new random dataset
X_new = StandardScaler().fit_transform(np.random.rand(n_samples*2,2))
y_pred = inductiveLearner.predict(X_new)
plt.scatter(X_new[:, 0], X_new[:, 1], color=colors[y_pred].tolist(), s=5)
This overlay doesn't really work if black is one of the plotted colours.
You need more clear titling/description of the plot.
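For reference, a self-contained reconstruction of the flow the diff fragments suggest: blobs plus random points, DBSCAN to infer labels, and a classifier fit on those labels. The `inductiveLearner` object in the diff is not shown being constructed, so the SVM parameters, blob centers, and the noise-handling step here are guesses chosen to make the sketch run cleanly, not the PR's actual code:

```python
import numpy as np
from sklearn import datasets, svm
from sklearn.cluster import DBSCAN

n_samples = 500  # smaller than the diff's 5000, for speed
rng = np.random.RandomState(8)

# Training data: three well-separated blobs.
X, _ = datasets.make_blobs(
    n_samples=3 * n_samples,
    centers=[[-5, -5], [0, 0], [5, 5]],
    cluster_std=0.8,
    random_state=8,
)

# Transductive step: DBSCAN infers cluster labels for these points only.
y_db = DBSCAN(eps=0.5).fit_predict(X)

# Inductive step: an SVM learns a decision function from the inferred
# labels (dropping DBSCAN's noise points, labelled -1).
mask = y_db != -1
clf = svm.SVC(gamma="scale").fit(X[mask], y_db[mask])

# New random points get labels without re-running the clustering.
X_new = rng.uniform(-7, 7, size=(n_samples * 2, 2))
y_pred = clf.predict(X_new)
```

This also sidesteps the reviewer's plotting complaint: with the training scatter drawn in black, the predicted colour map must not include black.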
@jnothman what is the intention here? Do we need to provide just an example of inductive inference on cluster labels, or to create a meta-estimator?
I don't mind examples containing new meta-estimators...
Closed by #10852
I added an example that uses a synthetic dataset (blobs + random), uses DBSCAN to infer labels, and then fits an SVM over it to infer labels on other data.