[MRG+2] modify disadvantage #8521
Merged
jmschrei merged 1 commit into scikit-learn:master on Mar 4, 2017
Conversation
SVM can work effectively when the number of features is much greater than the number of samples. But over-fitting usually happens in such situations, so avoiding it by choosing an appropriate kernel (model selection) is important.
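The situation described above can be sketched as follows. This is an illustrative example only (not code from this pull request): random high-dimensional data with far more features than samples, fit with a linear-kernel SVC, where a linear kernel is often a safer choice than RBF since such data tends to be linearly separable already.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.RandomState(0)
X = rng.randn(50, 1000)          # 50 samples, 1000 features: n_features >> n_samples
y = rng.randint(0, 2, size=50)   # random binary labels (pure noise)

# Linear kernel keeps model capacity down relative to RBF in this regime.
clf = SVC(kernel="linear", C=1.0)
clf.fit(X, y)

# Even noise labels are typically fit almost perfectly here, which is
# exactly the over-fitting risk the disadvantage note warns about.
print("training accuracy:", clf.score(X, y))
```

A high training score on random labels like this is a red flag that kernel choice and regularization, not raw accuracy, should drive model selection.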
Member
LGTM.
amueller reviewed on Mar 4, 2017

      - If the number of features is much greater than the number of
    -     samples, the method is likely to give poor performances.
    +     samples, avoid over-fitting in choosing :ref:`svm_kernels` and regularization
Member
Did you respect the 80 character line length?
Member
Looks like it's 84 characters, is that a major issue?
Codecov Report

    @@           Coverage Diff            @@
    ##           master    #8521   +/-   ##
    =======================================
      Coverage   95.48%   95.48%
    =======================================
      Files         342      342
      Lines       60913    60913
    =======================================
      Hits        58160    58160
      Misses       2753     2753

Continue to review the full report at Codecov.
Member
LGTM as well.
Member
Congrats @Ellen-Co2 !
herilalaina pushed a commit to herilalaina/scikit-learn that referenced this pull request on Mar 26, 2017: [MRG+2] modify disadvantage
massich pushed a commit to massich/scikit-learn that referenced this pull request on Apr 26, 2017: [MRG+2] modify disadvantage
Sundrique pushed a commit to Sundrique/scikit-learn that referenced this pull request on Jun 14, 2017: [MRG+2] modify disadvantage
NelleV pushed a commit to NelleV/scikit-learn that referenced this pull request on Aug 11, 2017: [MRG+2] modify disadvantage
paulha pushed a commit to paulha/scikit-learn that referenced this pull request on Aug 19, 2017: [MRG+2] modify disadvantage
maskani-moh pushed a commit to maskani-moh/scikit-learn that referenced this pull request on Nov 15, 2017: [MRG+2] modify disadvantage
Reference Issue
Fixes #8450
What does this implement/fix? Explain your changes.
In the case of high dimensionality, SVM can still work effectively, but the over-fitting issue still needs to be considered, because the VC dimension might be close to infinite in such cases; choosing the kernel or controlling the regularization factor "C" is therefore essential.
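A minimal sketch of the point about controlling "C" (illustrative data and parameter values, not part of this PR): smaller C means stronger regularization, which matters when feature count dwarfs sample count.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

rng = np.random.RandomState(42)
X = rng.randn(80, 500)               # 80 samples, 500 features
w = rng.randn(500)
y = (X @ w > 0).astype(int)          # labels with a true linear structure

# Compare cross-validated accuracy across regularization strengths.
# Smaller C shrinks the margin penalty, i.e. stronger regularization.
for C in (0.01, 1.0, 100.0):
    scores = cross_val_score(SVC(kernel="linear", C=C), X, y, cv=5)
    print(f"C={C}: mean CV accuracy = {scores.mean():.2f}")
```

Sweeping C this way (typically via `GridSearchCV` in practice) is the standard route to picking the regularization level rather than trusting training accuracy.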
Any other comments?
To test for over-fitting, using cross-validation or a larger hold-out set can be useful. Check some discussions regarding dimensionality here.
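The hold-out check described above can be sketched like this (a hypothetical example, assuming noise-only labels to make the over-fit visible): compare training accuracy against held-out accuracy, and a large gap signals over-fitting.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split

rng = np.random.RandomState(0)
X = rng.randn(100, 2000)             # many more features than samples
y = rng.randint(0, 2, size=100)      # pure noise labels: nothing real to learn

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

# A flexible kernel with weak regularization (large C) memorizes the noise.
clf = SVC(kernel="rbf", C=100.0).fit(X_tr, y_tr)

print("train accuracy:", clf.score(X_tr, y_tr))  # typically near-perfect
print("test accuracy: ", clf.score(X_te, y_te))  # typically near chance (0.5)
```

The train/test gap, not the training score itself, is the quantity to monitor; cross-validation averages this check over several splits.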