-
-
Notifications
You must be signed in to change notification settings - Fork 26.9k
Closed
Labels
ModerateAnything that requires some knowledge of conventions and best practicesAnything that requires some knowledge of conventions and best practiceshelp wanted
Description
There exist many defintions of NMI and AMI.
Vinh, N. X., Epps, J., & Bailey, J. (2010). Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance. Journal of Machine Learning Research, 11(Oct), 2837-2854.
mention 5 different definitions of NMI, and based on that give 4 different AMI.
The NMI implemented in sklearn uses sqrt(H(U), H(V)) for normalization.
The AMI implemented in sklearn uses max(H(U), H(V)) for normalization.
There exists an NMI with the max normalization, and a AMI with the sqrt normalization, so this is inconsistent in sklearn. Ideally, they would both use the same definition by default, and allow using any of the others via an option.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
ModerateAnything that requires some knowledge of conventions and best practicesAnything that requires some knowledge of conventions and best practiceshelp wanted