Add Clauset-Newman-Moore modularity-max community detection by elplatt · Pull Request #2871 · networkx/networkx

elplatt · 2018-02-07T04:52:19Z

* Adds modularity_max module to networkx.algorithms.community package
* Adds greedy_modularity_communities() function with CNM implementation
* Adds test using Zachary karate club network
* Add networkx.utils.mapped_queue used by the above implementation
* Add tests for networkx.utils.mapped_queue

* Adds modularity_max module to networkx.algorithms.community package * Adds greedy_modularity_communities() function * Adds test using Zachary karate club network.

* Replace greedy_modularity_communities() with CNM implementation. * Add networkx.utils.mapped_queue used by the above implementation. * Add tests for networkx.utils.mapped_queue.

* Add TestNaive class.

* Removes call to modularity() left over from debugging. Results in a significant speed-up.

dschult · 2018-02-18T05:20:42Z

This looks pretty good! Thanks!
Can you get it to PEP8 standards? (you can install a pep8 style checker using pip install pep8. probably it only will need you to follow the 80 character limit on each line.

Can you say something about the need for the locally written and maintained queue utility?
Could it be replaced by Python's heapq? If so, that could remove maintenance issues for us later.

elplatt · 2018-02-18T17:12:42Z

I'll work on pep8.

The heapq module is insufficient for this implementation because it doesn't support removing or updating elements (aside from removing the root). There is a third-party HeapDict package (BSD 3-clause license) that looks like it could work. That package stores elements and priorities as separate values, which isn't needed for this algorithm and would add both time and space overhead. Would it be worth doing some comparisons using that implementation instead? I'm not sure how desirable it is to add a dependency.

* Update modularity_max module and tests for pep8. * Update mapped_queue module and tests for pep8.

dschult · 2018-02-19T05:00:25Z

Thanks for that description. I would prefer not to add a dependency. How often do you have to remove a non-root element? I'll look at the code some more...

elplatt · 2018-02-19T14:54:32Z

The algorithm maintains a matrix of how much the modularity would increase by merging each pair of communities. There's a priority queue for each row, and one more for the max of each row. After each merge, a row/column is deleted and the modularity deltas are updated for all neighbors of the merged communities, which is where heapq falls short.

dschult · 2018-02-23T23:14:19Z

OK.. That makes sense to me.

Could you add to the doc file doc/reference/algorithms/community.rst to make these functions appear in the documentation reference? I think it is then ready to go. If that's too much, let me know and I'll do it.

…#2871) * Add greedy modularity maximization community detection. * Adds modularity_max module to networkx.algorithms.community package * Adds greedy_modularity_communities() function * Adds test using Zachary karate club network. * Add Clauset-Newman-Moore community detection. * Replace greedy_modularity_communities() with CNM implementation. * Add networkx.utils.mapped_queue used by the above implementation. * Add tests for networkx.utils.mapped_queue. * Add tests for naive modularity maximization. * Add TestNaive class. * Remove redundant modularity calculation. * Removes call to modularity() left over from debugging. Results in a significant speed-up. * Comply with pep8. * Update modularity_max module and tests for pep8. * Update mapped_queue module and tests for pep8. * Fix import of MappedQueue. * Add documentation for modularity_max module.

MridulS · 2019-03-30T15:03:01Z

networkx/algorithms/community/modularity_max.py

+#   Edward L. Platt <ed@elplatt.com>
+#
+# TODO:
+#   - Alter equations for weighted case


@elplatt Was this TODO done, or is there still something to do about this?

elplatt · 2019-03-30T15:44:06Z

I've figured out the appropriate equations for all combinations of (un)weighted and (un)directed, including self loops, but they're not implemented yet. We also need some known results to write test cases against. Equations are here: https://github.com/elplatt/Paper-CNM-Modularity

…

On Sat, Mar 30, 2019 at 11:03 AM Mridul Seth ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In networkx/algorithms/community/modularity_max.py <#2871 (comment)>: > @@ -0,0 +1,282 @@ +# modularity_max.py - functions for finding communities based on modularity +# +# Copyright 2018 Edward L. Platt +# +# This file is part of NetworkX +# +# NetworkX is distributed under a BSD license; see LICENSE.txt for more +# information. +# +# Authors: +# Edward L. Platt ***@***.***> +# +# TODO: +# - Alter equations for weighted case @elplatt <https://github.com/elplatt> Was this TODO done, or is there still something to do about this? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#2871 (review)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAS0WPGGNJ4mRFt7XSqTWvD6WS2Gm8tQks5vb3ysgaJpZM4R8JBJ> .

-- Edward L. Platt PhD Candidate, University of Michigan School of Information he/him | https://elplatt.com | @elplatt | @elplatt@social.coop Tips for stopping email overload: https://hbr.org/2012/02/stop-email-overload-1

elplatt added 2 commits February 6, 2018 23:33

Add greedy modularity maximization community detection.

5a4d0b7

* Adds modularity_max module to networkx.algorithms.community package * Adds greedy_modularity_communities() function * Adds test using Zachary karate club network.

Add Clauset-Newman-Moore community detection.

46cae14

* Replace greedy_modularity_communities() with CNM implementation. * Add networkx.utils.mapped_queue used by the above implementation. * Add tests for networkx.utils.mapped_queue.

elplatt mentioned this pull request Feb 7, 2018

Add greedy modularity maximization community detection. #2855

Closed

elplatt added 2 commits February 7, 2018 11:40

Add tests for naive modularity maximization.

0139618

* Add TestNaive class.

Remove redundant modularity calculation.

27cd953

* Removes call to modularity() left over from debugging. Results in a significant speed-up.

Comply with pep8.

48e6f5e

* Update modularity_max module and tests for pep8. * Update mapped_queue module and tests for pep8.

Fix import of MappedQueue.

56df201

Add documentation for modularity_max module.

137d36f

dschult merged commit b7bafdb into networkx:master Feb 28, 2018

dschult added this to the networkx-2.2 milestone Feb 28, 2018

dschult added the type: Enhancements label Feb 28, 2018

MridulS reviewed Mar 30, 2019

View reviewed changes

MridulS mentioned this pull request May 18, 2022

Integration of Ben Edwards' GSoC project 2011 on community detection #764

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Clauset-Newman-Moore modularity-max community detection#2871

Add Clauset-Newman-Moore modularity-max community detection#2871
dschult merged 7 commits intonetworkx:masterfrom
elplatt:feature-modularity_max

elplatt commented Feb 7, 2018

Uh oh!

dschult commented Feb 18, 2018

Uh oh!

elplatt commented Feb 18, 2018

Uh oh!

dschult commented Feb 19, 2018

Uh oh!

elplatt commented Feb 19, 2018 •

edited

Loading

Uh oh!

dschult commented Feb 23, 2018

Uh oh!

MridulS Mar 30, 2019

Uh oh!

elplatt commented Mar 30, 2019 via email

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

elplatt commented Feb 7, 2018

Uh oh!

dschult commented Feb 18, 2018

Uh oh!

elplatt commented Feb 18, 2018

Uh oh!

dschult commented Feb 19, 2018

Uh oh!

elplatt commented Feb 19, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dschult commented Feb 23, 2018

Uh oh!

MridulS Mar 30, 2019

Choose a reason for hiding this comment

Uh oh!

elplatt commented Mar 30, 2019 via email

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

elplatt commented Feb 19, 2018 •

edited

Loading