Fix bugs in ParallelLeiden #1328

adsharma · 2025-07-04T23:16:55Z

code inconsistent with the comment
missed initialization

Also removed the warning, based on a comparison vs GVE Leiden

================================================================================
RESULTS SUMMARY
================================================================================
Algorithm Performance:
  leiden-communities-openmp Leiden:  9.0327s
  NetworkIt Leiden:                  9.1313s

Performance Comparison:
  leiden-communities-openmp vs NetworkIt Leiden:  1.01x

Community Detection Results:
  leiden-communities-openmp Leiden:  4504 communities, modularity 0.743539
  NetworkIt Leiden:                  2283 communities, modularity 0.788739

Quality Comparison:
  Modularity difference (LC vs NK Leiden):  0.045199

Related to: #1244

- code inconsistent with the comment - missed initialization Also removed the warning, based on a comparison vs GVE Leiden

adsharma · 2025-07-06T16:36:38Z

Cc: @CxVercility @fabratu

CxVercility · 2025-07-06T17:23:40Z

Hi there,
thanks for the info. At least I can now have some peace of mind in knowing that my paper didnt just contain complete nonsense 😂
Interestingly, the first bug is not present in my original implementation so this must have happened during the review/change cycle for networkit. That probably explains how my own testing didnt have such atrocious modularity results.
Greets

fabratu · 2025-07-07T07:16:41Z

Good catch and thanks for the PR. Based on your test results, the error is minor after all. I also looked into the code in the past, but overlooked this rather simple mistakes sigh

There is still one error on the pipeline for MSVC. The described error points towards an ill-sized vector in ParallelLeiden. Based on past experience this also might be kind of a false positive. Do you have time to investigate? Otherwise I can take care of this.

Just out of curiosity - what network (or networks) did you use for your comparison? It makes sense, that NetworKit Leiden is able to produce higher modularity scores, since GVE-Leiden uses a more simplistic heuristic for detecting and fixing disconnected communities. But this also might be due to non-deterministic effects. I would assume though, that the performance of GVE should be higher due to some tricks (one of them being the more simplistic cut-logic).

adsharma · 2025-07-07T19:59:39Z

Do you have time to investigate? Otherwise I can take care of this.

Not super familiar with windows. Would appreciate help.

I used the LiveJournal graph for this test.
I have one concern with the reported time here (~9 seconds). The C++ version does it in about 2 seconds.

The rest of the time goes into converting between python <-> C++ vectors. Do you have any thoughts on how to optimize this? Perhaps we can discuss on a separate issue.

fabratu · 2025-07-08T07:01:13Z

@adsharma Alright, I will look into the Windows error. Might take 1-2 days.

I also found your other pull request. The problem / use case you are trying to solve here is to have a parallel Leiden algorithm with Python front-end, correct? And the reported running time in this PR is from your proposed Python interface for GVE-Leiden against the Python interface of ParallelLeiden in NetworKit?

If so, that would explain the running time parity. In the GVE-Leiden paper, the authors show a measurable performance gain with respect to ParallelLeiden in C++. Some of it is likely not reproducible if we integrate GVE into NetworKit due to existing design decisions. We currently have a student project to merge the ideas of GVE-Leiden (+ other implementations) and ParallelLeiden, thereby also fixing the previous implementation. Since the fixing part seems rather simple, we can merge this PR after I looked in the Windows error. Further improvements concerning the C++ part can then be handled by the student project.

I would also recommend to discuss the overhead stemming from conversion time in another PR. Due to the number of algorithms available in NetworKit, there would a nice impact if the conversion process would be faster. Thanks again for your contribution :)

adsharma · 2025-07-08T15:52:54Z

Yes - let's take the discussion about further optimizations to #1331 and keep this one limited to a narrow bug fix.

Thanks for looking into Windows CI check.

adsharma · 2025-07-11T00:34:39Z

@fabratu looks like windows CI is also passing. Please take another look.

Fix bugs in ParallelLeiden

13ab15d

- code inconsistent with the comment - missed initialization Also removed the warning, based on a comparison vs GVE Leiden

adsharma mentioned this pull request Jul 4, 2025

Python bindings nodef/leiden-communities-openmp.py#1

Closed

CxVercility pushed a commit to CxVercility/ParallelLeiden that referenced this pull request Jul 6, 2025

Fixed missing init. see networkit/networkit#1328

e4fcf86

adsharma mentioned this pull request Jul 7, 2025

Optimize for Arrow in-memory format #1331

Open

Potential fix for vector out of bounds

2817ff4

fabratu approved these changes Jul 11, 2025

View reviewed changes

fabratu merged commit 13ccda4 into networkit:master Jul 14, 2025
29 checks passed

adsharma deleted the leiden_fixes branch July 14, 2025 16:26

rpadmanabhan mentioned this pull request Oct 20, 2025

New Release ? #1364

Closed

fabratu mentioned this pull request Nov 3, 2025

Doc: Remove hint towards errornuous results of ParallelLeiden. #1370

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix bugs in ParallelLeiden #1328

Fix bugs in ParallelLeiden #1328

Uh oh!

adsharma commented Jul 4, 2025

Uh oh!

adsharma commented Jul 6, 2025

Uh oh!

CxVercility commented Jul 6, 2025 •

edited

Loading

Uh oh!

fabratu commented Jul 7, 2025

Uh oh!

adsharma commented Jul 7, 2025

Uh oh!

fabratu commented Jul 8, 2025

Uh oh!

adsharma commented Jul 8, 2025

Uh oh!

adsharma commented Jul 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix bugs in ParallelLeiden #1328

Fix bugs in ParallelLeiden #1328

Uh oh!

Conversation

adsharma commented Jul 4, 2025

Uh oh!

adsharma commented Jul 6, 2025

Uh oh!

CxVercility commented Jul 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fabratu commented Jul 7, 2025

Uh oh!

adsharma commented Jul 7, 2025

Uh oh!

fabratu commented Jul 8, 2025

Uh oh!

adsharma commented Jul 8, 2025

Uh oh!

adsharma commented Jul 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CxVercility commented Jul 6, 2025 •

edited

Loading