[core] Abort on semantic errors by oowekyala · Pull Request #3892 · pmd/pmd

oowekyala · 2022-04-01T21:41:07Z

Describe the PR

This makes PMD skip rule execution if semantic errors are reported, which allows for better error reporting. Previously, errors might have been reported by throwing an exception, which hides any further errors; now we can collect them all. This makes #3891 more useful.

Related issues

Relates to [core] Error handling in PMD 7 #3761
Relates to [apex] Integrate nawforce/ApexLink to build robust Unused rule #2667 (comment)

Ready?

Added unit tests for fixed bug/feature
Passing all unit tests
Complete build ./mvnw clean verify passes (checked automatically by github actions)
Added (in-code) documentation (if needed)

ghost · 2022-04-01T22:16:21Z

	2 Messages
📖	Compared to pmd/7.0.x: This changeset changes 27 violations, introduces 18 new violations, 0 new errors and 0 new configuration errors, removes 25 violations, 0 errors and 0 configuration errors. Full report
📖	Compared to master: This changeset changes 57096 violations, introduces 34418 new violations, 2 new errors and 0 new configuration errors, removes 138132 violations, 25 errors and 6 configuration errors. Full report

Generated by 🚫 Danger

adangel · 2022-05-05T09:23:28Z

Hm... there seems to be a general problem now - as the pmd regression tester says at the end:

[main] ERROR net.sourceforge.pmd.PMD - An error occurred while executing PMD.

There are also many reports of unresolved method calls and such - I guess these are the newly reported semantic errors?

E.g.

Unresolved at In: /home/runner/work/pmd/target/repositories/spring-framework/spring-websocket/src/test/java/org/springframework/web/socket/messaging/OrderedMessageSendingIntegrationTests.java:197:3	[MethodCall:197:3]assertThat(messageHandler.getSavedException()).hasMessage(\n\t\t\t\t\"Buffer size \" + 3

The last line of the progress report seems to be:

Processing files 100% [=] 7551/7551 (0:09:27 / 0:00:00) Violations:307292, Error

Hm... should be something like Errors: xyz - the number of processing errors.

We should solve this before merging these PRs...

adangel · 2022-05-05T13:07:20Z

Ok, so the message "[main] ERROR net.sourceforge.pmd.PMD - An error occurred while executing PMD." means: one processing error occurred. So, nothing out of the ordinary.

But PMD 7 now exits with a failing exit code if that happens:

pmd/pmd-core/src/main/java/net/sourceforge/pmd/PMD.java

Lines 244 to 252 in 1deb4b7

    
    stats = PMD.runAndReturnStats(pmd);
    if (pmdReporter.numErrors() > 0) {
        // processing errors are ignored
        return StatusCode.ERROR;
    } else if (stats.getNumViolations() > 0 && configuration.isFailOnViolation()) {
        return StatusCode.VIOLATIONS_FOUND;
    } else {
        return StatusCode.OK;
    }

I guess, we should restore what the comment (still) says: "// processing errors are ignored".
Or do we have a command line option to enable/disable failing on processing errors?

What I don't understand is, why this happens only on this branch - the baseline on pmd/7.0.x seems to be created without problems - although we find processing errors.

See also #2827 - handling of processing errors with respect to the exit code is still undefined

adangel · 2022-05-05T13:45:54Z

Slowly I begin to understand: in the above snippet, we don't check stats.getNumErrors() (which are the processing errors), but we check pmdReporter.numErrors(), which counts any messages logged at log level error...
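
To make the distinction concrete, here is a minimal sketch of the exit-code decision with the two counters kept apart. The names `ExitStatus`, `ExitCodeSketch` and `decide` are hypothetical and only mirror the shape of the snippet above; this is not the actual PMD code.

```java
/** Hypothetical stand-in for PMD's status codes; illustrative only. */
enum ExitStatus { OK, ERROR, VIOLATIONS_FOUND }

final class ExitCodeSketch {
    private ExitCodeSketch() { }

    /**
     * @param loggedErrors     messages logged at level error (what the reporter counts)
     * @param processingErrors processing errors collected in the report
     * @param violations       number of rule violations found
     * @param failOnViolation  whether violations should produce a failing status
     */
    static ExitStatus decide(int loggedErrors, int processingErrors,
                             int violations, boolean failOnViolation) {
        if (loggedErrors > 0) {
            // Only logged errors are consulted here; processingErrors is
            // deliberately unused, mirroring the snippet quoted above.
            return ExitStatus.ERROR;
        } else if (violations > 0 && failOnViolation) {
            return ExitStatus.VIOLATIONS_FOUND;
        }
        return ExitStatus.OK;
    }
}
```

With this shape, `decide(0, 3, 0, true)` yields `OK` even though three processing errors were collected, while a single logged error is enough to yield `ERROR`.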

adangel · 2022-05-05T14:01:33Z

Ok, it seems to be this error message that is now counted and changes the exit code:

2022-04-23T18:59:00.3846552Z [main] ERROR net.sourceforge.pmd.PMD - at /home/runner/work/pmd/target/repositories/spring-framework/spring-oxm/build/generated/sources/xjc/java/test/org/springframework/oxm/jaxb/test/package-info.java :1:116: Error during type resolution of node FieldAccess

Note: I can reproduce this message on branch pmd/7.0.x as well, but the exit code is 0 there...

adangel · 2022-05-05T14:10:25Z

+        SemanticErrorReporter reporter = SemanticErrorReporter.reportToLogger(configuration.getReporter(), LOG);
        ParserTask task = new ParserTask(
            languageVersion,
            filename,
            sourceCode,
-            SemanticErrorReporter.reportToLogger(LOG),
+            reporter,


Ok, this is the change: previously we used a new SemanticErrorReporter, now the logs are forwarded to the main message reporter as well, which makes PMD exit with 1 if there are errors logged...

oowekyala · 2022-05-05T14:11:34Z

I'm surprised there is a FieldAccess in a package-info.java

> Note: I can reproduce this message on branch pmd/7.0.x as well, but the exit code is 0 there...

That's because semantic errors were previously only forwarded to an slf4j logger, but this PR also forwards them to the MessageReporter, which increments its error count. I thought this was the correct behaviour, but maybe the SemanticErrorReporter should instead create a ProcessingError. I think there is a difference between a ProcessingError (PMD crashes on some file) and a semantic error (PMD detects that the file is not well-formed source code) though...
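
A rough sketch of that forwarding, under stated assumptions: `CountingReporterSketch` and `SemanticReporterSketch` are hypothetical classes that only imitate the roles of MessageReporter and SemanticErrorReporter, and a plain list stands in for the slf4j logger.

```java
import java.util.List;

/** Hypothetical reporter that counts errors (imitates the role of MessageReporter). */
final class CountingReporterSketch {
    private int errors;

    void error(String message) {
        errors++; // every forwarded error bumps the count that decides the exit code
    }

    int numErrors() {
        return errors;
    }
}

/** Hypothetical semantic error reporter: always logs, optionally forwards. */
final class SemanticReporterSketch {
    private final List<String> log;                 // stands in for an slf4j logger
    private final CountingReporterSketch forwardTo; // null means "log only" (old behaviour)

    SemanticReporterSketch(List<String> log, CountingReporterSketch forwardTo) {
        this.log = log;
        this.forwardTo = forwardTo;
    }

    void error(String message) {
        log.add("ERROR " + message);
        if (forwardTo != null) {
            forwardTo.error(message); // new behaviour: the main reporter sees it too
        }
    }
}
```

Constructed with `null`, the reporter only logs and the main error count stays at zero; constructed with a counting reporter, the same message also increments the count.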

adangel · 2022-05-05T14:19:16Z

Ok, I think, that's what we need to do:

Adjust pmd-regression-tester to go on regardless of the exit code
Make the exit code part of the comparison
To help us in analyzing problems, the regression tester should also collect the stdout/stderr and make it available in the report

Overall, I think it makes sense to exit with 1 if PMD logged something suspiciously erroneous. We can think later about whether we should distinguish these from the "hard disruption" error conditions caused by exceptions (which until now were the only reason why we exited with 1), and about how they relate to the processing errors, which we collect in the report.

adangel · 2022-05-05T14:23:51Z

> I think there is a difference between a ProcessingError (PMD crashes on some file) and a semantic error (PMD detects that the file is not well-formed source code) though...

Yes, probably, maybe. Assuming that the code PMD analyzes compiles, all semantic errors probably point to a bug in PMD. But that would be the same for processing errors...

Maybe the difference is just that (at least, how it's implemented atm): processing errors are unexpected errors and semantic errors are detectable error conditions...

oowekyala · 2022-05-06T19:30:01Z

> Maybe the difference is just that (at least, how it's implemented atm): processing errors are unexpected errors and semantic errors are detectable error conditions...

A few thoughts about this

Building on the technical definition of fault and failure:

A semantic error currently is basically a gracefully detected fault, which may or may not cause an error/failure later (in rule execution). The failure can be an exception (processing error), or inconsistent behavior of rules (FNs/FPs). Rules might also "just work" by chance (or, the semantic error itself is an FP). By that reasoning, we could probably tolerate more faults and only skip rule analysis when we know rule processing will almost surely fail.

Currently things like ambiguous references are reported as errors, because that's what a Java compiler would do. But we're not a Java compiler, and skipping rule analysis because of an ambiguous reference seems like overkill. We should probably demote those to warnings, and print something like "warning: input file might be invalid, this may cause problems in rules" (and link to a page of doc with details).

About processing errors, I think they are in the report mostly so that users don't think that the file they occurred in is fine because it has no violations. If semantic errors skip rule analysis, then the same argument applies to them, and they should probably cause a processing error to be reported.

This makes me wonder if we need semantic errors at all. Throwing exceptions (SemanticException?) would achieve nearly the same thing. One difference is that you can only throw one exception, whereas you can accumulate errors if you don't throw them. But we still need a way to skip rule analysis even if no exception is thrown.
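
The throw-versus-accumulate difference can be sketched as follows. Everything here is hypothetical: `ThrowVsCollectSketch` is an illustrative class, and a symbol name starting with `?` simply stands for "unresolved".

```java
import java.util.ArrayList;
import java.util.List;

final class ThrowVsCollectSketch {
    private ThrowVsCollectSketch() { }

    /** Throws on the first unresolved symbol, so later ones are never reported. */
    static void checkThrowing(List<String> symbols) {
        for (String symbol : symbols) {
            if (symbol.startsWith("?")) {
                throw new IllegalStateException("unresolved symbol: " + symbol);
            }
        }
    }

    /** Visits every symbol and returns one message per unresolved symbol. */
    static List<String> checkCollecting(List<String> symbols) {
        List<String> errors = new ArrayList<>();
        for (String symbol : symbols) {
            if (symbol.startsWith("?")) {
                errors.add("unresolved symbol: " + symbol);
            }
        }
        return errors;
    }
}
```

For the input `a, ?b, ?c`, the throwing variant stops at `?b`, while the collecting variant returns messages for both `?b` and `?c`. The caller of the collecting variant still has to decide whether a non-empty list means rule analysis is skipped.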

So in summary,

current semantic errors could be demoted to warnings, with a link to some doc so that users understand the consequences, what they should do to fix it or where they can report FPs
very important errors would be reported with exceptions, which may be either thrown or given to a MessageReporter without throwing. If you throw, the exception would still end up in a MessageReporter when it's caught (PmdRunnable). All of those would cause processing errors to be logged and rule analysis to be skipped. This way we don't have a new error situation for our exit code, only processing errors.
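
As a minimal sketch of the policy summarized above (not the eventual implementation): warnings are recorded and rules still run, while any error is recorded as a processing error and rule analysis is skipped. All names (`FileOutcomeSketch`, `SemanticPolicySketch`, `Problem`, `analyze`) are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical per-file result. */
final class FileOutcomeSketch {
    final List<String> warnings = new ArrayList<>();
    final List<String> processingErrors = new ArrayList<>();
    boolean rulesRan;
}

final class SemanticPolicySketch {
    enum Severity { WARNING, ERROR }

    /** Hypothetical problem found before rule analysis starts. */
    static final class Problem {
        final Severity severity;
        final String message;

        Problem(Severity severity, String message) {
            this.severity = severity;
            this.message = message;
        }
    }

    private SemanticPolicySketch() { }

    static FileOutcomeSketch analyze(List<Problem> problems, Runnable rules) {
        FileOutcomeSketch outcome = new FileOutcomeSketch();
        for (Problem problem : problems) {
            if (problem.severity == Severity.ERROR) {
                // Errors surface as processing errors so the file is not assumed clean.
                outcome.processingErrors.add(problem.message);
            } else {
                outcome.warnings.add(problem.message);
            }
        }
        if (outcome.processingErrors.isEmpty()) {
            rules.run(); // warnings alone do not block rule analysis
            outcome.rulesRan = true;
        }
        return outcome;
    }
}
```

Under this policy an ambiguous reference reported as a warning leaves `rulesRan` true, whereas a single error yields one processing error and no rule execution.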

oowekyala · 2022-05-13T10:41:13Z

I'll try to implement those changes over the weekend

oowekyala added this to the 7.0.0 milestone Apr 1, 2022

[core] Abort on semantic errors

4fa7db2

oowekyala force-pushed the abort-on-semantic-errors branch from f683215 to 4fa7db2 Compare April 1, 2022 23:44

oowekyala added 4 commits April 14, 2022 20:11

Merge branch '7.0.x' into abort-on-semantic-errors

3336d11

Add tests, use MessageReporter in SemanticErrorReporter

38d731c

Add tests for Sem error reporter

d599a7d

Merge branch 'pmd7-lambda-wrong-form-bug' into abort-on-semantic-errors

f374dd2

oowekyala mentioned this pull request Apr 14, 2022

[java] Fix IndexOutOfBoundsException with lambda that has wrong shape #3912

Merged

4 tasks

Initial work to report typing errors in Java

6d2858d

This should be done more thoroughly in a future PR

oowekyala mentioned this pull request Apr 14, 2022

[core] Error handling in PMD 7 #3761

Open

7 tasks

Merge branch 'pmd7-lambda-wrong-form-bug' into abort-on-semantic-errors

341c202

oowekyala mentioned this pull request Apr 15, 2022

[core] Error reporting implementation #3923

Open

12 tasks

Merge branch '7.0.x' into abort-on-semantic-errors

226e63e

oowekyala marked this pull request as ready for review April 22, 2022 17:29

oowekyala mentioned this pull request Apr 23, 2022

[core] Text documents #3893

Merged

5 tasks

oowekyala added 3 commits April 23, 2022 20:01

checkstyle

087f97f

Add a test, cleanup PmdRunnable tests

9b965eb

Remove coupling between BaseLanguageModule and PmdRunnableTest

ff2f5ef

adangel self-requested a review April 28, 2022 15:13

adangel approved these changes May 5, 2022

View reviewed changes

Comment thread pmd-core/src/main/java/net/sourceforge/pmd/processor/PmdRunnable.java Outdated

adangel reviewed May 5, 2022

View reviewed changes

adangel mentioned this pull request May 5, 2022

Make stdout/stderr and exit code available pmd/pmd-regression-tester#109

Closed

oowekyala added 8 commits May 15, 2022 13:05

Merge branch '7.0.x' into abort-on-semantic-errors

fd6f705

Turn many semantic errors into warnings

f291a29

Only use MessageReporter as backend of SemanticErrorReporter

02571c6

Remove info level of SemanticErrorReporter

bd86027

Make semantic errors report processing errors

e759069

Fix pmd warning

0a72e50

fix java tests

5ae11f4

Merge branch '7.0.x' into abort-on-semantic-errors

9a5ab04

oowekyala merged commit 51c890c into pmd:pmd/7.0.x Jun 25, 2022

oowekyala deleted the abort-on-semantic-errors branch June 25, 2022 17:29

adangel mentioned this pull request Jan 23, 2023

PMD 7 Tracking Issue #3898

Closed

55 tasks

[core] Abort on semantic errors #3892: oowekyala merged 19 commits into pmd:pmd/7.0.x from oowekyala:abort-on-semantic-errors
