panc: code cleanup and speedup by stdweird · Pull Request #171 · quattor/pan

stdweird · 2017-08-22T16:45:29Z

Main performance improvements through caching; further code cleanup to get saner callggraphs and traces

stdweird · 2017-08-22T16:52:05Z

This cuts our full compile time in half (around 1000 profiles), the checkStringIndex is roughly 10-15% and should be generic for all usage (given enough profiles); the retrievePanSource caching is 35-40% gain (given enough profiles and usage of includedirs (5 in our case, via cluster.build.includes) and LOADPATH (2 in our case))
YMMV

jrha · 2017-09-11T13:39:53Z

Just tested this at RAL:

aquilon

With a small test sandbox this does seem to reduce the compile time slightly, but more importantly introduces no unusual changes to the profiles.

scdb

No measurable speed-up, but also no changes to profiles.

stdweird · 2017-09-12T06:43:42Z

@jrha heh, i'm surprised and a bit dissapointed. if you find the time, you could try to connect viusalvm to the compiler (just start visualvm, it will see any java proc) and do cpu profile dump (it will slow things down). i can have a look and see if anything is different in aquilon.

jrha · 2017-09-12T11:53:06Z

I'm not sure the aquilon test was large enough to draw any conclusions from, I can profile the scdb build, but I don't know what I'm doing!

stdweird · 2017-09-12T12:18:00Z

@jrha just connect visualvm, start cpu profiing, stop cpu profiling and send me the file 😄

jrha · 2017-09-12T12:54:05Z

Ok sure, which file? 😇

gombasg · 2017-09-14T16:39:03Z

Quick testing shows that based on the "Total time" reported by panc itself, there's about 10% speedup when generating uncompressed JSON (nice), and about 20% slowdown when generating uncompressed XML (ouch!), both compared to 10.2. Even in the XML case, some batches are slower and some are quicker, so I'll need to dig a bit deeper into where the time is spent. In real life, compressed output speed is what matters for us, but using compression makes comparing the output between two runs more difficult - I'll try that later if I find the time.

The only other difference I see is some Unicode characters, which were previously replaced by '??', now are appearing in the output.

stdweird · 2017-09-14T17:35:57Z

@gombasg thanks for testing, but can you also compare to current master https://jenkins0.ugent.be/job/panc/1822/org.quattor.pan$panc/

i do not know where the xml slowdown could come from; but i haven't tested xml output myself. i also do not know what you use to compare, but there shouldn't be a real issue with using gzipped files (we use the following to compare gzipped json diff -u <(zcat old.json.gz) <(zcat new.json.gz)) (well wrapped in script etc https://github.com/stdweird/quattor-SCDB-ugent/blob/master/zdiff.sh)

stdweird · 2017-10-26T09:49:59Z

@gombasg jar files from latest master are in jenkins, eg https://jenkins0.ugent.be/job/panc/lastSuccessfulBuild/org.quattor.pan$panc/

… Patterns

…xpensive filesystem lookups from lookupSource)

…n TreeMap)

…(main use is in TreeMap)

stdweird · 2018-02-17T12:39:16Z

Closing this PR. will be replaced by smaller PRs, hoping this makes merging easier

ned21 added this to the 10.5 milestone Oct 26, 2017

stdweird mentioned this pull request Nov 2, 2017

panc: only allow digit-only string as candidate list index (and allow anything else as dict key) #157

Merged

stdweird added 7 commits February 11, 2018 19:39

panc: TermFactory: cache checkStringIndex to avoid expensive regex calls

0637911

panc: termFactory: switch to thread-safe Matcher instances instead of…

f73a0dd

… Patterns

panc: Path: improve and cleanup regex usage

c1229ef

panc: FileSystemSourceRepository: cache retrievePanSource (to avoid e…

6e17e20

…xpensive filesystem lookups from lookupSource)

panc: Property: do not cache (and init) the hashcode (main usage is i…

54ca29c

…n TreeMap)

panc: cleanup/simplify the String and Long Property compareTo method …

2b127dd

…(main use is in TreeMap)

panc: SourceFile: use string concatenation instead of very slow format

a5950a0

stdweird force-pushed the regexp_speedup branch from 3a9b987 to a5950a0 Compare February 11, 2018 19:19

stdweird mentioned this pull request Feb 17, 2018

panc: TermFactory: cache Term create #184

Merged

stdweird closed this Feb 17, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

panc: code cleanup and speedup#171

panc: code cleanup and speedup#171
stdweird wants to merge 7 commits intoquattor:masterfrom
stdweird:regexp_speedup

stdweird commented Aug 22, 2017 •

edited

Loading

Uh oh!

stdweird commented Aug 22, 2017

Uh oh!

jrha commented Sep 11, 2017

Uh oh!

stdweird commented Sep 12, 2017

Uh oh!

jrha commented Sep 12, 2017

Uh oh!

stdweird commented Sep 12, 2017

Uh oh!

jrha commented Sep 12, 2017

Uh oh!

gombasg commented Sep 14, 2017

Uh oh!

stdweird commented Sep 14, 2017

Uh oh!

stdweird commented Oct 26, 2017

Uh oh!

stdweird commented Feb 17, 2018

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants

Conversation

stdweird commented Aug 22, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stdweird commented Aug 22, 2017

Uh oh!

jrha commented Sep 11, 2017

aquilon

scdb

Uh oh!

stdweird commented Sep 12, 2017

Uh oh!

jrha commented Sep 12, 2017

Uh oh!

stdweird commented Sep 12, 2017

Uh oh!

jrha commented Sep 12, 2017

Uh oh!

gombasg commented Sep 14, 2017

Uh oh!

stdweird commented Sep 14, 2017

Uh oh!

stdweird commented Oct 26, 2017

Uh oh!

stdweird commented Feb 17, 2018

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants

stdweird commented Aug 22, 2017 •

edited

Loading