fix(common/core/web): optimizes transform construction #5248
Conversation
let text = `Did you ever hear the Tragedy of Darth Plagueis the wise? I thought not.
It's not a story the Jedi would tell you. It's a Sith legend. Darth Plagueis was a
Dark Lord of the Sith, so powerful and so wise he could use the Force to influence
the midichlorians to create life... He had such a knowledge of the dark side that
he could even keep the ones he cared about from dying. The dark side of the Force
is a pathway to many abilities some consider to be unnatural. He became so powerful...
the only thing he was afraid of was losing his power, which eventually, of course,
he did. Unfortunately, he taught his apprentice everything he knew, then his
apprentice killed him in his sleep. It's ironic he could save others from death,
but not himself.`; // Sheev Palpatine, in the Star Wars prequels.
Had to pick something to use for a "long text" test case.
When I copied this 512 times over, the unit test took nearly a second to run, somehow, for both the original and the new versions. (I definitely expected the original to take longer.) Not sure if there was some odd caching effect happening or what.
Using a single copy of this as the initial value of the "robust testing" page's text area works quite well for interactively testing the change. Simply insert it within the element here, between the tags:
mcdurdin
left a comment
Well, the algorithm change does make a big difference, for sure. The same test on my Android phone went from 400+ msec script time to 30 msec script time. That's a big enough improvement that we can probably justify merging this and backporting to stable-14.0.
Still, it does just move the goalposts. As expected, when I tested with longer documents, I observed linear growth in script time. So this still fails with long documents -- and not all that long, really: with just over one page of text in a note, I saw 160 msec per key event, which is barely tolerable. This is on a relatively fast device (for Android), so slower devices would probably be completely unusable with that much content.
The big problem in this whole algorithm is that it is always scanning the whole document. That's not limited to this one function; it applies to the whole approach, I think. That seems completely unnecessary: we're never going to have a keyboard that affects more than the last few characters.
We should be working with a sliding window, say 128 code units either side of the caret. Is this possible to achieve? We may still need to iterate through the document once to get our relative code point base for the sliding window but that should be done just once for a keystroke and would then be relatively inconsequential.
I know that all our indexing is by document position, so it may be a significant challenge to rewrite it. But improved algorithms are only going to take us so far and I think we may need to consider this for a full fix.
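Purely as an illustration of the sliding-window idea (the names and types below are hypothetical, not actual KMW APIs), the extraction step might look something like this, keeping the window's document offset so window-relative indices can be translated back to document positions:

```typescript
// Hypothetical sketch: take up to WINDOW_SIZE code units on either side of
// the caret and remember where the window starts within the full document.
const WINDOW_SIZE = 128;

interface ContextWindow {
  text: string;       // the slice of the document visible to the engine
  startIndex: number; // document offset of text[0]
}

function getContextWindow(doc: string, caret: number): ContextWindow {
  let start = Math.max(0, caret - WINDOW_SIZE);
  let end = Math.min(doc.length, caret + WINDOW_SIZE);
  // Don't split a surrogate pair at either window edge; widen the window
  // by one code unit if a boundary lands on a low surrogate.
  const isLowSurrogate = (c: number) => c >= 0xdc00 && c <= 0xdfff;
  if (start > 0 && isLowSurrogate(doc.charCodeAt(start))) start--;
  if (end < doc.length && isLowSurrogate(doc.charCodeAt(end))) end++;
  return { text: doc.substring(start, end), startIndex: start };
}
```

The one-time document scan mentioned above would establish the code-point base for `startIndex`; after that, every keystroke only ever touches the window.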
As noted, this is important... but is also definitely not trivial. The major pain point would be retooling how suggestions and reversions are applied. The process for both involves comparing the "before" snapshot with the current context state, which gets tricky when context is represented by sliding windows. It does seem possible, though. I do think we should have a "design" round before making such a change. There are a number of ramifications to think through, especially given how critical context synchronization is when embedded within the mobile apps.
Agreed. Let's go ahead with this fix; as a stop-gap it will definitely help, but let's continue to consider a full fix in a future update.
mcdurdin
left a comment
LGTM. Let's cherry-pick this to stable-14.0
Future work documented as #5258.
Changes in this pull request will be available for download in Keyman version 15.0.65-alpha |
Addresses #5246.
In an isolated, headless unit test environment, this didn't seem to show much improvement despite changing a particularly inefficient part of the algorithm from O(N^2) to O(N). (Maybe there were caching effects somehow?)
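For illustration only (this is not the actual patch, and `Transform` here is a stand-in type), the kind of O(N) transform construction at play can be sketched as a single-pass common-prefix/common-suffix trim between the old and new context:

```typescript
// Illustrative sketch: derive a keystroke transform (code units to delete
// left of the caret, plus text to insert) by trimming the shared prefix and
// suffix of the old and new context in one linear pass, instead of
// repeatedly re-scanning the text.
interface Transform {
  deleteLeft: number;
  insert: string;
}

function buildTransform(oldText: string, newText: string): Transform {
  const maxPrefix = Math.min(oldText.length, newText.length);
  // Length of the shared prefix.
  let prefix = 0;
  while (prefix < maxPrefix && oldText[prefix] === newText[prefix]) {
    prefix++;
  }
  // Length of the shared suffix, not overlapping the prefix.
  let suffix = 0;
  while (
    suffix < maxPrefix - prefix &&
    oldText[oldText.length - 1 - suffix] === newText[newText.length - 1 - suffix]
  ) {
    suffix++;
  }
  return {
    deleteLeft: oldText.length - prefix - suffix,
    insert: newText.substring(prefix, newText.length - suffix),
  };
}
```

Each character is examined at most twice, so the cost stays linear in the context length -- which is exactly why the sliding-window discussion above still matters for very long documents.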
That said... it did make a remarkable difference in responsiveness when I used the KMW "predictive text: robust testing" page; keystroke processing as a whole took less time than focusing operations as a result.
I want to test it out a bit more before taking this out of draft, but it does seem to yield some significant improvement. Things still struggle with very long text, of course.