Encoder indent suppression: str.replace → re.sub (fixes #170) by cgevans · Pull Request #171 · UC-Davis-molecular-computing/scadnano-python-package

cgevans · 2021-03-16T23:59:47Z

Using re.sub (re is already imported), along with a replacement function that pulls from the replacement map, avoids repeated string replacements within Python and significantly speeds up encoding for large files. Note that I think this will raise a KeyError if the user has something with a name fitting "@@(\d+)@@" where the number is larger than the largest unique_id, and will corrupt the file if it is not. However, this is similar to the current situation, where I think it will never raise an error, but could corrupt the file.

This also changes the type of _replacement_map to avoid a mypy warning: it should be a correct narrowing of the value from Any to str, however.

Trying this with the design I'm trying to repeatedly save, the change brings write time down from 20 seconds to a little over 1 second.

…cular-computing#170) Using re.sub (re is already imported), along with a replacement function that pulls from the replacement map, avoids repeated string replacements within Python and significantly speeds up encoding for large files. Note that I think this will raise a KeyError if the user has something with a name fitting "@@(\d+)@@" where the number is larger than the largest unique_id, and will corrupt the file if it is not. However, this is similar to the current situation.

dave-doty

I don't quite understand what's going on in this new line of code, but it doesn't seem to break any unit tests, so I'll take you word that it works.

cgevans requested a review from dave-doty as a code owner March 16, 2021 23:59

dave-doty approved these changes Mar 17, 2021

View reviewed changes

dave-doty merged commit 5b73c2f into UC-Davis-molecular-computing:dev Mar 17, 2021

cgevans deleted the speedup-encoder branch March 17, 2021 00:38

dave-doty mentioned this pull request Mar 18, 2021

add optional parameter suppress_indent to method Design.write_scadnano_file #170

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Encoder indent suppression: str.replace → re.sub (fixes #170)#171

Encoder indent suppression: str.replace → re.sub (fixes #170)#171
dave-doty merged 1 commit intoUC-Davis-molecular-computing:devfrom
cgevans:speedup-encoder

cgevans commented Mar 16, 2021 •

edited

Loading

Uh oh!

dave-doty left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cgevans commented Mar 16, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dave-doty left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cgevans commented Mar 16, 2021 •

edited

Loading