Possible duplicate-check bypass in summarization step when the same URL occurs multiple times on the same page of processing. #22
Reference
unobtanium/unobtanium#22
If a URL occurs twice within the same page window during summarization, the duplicate is not detected, because duplicates are checked only against the database and not against the other members of the batch.
This should be fixed partly in the summary pipeline and partly in the database code, so that two entity generations for the same URL cannot be open at the same time.
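
A minimal sketch of the two-part fix, not taken from the unobtanium code base: the pipeline side drops repeated URLs within one batch before any database check, and the database side rejects a second open generation for the same URL. The function names `dedupe_batch` and `open_generation`, the table `entity_generation`, the column `is_open`, and the use of Python with SQLite are all assumptions made for illustration.

```python
import sqlite3
from typing import Iterable, List, Set


def dedupe_batch(urls: Iterable[str]) -> List[str]:
    """Pipeline side: drop URLs repeated within one batch, keeping first-seen order."""
    seen: Set[str] = set()
    unique: List[str] = []
    for url in urls:
        if url not in seen:
            seen.add(url)
            unique.append(url)
    return unique


def open_generation(conn: sqlite3.Connection, url: str) -> bool:
    """Database side: open a generation for `url`; return False if one is already open."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO entity_generation (url, is_open) VALUES (?, 1)",
                (url,),
            )
        return True
    except sqlite3.IntegrityError:
        # The partial unique index below rejected a second open generation.
        return False


# Hypothetical schema: at most one row per URL may have is_open = 1.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entity_generation ("
    "id INTEGER PRIMARY KEY, url TEXT NOT NULL, is_open INTEGER NOT NULL)"
)
conn.execute(
    "CREATE UNIQUE INDEX one_open_generation_per_url "
    "ON entity_generation (url) WHERE is_open = 1"
)

# One page window in which the same URL appears twice.
batch = [
    "https://example.org/a",
    "https://example.org/b",
    "https://example.org/a",
]
unique_urls = dedupe_batch(batch)
opened = [open_generation(conn, url) for url in unique_urls]

# Even if the pipeline check were skipped, the database refuses the duplicate.
second_attempt = open_generation(conn, "https://example.org/a")
```

Having both checks is deliberate in this sketch: the in-batch set catches the common case cheaply, while the uniqueness constraint acts as a backstop if two batches or two workers touch the same URL concurrently.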