-
Notifications
You must be signed in to change notification settings - Fork 313
Checkpoint acknowledgments for DynamoDB pipelines #4764
Description
Is your feature request related to a problem? Please describe.
As a user of the dynamodb source, when I have acknowledgments enabled, shards are not checkpointed by sequence number in the coordination store. This means that when data prepper stops and then starts, it will process the full shard again, rather than only processing after the checkpoint from the acknowledgments. This results in some duplicate processing and higher end to end latency reporting.
Describe the solution you'd like
Checkpoint the shards by sequence number with acknowledgments to start reading shards from the acknowledgment checkpoint to prevent duplicate processing
Describe alternatives you've considered (Optional)
A clear and concise description of any alternative solutions or features you've considered.
Additional context
Add any other context or screenshots about the feature request here.