Skip to content

Checkpoint acknowledgments for DynamoDB pipelines #4764

@graytaylor0

Description

@graytaylor0

Is your feature request related to a problem? Please describe.
As a user of the dynamodb source, when I have acknowledgments enabled, shards are not checkpointed by sequence number in the coordination store. This means that when data prepper stops and then starts, it will process the full shard again, rather than only processing after the checkpoint from the acknowledgments. This results in some duplicate processing and higher end to end latency reporting.

Describe the solution you'd like
Checkpoint the shards by sequence number with acknowledgments to start reading shards from the acknowledgment checkpoint to prevent duplicate processing

Describe alternatives you've considered (Optional)
A clear and concise description of any alternative solutions or features you've considered.

Additional context
Add any other context or screenshots about the feature request here.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions