Skip to content

Allow setting non-standard AWS Glue Registry endpoints for testing#5502

Merged
kkondaka merged 1 commit intoopensearch-project:mainfrom
seschis:local-aws-glue-registry
Mar 18, 2025
Merged

Allow setting non-standard AWS Glue Registry endpoints for testing#5502
kkondaka merged 1 commit intoopensearch-project:mainfrom
seschis:local-aws-glue-registry

Conversation

@seschis
Copy link
Copy Markdown
Contributor

@seschis seschis commented Mar 5, 2025

Description

Add support for the kafka.schema.registry_url setting for the aws_glue schema registry. Previously this setting was ignored and the code would only connect to the standard aws glue registry endpoint. This change allows for running a local aws glue registry using something like localstack or motoserver so that local integration testing can occur. We noticed this problem when setting up AWS OSIS and being unable to locally perform integration testing in our application. Closes #5377

Issues Resolved

Resolves #5377

Check List

  • New functionality includes testing.
  • New functionality has a documentation issue. Please link to it in this PR.
  • Commits are signed with a real name per the DCO

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

topicMetrics.getNumberOfPollAuthErrors().increment();
Thread.sleep(10000);
} catch (RecordDeserializationException e) {
LOG.error("Deserialization error while doing poll()", e);
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I originally had a problem where I didn't set the AWS_SECRET_ACCESS_KEY in the data-prepper docker image and it was failing to connect to the instance of motoserver I was running. The helpful error message in the stack trace wasn't being displayed as it was about the 3rd "caused-by" down in the trace. I had to add this error message to see it.

I don't think this is ideal so I wanted some guidance on how you want me to handle this error so that it provides a meaningful message to a user for correcting things.

Change set allows using the kafka.schema.registry_url setting to set
non-standard endpoints when using a mock or fake AWS glue registry
(opensearch-project#5377).

Signed-off-by: Shane Schisler <shane@schisler.net>
@seschis seschis force-pushed the local-aws-glue-registry branch from 370ee4c to a8a32b7 Compare March 5, 2025 13:36
@seschis
Copy link
Copy Markdown
Contributor Author

seschis commented Mar 5, 2025

I see that the pipeline currently has some accepted failing tests based on previous merges. I briefly looked into the failing tests and they didn't make a lot of sense based on the changes I made. Before I go too deep troubleshooting the Kafka plugin integration test failures, could someone let me know if these are expected and accepted as failing as well.

Copy link
Copy Markdown
Member

@dlvenable dlvenable left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good to me. @kkondaka , can you also take a look?

@kkondaka kkondaka merged commit 2703e5e into opensearch-project:main Mar 18, 2025
45 of 50 checks passed
chenqi0805 pushed a commit to chenqi0805/data-prepper that referenced this pull request Apr 2, 2025
…pensearch-project#5502)

Change set allows using the kafka.schema.registry_url setting to set
non-standard endpoints when using a mock or fake AWS glue registry
(opensearch-project#5377).

Signed-off-by: Shane Schisler <shane@schisler.net>
Co-authored-by: Shane Schisler <shane@schisler.net>
chenqi0805 pushed a commit to chenqi0805/data-prepper that referenced this pull request Apr 2, 2025
…pensearch-project#5502)

Change set allows using the kafka.schema.registry_url setting to set
non-standard endpoints when using a mock or fake AWS glue registry
(opensearch-project#5377).

Signed-off-by: Shane Schisler <shane@schisler.net>
Co-authored-by: Shane Schisler <shane@schisler.net>
Signed-off-by: George Chen <qchea@amazon.com>
amdhing pushed a commit to amdhing/data-prepper that referenced this pull request Apr 16, 2025
…pensearch-project#5502)

Change set allows using the kafka.schema.registry_url setting to set
non-standard endpoints when using a mock or fake AWS glue registry
(opensearch-project#5377).

Signed-off-by: Shane Schisler <shane@schisler.net>
Co-authored-by: Shane Schisler <shane@schisler.net>
Davidding4718 pushed a commit to Davidding4718/data-prepper that referenced this pull request Apr 25, 2025
…pensearch-project#5502)

Change set allows using the kafka.schema.registry_url setting to set
non-standard endpoints when using a mock or fake AWS glue registry
(opensearch-project#5377).

Signed-off-by: Shane Schisler <shane@schisler.net>
Co-authored-by: Shane Schisler <shane@schisler.net>
Davidding4718 pushed a commit to Davidding4718/data-prepper that referenced this pull request Apr 25, 2025
…pensearch-project#5502)

Change set allows using the kafka.schema.registry_url setting to set
non-standard endpoints when using a mock or fake AWS glue registry
(opensearch-project#5377).

Signed-off-by: Shane Schisler <shane@schisler.net>
Co-authored-by: Shane Schisler <shane@schisler.net>
Mamol27 pushed a commit to Mamol27/data-prepper that referenced this pull request May 6, 2025
…pensearch-project#5502)

Change set allows using the kafka.schema.registry_url setting to set
non-standard endpoints when using a mock or fake AWS glue registry
(opensearch-project#5377).

Signed-off-by: Shane Schisler <shane@schisler.net>
Co-authored-by: Shane Schisler <shane@schisler.net>
Signed-off-by: mamol27 <mamol27@yandex.ru>
@kkondaka
Copy link
Copy Markdown
Collaborator

kkondaka commented Jun 6, 2025

@seschis this is breaking existing behavior. "Previously this setting was ignored and the code would only connect to the standard aws glue registry endpoint." -- this is incorrect. We do use this setting. I need to fix this immediately. I will add another option under schema: like override_endpoint: which when true this new code will be invoked

@kkondaka kkondaka mentioned this pull request Jun 6, 2025
2 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Kafka local aws glue registry support

3 participants