KAFKA-2373: Add Kafka-backed offset storage for Copycat. by ewencp · Pull Request #202 · apache/kafka

ewencp · 2015-09-10T02:35:06Z

No description provided.

ewencp · 2015-09-10T02:37:48Z

@gwenshap Sorry, this turned out a bit bigger than intended because some of the MockConsumer stuff was incomplete, but MockConsumer is much more useful than trying to use an EasyMock object.

One issue with the current patch is that it is only unit tested. Two additional JIRAs (2374, 2375) address other components necessary for the full distributed version. Until then, it doesn't make much sense to have more extensive tests. I did, however, manually verify it by changing the Worker to use this implementation and then running the system tests using that version. Not sure if we want something more intermediate until the other two patches are in place or if we should just leave it to the last one to integrate them all and test them end-to-end.

asfbot · 2015-09-10T02:47:24Z

kafka-trunk-git-pr #386 SUCCESS
This pull request looks good

wushujames · 2015-09-10T03:40:36Z

Duplicate line

asfbot · 2015-09-10T18:44:54Z

kafka-trunk-git-pr #388 SUCCESS
This pull request looks good

wushujames · 2015-09-10T18:54:40Z

Does this mean you manually assign this consumer to read all partitions? So the consumer group id doesn't matter or is not used? I couldn't see where a consumer group was being set.

That means that each instance of this code consumes the entire topic, right? Which is exactly what you want.

I ask because I have many use cases where I want a consumer to get all partitions. We currently do it by trying to create unique consumer group ids, but that is kind of annoying.

Yes, we are using this in simple consumer mode. I initially started with the approach you're describing. I didn't actually want a consumer group, since each consumer reads the entire topic and we don't need offset commits, etc. However, one drawback of manual assignment is that it doesn't automatically pick up changes in the number of partitions in the topic.

However, I think this shouldn't be a problem anyway, because if you want to use a compacted topic for this (which should be reasonable), you can't just change the number of partitions since it'll break the key -> partition mapping.
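To make the key -> partition concern concrete, here is a minimal sketch of how a hash-based mapping shifts when the partition count changes. The class and method names are hypothetical, and `String.hashCode()` is only a stand-in: Kafka's default producer partitioner hashes the serialized key bytes with a different hash function, but the modulo step has the same effect.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class KeyPartitionMappingSketch {
    // Stand-in for a hash-based partitioner (assumption, not Kafka's code):
    // non-negative hash of the key, modulo the number of partitions.
    static int partitionFor(String key, int numPartitions) {
        return (key.hashCode() & 0x7fffffff) % numPartitions;
    }

    // Collects the keys whose partition differs between the two partition counts,
    // mapped to {oldPartition, newPartition}.
    static Map<String, int[]> movedKeys(String[] keys, int before, int after) {
        Map<String, int[]> moved = new LinkedHashMap<>();
        for (String key : keys) {
            int oldPartition = partitionFor(key, before);
            int newPartition = partitionFor(key, after);
            if (oldPartition != newPartition) {
                moved.put(key, new int[] {oldPartition, newPartition});
            }
        }
        return moved;
    }

    public static void main(String[] args) {
        // Hypothetical offset keys, used only for illustration.
        String[] keys = {"connector-a", "connector-b", "connector-c"};
        for (Map.Entry<String, int[]> e : movedKeys(keys, 4, 5).entrySet()) {
            System.out.println(e.getKey() + ": partition "
                    + e.getValue()[0] + " -> " + e.getValue()[1]);
        }
    }
}
```

Once a key moves, older records for it stay in the old partition while new ones land elsewhere. Ordering is only guaranteed within a partition, so a reader that rebuilds state by taking the last value seen per key could end up with a stale value.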

I agree that this seems much nicer.

You said "simple consumer mode". So this is the New Consumer in simple consumer mode, correct?

Will the New Consumer automatically handle rebalances due to leader failover (broker failure where the partition leader changes), even when in simple consumer mode? Because that was a downside to the previous Simple Consumer -- you had to handle leadership changes on your own.

In the new consumer, if you specify what to consume by using assign(), it consumes from those topic partitions and doesn't use any of the group management features. This only makes sense if you want a single consumer instance to see everything. Everything related to which broker you need to be fetching data from should be handled for you. If you use subscribe(), then you join your consumer group and are assigned a subset of the topic partitions by the consumer coordinator. All the consumer group rebalancing is handled automatically in that case.
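The difference between the two modes can be sketched with a toy model. This does not call the Kafka client. The method names are hypothetical, and the round-robin split is an illustrative assumption rather than the coordinator's actual assignment strategy.

```java
import java.util.ArrayList;
import java.util.List;

public class AssignVsSubscribeSketch {
    // Models manual assignment: the caller names every partition of the topic,
    // so each instance reads all of them and no group is involved.
    static List<Integer> manualAssignment(int numPartitions) {
        List<Integer> assigned = new ArrayList<>();
        for (int p = 0; p < numPartitions; p++) {
            assigned.add(p);
        }
        return assigned;
    }

    // Models group subscription: a coordinator hands each member a subset.
    // Round-robin is an assumption made for illustration only.
    static List<Integer> groupAssignment(int numPartitions, int memberIndex, int groupSize) {
        List<Integer> assigned = new ArrayList<>();
        for (int p = 0; p < numPartitions; p++) {
            if (p % groupSize == memberIndex) {
                assigned.add(p);
            }
        }
        return assigned;
    }

    public static void main(String[] args) {
        System.out.println("manual, any instance: " + manualAssignment(4));
        System.out.println("group member 0 of 2:  " + groupAssignment(4, 0, 2));
        System.out.println("group member 1 of 2:  " + groupAssignment(4, 1, 2));
    }
}
```

In the real client, the manual case would correspond to passing the topic's partitions (for example, as reported by `partitionsFor()`) to `assign()`. That is why every instance of the offset store sees the whole topic without needing a unique group id.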

ewencp · 2015-09-14T20:14:08Z

@gwenshap Updated to fix some issues after some updates to trunk. Also added some basic system tests. They are very close to the standalone tests for now. As we add the other components we can improve them to do a better job of sanity checking the distributed mode.

Also, if you can keep a careful eye on the MockConsumer/MockConsumerTest that'd be helpful. I noticed what appeared to be an error with how it was reporting offsets, and I want to make sure we get those semantics right. MockConsumer isn't much use if it doesn't actually match KafkaConsumer behavior.

gwenshap · 2015-09-14T23:23:09Z

I think your IDE did this automatically; don't we normally stick to the no-wildcard-import rule?

asfbot · 2015-09-23T01:41:28Z

kafka-trunk-git-pr #496 FAILURE
Looks like there's a problem with this pull request

ewencp · 2015-09-23T01:41:32Z

@gwenshap Rebased to fix some conflicts after (I think) resolving all your comments. Tests pass locally, and we'll see what @asfbot says. I'm running system tests here but don't expect any issues since the final fixes were pretty minimal. Anything else you want addressed before commit?

gwenshap · 2015-09-24T23:54:00Z

Looks like there are some conflicts now :(

Mind rebasing again?

LGTM otherwise.

wushujames reviewed Sep 10, 2015

Comment thread checkstyle/import-control.xml Outdated

wushujames Sep 10, 2015

Duplicate line

wushujames reviewed Sep 10, 2015

gwenshap reviewed Sep 14, 2015

ewencp mentioned this pull request Sep 23, 2015

Should stop offset backing store in Copycat Worker's stop method #232

Closed

ewencp added 5 commits September 22, 2015 18:24

KAFKA-2373: Add Kafka-backed offset storage for Copycat.

004613c

Add distributed mode CLI command and system test.

33b41de

Fix offset reset on partition assignment in MockConsumer.

b8bbffb

Add documentation about overriding producer and config settings in KafkaOffsetBackingStore.

30b94bb

Clarify exception message when KafkaOffsetBackingStore cannot get partition info.

04bcc1c

ewencp force-pushed the kafka-2373-copycat-distributed-offset branch from b3eacbe to 04bcc1c Compare September 23, 2015 01:35

ewencp added 3 commits September 24, 2015 18:26

Merge remote-tracking branch 'origin/trunk' into kafka-2373-copycat-distributed-offset

9df4bde

Fix * import in MockConsumer.java

020ad6e

Fix a few more * imports.

1882811

asfgit closed this in 48b4d69 Sep 25, 2015

4 participants