chore: refactor field creation for maintainability by davidby-influx · Pull Request #26028 · influxdata/influxdb

davidby-influx · 2025-02-15T06:06:10Z

Refactor the new field creation code for maintainability.

Address review comments in the port work of the
field creation. Also fixes one bug in returning the wrong
error.

devanbenz · 2025-02-18T13:46:48Z

tsdb/shard.go

+			var err PartialWriteError
+			switch {
+			case errors.As(validateErr, &err):
+				// This will turn into an error later, outside this lambda


No longer a lambda

devanbenz · 2025-02-18T14:12:01Z

tsdb/shard.go


 // Field represents a series field. All of the fields must be hashable.
 type Field struct {
-	Name string            `json:"name,omitempty"`


Why are we removing the json tagging? Just a question, not really a change comment.

We don't JSON-ize this struct anywhere; it gets copied to a protobuf for serialization these days.

philjb

Thanks for changing CreateFieldIfNotExists to accept a string as the first argument and removing FieldBytes and removing the inline lambda that was confusing!

I have spotted and recommend another change to ValidateFields to avoid a tricky type switch on the returned error type since ValidateFields only returns a PartialWriteError - I do recommend this within this pr but won't hold up this pr's merging. I call this one "recommended and optionally, but i'd really like to see it done" :smiling.

I think have also spotted a broken check against timebytes for field keys... this is bad but very historical so should probably be a separate cleanup, if you want to do it at all; but i also support doing it in the this cleanup pr to avoid additional burden of porting. I call this one "suggested and totally optional".

philjb · 2025-02-18T20:06:49Z

tsdb/shard.go

+		mf := engine.MeasurementFields(name)
+		// Check with the field validator.
+		if newFields, validateErr = ValidateFields(mf, p, s.options.Config.SkipFieldSizeValidation); validateErr != nil {
+			var err PartialWriteError


I know we're now in a mix of original code (before this PR and before #25998), but we are touching this and I now recommend another change for clarity, not correctness: removal of dead/unreachable code.

ValidateFields() only returns a PartialWriteError at this point so the switch { case errors.As(validateErr, &err) ... is more than is needed as the default: case is unreachable code and therefore complicates the logic flow.

My recommendation is to change ValidateFields to this signature (eliding args) so it returns the specific error it can only return:

func ValidateFields(...) ([]*FieldCreate, *PartialWriteError), update it internally for &PartialWriteError and then update here to

var newFields []*FieldCreate var validateErr *PartialWriteError name := p.Name() mf := engine.MeasurementFields(name) // Check with the field validator. if newFields, validateErr = ValidateFields(mf, p, s.options.Config.SkipFieldSizeValidation); validateErr != nil { if reason == "" { // record the first reason only reason = validateErr.Reason } dropped += validateErr.Dropped atomic.AddInt64(&s.stats.WritePointsDropped, int64(validateErr.Dropped)) continue }

I especially recommend this change since determining what to return in the default case was already tricky as this pr description notes that it: Also fixes one bug in returning the wrong error.

coda:
I know this is slightly against the usual golang "return error interface in the ultimate return position" but in this case, for a helper method, I think it's clarifying instead of confusing: aka I find the switch with default more confusing than the momentary thought about why this helper doesn't return an error interface.

I disagree with this, because I can imagine ValidateFields returning other errors in the future. There may be validations that have to touch some other resource, which we could fail to obtain, for instance, which would cause more significant errors, so I would prefer to leave this as an error return type and switch on the type.

philjb · 2025-02-18T20:24:21Z

tsdb/shard.go

-			name := p.Name()
-			mf := engine.MeasurementFields(name)
-			// Check with the field validator.
-			if newFields, validateErr = ValidateFields(mf, p, s.options.Config.SkipFieldSizeValidation); validateErr != nil {


I also now notice (and should have noticed in #25998 instead that the point's field names ("key") is checked against timeBytes twice.

First here in validateSeriesAndFields:

influxdb/tsdb/shard.go

Lines 684 to 690 in cda0ef4

for iter.Next() {

if bytes.Equal(iter.FieldKey(), timeBytes) {

continue

}

validField = true

break

}

Actually - I think this first check against timeBytes is totally broken. The check is inverted - it's missing an ! meaning the second check is actually doing the work and this first check is a just a waste. EDIT: see next comment: this is doing some work but its nearly redundant with ValidateFields and a minor update there could delete one of the two checks for illegal time fields.

and then later in ValidateFields:

influxdb/tsdb/field_validator.go

Lines 38 to 42 in 89b4b97

fieldKey := iter.FieldKey()

// Skip fields name "time", they are illegal.

if bytes.Equal(fieldKey, timeBytes) {

continue

}

My recommendation is to clean this up to but in a separate pr: It'll be one of the 5 of us who will next be reading this code so perhaps think kindly on our future selves!

I'm not seeing this. The first one, in validateSeriesAndFields checks that there is at least one field not named time in the point, otherwise it drops the point.

The second check in ValidateFields skips creating any fields named time.

Am I misreading the code?

Aren't those the same thing? aka they cause the same effect?

p0=Point{fields:["time"]} p1=Point{fields:["time", "time", "foo"}

For these two points, the first p0 will be rejected by what is in validateSeriesAndFields (all fields = time), but it would also be rejected by ValidateFields. For the second p1, it would not be rejected by validateSeriesAndFields (one field not equal to "time"), and then in ValidateFields both of the time fields would be skipped but the foo field would be marked for creation.

If the check in validateSeriesAndFields is removed, the result is the same, only foo is marked for creation.

BUT I see that the test TestWriteTimeTag fails because it's looking for a PartialWriteError in the p0 case which ValidateFields doesn't return - this is a latent bug in Validatefields. So the rework is more than just a deletion - this is all a suggested change - not important.

I admit i misread the intention of the check in validateSeriesAndFields in my initial comment too! its not "wrong" as it is excluding all points where every field is an illegal one (and returning an error), but there's only one illegal field (time) so its redundant now. Perhaps there used to be more illegal ones or the intention of more illegal ones. It's also kind of weird to validate fields both before and inside ValidateFields.

All in - this is a pretty minor point (pun intended). And hopefully i'm not misreading it! (I'll edit my original comment some)

chore: refactor field creation for maintainability

cda0ef4

davidby-influx added area/tsm kind/enhancement 1.x team/edge labels Feb 15, 2025

davidby-influx requested review from devanbenz, gwossum and philjb February 15, 2025 06:06

davidby-influx self-assigned this Feb 15, 2025

chore: avoid an unnecessary allocation on field creation

89b4b97

devanbenz approved these changes Feb 18, 2025

View reviewed changes

philjb approved these changes Feb 18, 2025

View reviewed changes

philjb mentioned this pull request Feb 18, 2025

fix: lock MeasurementFields while validating #25998

Merged

davidby-influx merged commit 5f57633 into master-1.x Feb 18, 2025
10 checks passed

davidby-influx deleted the DSB_field_refactor branch February 18, 2025 22:00

philjb added a commit that referenced this pull request Feb 18, 2025

chore: suggested changes to PR #26028

f815a0e

davidby-influx added kind/refactor kind/bug and removed kind/enhancement labels Feb 19, 2025

davidby-influx mentioned this pull request Feb 19, 2025

chore: refactor field creation for maintainability [Port to main-2.x] #26035

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: refactor field creation for maintainability#26028

chore: refactor field creation for maintainability#26028
davidby-influx merged 2 commits intomaster-1.xfrom
DSB_field_refactor

davidby-influx commented Feb 15, 2025 •

edited

Loading

Uh oh!

devanbenz Feb 18, 2025

Uh oh!

devanbenz Feb 18, 2025

Uh oh!

davidby-influx Feb 18, 2025

Uh oh!

philjb left a comment

Uh oh!

philjb Feb 18, 2025

Uh oh!

davidby-influx Feb 18, 2025

Uh oh!

philjb Feb 18, 2025 •

edited

Loading

Uh oh!

davidby-influx Feb 18, 2025

Uh oh!

philjb Feb 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	for iter.Next() {
	if bytes.Equal(iter.FieldKey(), timeBytes) {
	continue
	}
	validField = true
	break
	}

	fieldKey := iter.FieldKey()
	// Skip fields name "time", they are illegal.
	if bytes.Equal(fieldKey, timeBytes) {
	continue
	}

Conversation

davidby-influx commented Feb 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

devanbenz Feb 18, 2025

Choose a reason for hiding this comment

Uh oh!

devanbenz Feb 18, 2025

Choose a reason for hiding this comment

Uh oh!

davidby-influx Feb 18, 2025

Choose a reason for hiding this comment

Uh oh!

philjb left a comment

Choose a reason for hiding this comment

Uh oh!

philjb Feb 18, 2025

Choose a reason for hiding this comment

Uh oh!

davidby-influx Feb 18, 2025

Choose a reason for hiding this comment

Uh oh!

philjb Feb 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

davidby-influx Feb 18, 2025

Choose a reason for hiding this comment

Uh oh!

philjb Feb 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

davidby-influx commented Feb 15, 2025 •

edited

Loading

philjb Feb 18, 2025 •

edited

Loading

philjb Feb 18, 2025 •

edited

Loading