Implement `max-age` directive #1957

bbockelm · 2023-03-13T03:08:46Z

This is the third PR in the series (to be reviewed after #1953 and #1954; it'll be rebased on top of whatever those become).

This implements the HTTP max-age directive, allowing the client to control how old the data returned can be. If the cached copy is over the age threshold, either the PFC will delete it (if the file is being opened for the first time) or the HTTP request will (for files that are already opened by other clients).

This will need the most careful feedback from @osschar as it contains a key improvement for the XrdPfc -- an Unlink request no longer invalidates all open file handles but rather the next read will cause it to be reopened.

Note the careful cleanup of the HTTP GET state machine -- now each request state represents a unique operation instead of overloading the request state with other pieces of information. I also tried to add some more careful comments to the state machine to try and make it more understandable by the next developer.

The HTTP `cache-control` header has a well-defined set of potential values. This helper class will allow us to central its parsing.

This provides the PFC with the ability to parse the `cache-control` CGI, using the XrdOucCacheDirective helper class, and understand the no-store and no-cache directives. *Note* this handles the simple cases only -- if the file in the cache was already opened by another client then we will serve from that file according to the original open rules.

Fixes xrootd#1886

With this, clients will receive the RFC standard 504 Gateway Timeout if the file is not already cached. This subtly changes the semantics of the creation time in the cinfo file to be when the first block of data is written as opposed to when the cinfo is created (as opening but not reading the file will create the cinfo file).

This allows the client to specify the maximum age of the cached data they are willing to accept. This currently only works for the first attach of the data.

Previously, an Unlink of a file that was opened would cause a failure on future file reads.

Permits an invalidation of the cache without trying to delete the path from upstream as well.

With this, the HTTP server can invalidate the cache after it has opened a file if the file proves to be older than the requested max-age. A side-effect is refactoring the state machine to be simpler -- for GET, each request state has a single operation that may occur.

abh3

OK, I have looked at this and I cannot say I am happy. This is not what we have traditionally doe visive the architecture. The approach here is rather invasive as it tries to force the whole http header protocol all the way through the end-point code. That really is not what we want to do. In general, we have always promoted headers to cgi elements. In this case, one would need to promote header sub-elements to cgi elements. We then do not pollute the code with specific syntax requirements, Additionally, the protocol gets to choose what actually gets exposed and it's all done in a way that is transparent. I know you spent a lot of work here (obviously) but the approach is really not the way to go as it really muddles the code.

I also see that you better structured the http state machine. That was a good thing. Unfortunately, it's totally tied up with this feature patch. They really should be sperate changes. Could we start with that (i.e. fixing the http state machine)?

This change was mostly extracted from a larger pull request that included other features, xrootd#1957. The current change aims to be a minimal refactor in preparation for other work in the same area of the code. Co-authored-by: Brian Bockelman <bbockelman@morgridge.org>

This change was mostly extracted from a larger pull request that included other features, #1957. The current change aims to be a minimal refactor in preparation for other work in the same area of the code. Co-authored-by: Brian Bockelman <bbockelman@morgridge.org>

bbockelm · 2024-02-13T13:46:50Z

Closing this one out -- Alja has been picking up where I've left off in separate PRs.

bbockelm added 8 commits March 11, 2023 11:13

Add helper class for parsing cache-control header

1e20fbd

The HTTP `cache-control` header has a well-defined set of potential values. This helper class will allow us to central its parsing.

Add support for the cache-control header

e63f575

Fixes xrootd#1886

Add support for max-age directive

ef7527a

This allows the client to specify the maximum age of the cached data they are willing to accept. This currently only works for the first attach of the data.

Permit XrdPfc to reopen files internally after unlink

f1c2a5e

Previously, an Unlink of a file that was opened would cause a failure on future file reads.

Allow a XrdPosix operation to target only the cache

d2257bf

Permits an invalidation of the cache without trying to delete the path from upstream as well.

bbockelm mentioned this pull request Mar 13, 2023

XRootD HTTP erroneously returns old Digest header #1956

Closed

amadio requested a review from osschar March 13, 2023 15:27

amadio assigned osschar Mar 13, 2023

abh3 reviewed Apr 14, 2023

View reviewed changes

smithdh mentioned this pull request Aug 17, 2023

[XrdHttp] Refactoring the read issuing for GET and fix issues 1976, 2076 #2072

Merged

bbockelm closed this Feb 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement `max-age` directive #1957

Implement `max-age` directive #1957

Uh oh!

bbockelm commented Mar 13, 2023

Uh oh!

abh3 left a comment

Uh oh!

bbockelm commented Feb 13, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Implement max-age directive #1957

Implement max-age directive #1957

Uh oh!

Conversation

bbockelm commented Mar 13, 2023

Uh oh!

abh3 left a comment

Choose a reason for hiding this comment

Uh oh!

bbockelm commented Feb 13, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Implement `max-age` directive #1957

Implement `max-age` directive #1957