-
Notifications
You must be signed in to change notification settings - Fork 166
[Tests] Add third-party copy test script #2441
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Notes from Zoom chat with @dynamic-entropy:
|
|
One more addendum:
|
6a12c4e to
a2c54c7
Compare
|
I need to either do some CMake magic or create two nodes with the same cluster or an entirely different setup which only runs these tests on platforms that have SciTokens available. We cannot do HTTP TPC without SSL enabled, and I am relying on the SciTokens fixture to create the necessary certs and CAs. |
|
Please rebase on |
144ad8f to
80bd4f3
Compare
d2100fb to
97eea5e
Compare
97eea5e to
bc02369
Compare
|
@dynamic-entropy Please check that at each commit point, things still work. On the first commit, I get this: This means we cannot git-bisect through your changes. You can simply merge some of the commits to fix this, I think (or we can merge with squash, although that's suboptimal). |
bc02369 to
b50aabf
Compare
|
Hello @amadio |
|
Ok, thanks. I will try again. |
|
It seems that, when run alone, the test may pass, but when I run it along with the other tests, it always fails as shown below: |
amadio
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very good, this is mostly done, just need to add the extra required fixture to the list and it should be good to merge. Cheers,
It seems that the tests take more than a minute to run everything that requires a token. |
|
@amadio I have added a commit to request a token with an extended (2 min) lifetime for the TPC tests. |
afc1b88 to
b55ca90
Compare
b55ca90 to
ee3f010
Compare
|
I have changed the timeout to 30 minutes |
|
The test now fails with a checksum mismatch: https://my.cdash.org/tests/260410204 |
|
Okay, I can reproduce this on Ubuntu. The transfers seem to stall in Ubuntu and fail with "Internal server error". |
|
Yes, it looks like the test has caught a real issue. The test itself is not the problem. However, we should not merge a test that fails, we should fix the issue it caught along with the addition of the test. |
bbb5d5b to
181d1e3
Compare
|
@amadio - do you think the prior failures could be related to #2464? That is, if there's a problem with the IPv6 code for HTTP redirects as reported b @ellert, then it might be what causes the periodic test failure (and, depending on the runner you land on for retries, may explain why the failures are not consistent -- latest rebase passes just fine). So, it's possible it's not really the scitokens/TPC-related pieces but the underlying HTTP tests for multi-server setups. |
bbockelm
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@dynamic-entropy and I went over the tests in Zoom. All my initial suggestions have been addressed and this is starting to look quite nice from my point of view.
|
@bbockelm Indeed, the test is looking good. However, before merging, we need to understand and fix what is causing it to fail. I'd like to avoid merging tests that fail into master/devel, because then other pull requests will have failures as well. It could be that the reasons are the same as #2464. If yes, then we will need to adapt what runs to the network environment we have (or document that a working ipv4/ipv6 environment is a requirement to run the test suite). |
|
There is an actual failure to transfer, but the return code is still 201, which doesn't make sense to me (from https://my.cdash.org/tests/260410204): |
|
The error comes from this place in the code: xrootd/src/XrdTpc/XrdTpcTPC.cc Lines 757 to 763 in 53474d9
|
181d1e3 to
11d49a0
Compare
|
Looks like the current changes are still not enough to fix the issue, the test still failed on Alma 10. |
8bacd94 to
0c7c455
Compare
|
Hello @amadio Brian just fixed the content length check. Could you please trigger the jobs to run? Seems like GitHub did not run them on my recent push. |
0c7c455 to
37da93b
Compare
|
@dynamic-entropy There are some spurious changes in the first commit. Could you please rebase on master again (now that I merged the other PR), and push here so that I can merge also this one? Thanks! |
- Add configuration files for TPCTests server instances - Add setup and test scripts Signed-off-by: Rahul Chauhan <rahul.chauhan@cern.ch>
- Add scitokens auth to HTTP TPC Signed-off-by: Rahul Chauhan <rahul.chauhan@cern.ch>
Signed-off-by: Rahul Chauhan <rahul.chauhan@cern.ch>
…eter Signed-off-by: Rahul Chauhan <rahul.chauhan@cern.ch>
37da93b to
0675f88
Compare
Todos:
This draft PR only contains a full mesh TPC test only via the xroot protocol among the 4 nodes of the xrootd cluster.
I plan to add HTTP tests soon and refactor the code along with
tests.sh.Feel free to provide any initial reviews or suggestions.