Conversation
Now that this test uses the correct values, each subtest fails 5% of the runs, as expected. I ran 1000 tests, and had:
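As a sanity check on what to expect from repeated runs: the failure count over N independent runs is binomially distributed. A minimal sketch (plain Python; the 1000 runs and 5% rate are taken from the comment above):

```python
from math import sqrt

# Failures over repeated runs follow Binomial(n, p): with n = 1000 runs
# and a 5% per-run failure rate, this gives the expected count and spread.
n, p = 1000, 0.05
mean = n * p                # expected number of failing runs
sd = sqrt(n * p * (1 - p))  # standard deviation of that count

print(f"expect about {mean:.0f} failures, give or take {sd:.1f}")
```

So roughly 50 failures out of 1000, with a spread of about 7 either way.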
What chance of getting a false positive is comfortable?
I think I can live with a 99.99% success rate. But at what point is the test still useful? |
Catching egregious failures of uniformity is about the best we can hope for. Normal builds have 8 runs exercising the tests; extended builds have 17. With alpha = 0.9999, the probabilities of all three tests passing across all builds are 0.9976 and 0.9949 respectively (under the uniformity hypothesis).
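The arithmetic behind those probabilities, assuming three independent subtests per run, each passing with probability alpha (a sketch of the calculation, not the actual test harness):

```python
# P(every subtest passes everywhere) = alpha ** (runs * subtests),
# assuming all subtests are independent and each passes with
# probability alpha under the uniformity hypothesis.
alpha = 0.9999  # per-subtest pass probability
subtests = 3

for runs, label in ((8, "normal"), (17, "extended")):
    p_all = alpha ** (runs * subtests)
    print(f"{label}: {p_all:.4f}")
```

This reproduces the 0.9976 (normal) and 0.9949 (extended) figures above.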
More useful might be printing the critical value at which the test actually fails, and then escalating successively from a warning to an error.
You can also do something like run the test 100 times, and then check
that the failure rate is the expected 5%.
I wonder what things like dieharder do.
TestU01, dieharder, and NIST's SP 800-90B are testing for something different: they attempt to test that each bit of the data is IID. I'm attempting to test that values generated over a range are uniform. E.g. a range 0-2 won't have evenly distributed bits (four zero bits, two one bits over all output values). Likewise, a range 0-4 won't have equally probable bits (the '4' bit is only set once, the others twice each). Repeating the test and checking the failure rate has the same issue -- it's a binomial distribution instead of χ², and a critical value still needs to be chosen.
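The bit-imbalance point is easy to verify by enumerating each range once; this sketch (a hypothetical helper, not code from the PR) counts how often each bit position is set:

```python
# Count set bits per position across every value of an inclusive range.
# Uniformly distributed *values* need not give uniformly distributed *bits*.
def ones_per_bit(lo, hi, width):
    counts = [0] * width
    for v in range(lo, hi + 1):
        for b in range(width):
            counts[b] += (v >> b) & 1
    return counts

print(ones_per_bit(0, 2, 2))  # [1, 1]: two one bits, four zero bits in total
print(ones_per_bit(0, 4, 3))  # [2, 2, 1]: the '4' bit is set only once
```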
Running multiple tests and checking for the 5% failure rate works. I've thrown something together in #8830 which does this. The underlying assumption is of independence -- both of the samplings within a test and between tests. Because the data are sourced from a DRBG, this assumption could be suspect; however, a CSRNG should be designed to minimise any dependence. Each of the tests passes 95% of the time, which gives a binomial distribution for which a 99.99% critical value can be calculated. This means less than 0.1% of normal test runs will false positive and about 0.2% of extended test runs will. These numbers seem livable, and we can move the critical value either way pretty easily. Great idea :)
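One way to compute that critical value from the binomial distribution (a standard-library-only sketch; the 100-repetition count here is illustrative, not necessarily the number used in #8830):

```python
from math import comb

def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))

def critical_failures(n, p=0.05, confidence=0.9999):
    """Smallest failure count k with P(X <= k) >= confidence; seeing
    more than k failures out of n repetitions flags the generator."""
    for k in range(n + 1):
        if binom_cdf(k, n, p) >= confidence:
            return k
    return n

# With 100 repetitions of a test that fails 5% of the time by design,
# exceeding this count happens less than 0.01% of the time by chance.
print(critical_failures(100))  # -> 15
```

`math.comb` keeps the coefficients exact, so this is fine for small n; for thousands of repetitions a log-space or incomplete-beta formulation would be more robust.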
And how will this be explained to users, who are pretty well trained to see a "test failure" as a blocker?
Replaced by #8830 |