Marking tests as "long" is outdated

`test-run.py`, that we use for running Tarantool tests, allows marking functional tests as "long" [^1]:

> long_run - mark tests as long, enabled only with `--long` option (delimited with the space, e.g. `long_run=t1.test.lua t2.test.lua`)

The tests marked as "long" executes in a separate job in GH Actions and skipped by default by `test-run.py`.

I took a look at these tests when doing PR #10216, and I see two problems with this:
- There is no precise criteria to distinguish long tests from other tests. Often it's a matter of taste: it feels like some tests take longer than others.
- A test added to "long_run" will most likely not be removed, even if it is no longer such.

[^1]: https://github.com/tarantool/test-run/?tab=readme-ov-file#test-suite

Let's run `test-run.py` with `--long`, only 5 tests in the longest tests are the tests marked as "long":

```
Top 10 longest tests (seconds):                                          
* 102.90 engine_long/delete_replace_update.test.lua:memtx       (long)
*  99.75 vinyl-luatest/select_consistency_test.lua                             (long)
*  93.07 engine_long/delete_replace_update.test.lua:vinyl             (long)
*  81.26 config-luatest/failover_and_election_mode_test.lua           
*  79.75 box-luatest/gh_7605_qsort_recovery_test.lua                    (long)
*  79.18 config-luatest/compat_test.lua                               
*  73.64 box/alter-primary-index-tuple-leak-long.test.lua               (long)
*  71.89 config-luatest/log_wrapper_test.lua                          
*  65.56 replication-luatest/quorum_orphan_test.lua                   
*  61.71 config-luatest/basic_test.lua                                
```

In Tarantool source tree with latest commit (f65de7e71ec1a1f80cf960ba653b672ba358c74a) there are 19 tests marked as "long".
I've executed tests by CTest to obtain test execution times (see JUnit report [tarantool.xml.zip](https://github.com/user-attachments/files/17831810/tarantool.xml.zip) with timings):

1. `vinyl/stress.test.lua` - 33.274 s
1. `vinyl/large.test.lua` - 18.654 s
1. `vinyl/write_iterator_rand.test.lua` - 20.1814 s
1. `vinyl/dump_stress.test.lua` - 18.8457 s
1. `vinyl/select_consistency.test.lua` - 9.12993 s
1. `vinyl/throttle.test.lua` - 8.32499 s
1. `replication/prune.test.lua` - 43.6834 s
1. `vinyl-luatest/select_consistency_test.lua` - 93.4773 s
1. `box/huge_field_map_long.test.lua` - 8.38625 s
1. `box/alter-primary-index-tuple-leak-long.test.lua` - 90.0156 s
1. `box-luatest/gh_7605_qsort_recovery_test.lua` - 101.907 s
1. `box-luatest/gh_7670_memtx_tx_manager_idx_rand_inconsistency_test.lua` - 31.2257 s
1. `xlog/snap_io_rate.test.lua` - 14.1248 s
1. `sql-luatest/ghs_119_too_long_mem_values_test.lua` - 62.4867 s
1. `sql-luatest/ghs_122_allocations_in_printf_test.lua` - 11.7021 s
1. `sql-tap/gh-3332-tuple-format-leak.test.lua` - 22.3404 s
1. `sql-tap/gh-3083-ephemeral-unref-tuples.test.lua` - 37.5453 s
1. `engine_long/delete_replace_update.test.lua` - 174.352 s
1. `engine_long/delete_insert.test.lua` - 43.6055 s

Seems most of these tests are not "long" as supposed.

I propose to get rid of marking tests as "long" and run these tests as others.
Or define exact criteria for different test sizes (learn more about "small", "medium" and "large" tests in [^2], [^3], [^4] and [^5]):

![Image](https://github.com/user-attachments/assets/6af058d3-980f-4351-8b76-84ae60fc9938)

and support these test sizes in our test runners (at least in luatest and test-run.py).

[^2]: https://testing.googleblog.com/2010/12/test-sizes.html
[^3]: https://testing.googleblog.com/2011/03/how-google-tests-software-part-five.html
[^4]: https://mike-bland.com/2011/11/01/small-medium-large.html
[^5]: https://abseil.io/resources/swe-book/html/ch14.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Marking tests as "long" is outdated #10840

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Marking tests as "long" is outdated #10840

Description

Footnotes

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions