Ensure test threads share a DB connection by eileencodes · Pull Request #28083 · rails/rails

eileencodes · 2017-02-20T21:37:35Z

This ensures multiple threads inside a transactional test to see consistent
database state.

When a system test starts Puma spins up one thread and Capybara spins up
another thread. Because of this when tests are run the database cannot
see what was inserted into the database on teardown. This is because
there are two threads using two different connections.

This change uses the statement cache to lock the threads to using a
single connection ID instead of each not being able to see each other.
This code only runs in the fixture setup and teardown so it does not
affect real production databases.

When a transaction is opened we set lock_thread to Thread.current so
we can keep track of which connection the thread is using. When we
rollback the transaction we unlock the thread and then there will be no
left-over data in the database because the transaction will roll back
the correct connections.

[ Eileen M. Uchitelle, Matthew Draper ]

cc/ @matthewd

This ensures multiple threads inside a transactional test to see consistent database state. When a system test starts Puma spins up one thread and Capybara spins up another thread. Because of this when tests are run the database cannot see what was inserted into the database on teardown. This is because there are two threads using two different connections. This change uses the statement cache to lock the threads to using a single connection ID instead of each not being able to see each other. This code only runs in the fixture setup and teardown so it does not affect real production databases. When a transaction is opened we set `lock_thread` to `Thread.current` so we can keep track of which connection the thread is using. When we rollback the transaction we unlock the thread and then there will be no left-over data in the database because the transaction will roll back the correct connections. [ Eileen M. Uchitelle, Matthew Draper ]

sgrif · 2017-02-24T19:40:34Z

Sorry for dropping in with little context, but this seems like it needs to do some additional work to actually ensure the connections aren't used concurrently. While usually the server is only executing if the test is blocked on a request, that's not always the case. Particularly if Javascript is involved, it seems like we could very easily end up with a race condition between the threads. The underlying C structures that the sqlite and pg gems are wrapping are very specifically not thread safe (I'm unsure about mysql2)

(But also I've not been following super closely since I'm away on leave so feel free to just tell me I'm wrong)

matthewd · 2017-02-24T20:09:37Z

@sgrif https://github.com/rails/rails/pull/28083/files#diff-c226a4680f86689c3c170d4bc5911e96R610

Could do with some rearrangement to make it clearer what's going on, but this was the easiest option for a simple drop-in solution for [almost] all actual-adapter interaction in one go.

matthewd · 2017-02-24T20:13:54Z

(higher level concurrency issues, like one thread working in a transaction, or some other long term statefulness on the connection, are consciously out of scope)

sgrif · 2017-02-24T20:21:27Z

Oh, I get it now. I parsed that line wrong mentally before. Yeah, seems fine then. I do think it would be a good idea to add some more explicit locking throughout the methods higher up in the future. We should also document that the result returned needs to either be thread safe or eagerly buffer since a lazy cursor that isn't thread safe would cause issues.

…ldren`) If we run only following tests: - test/cases/scoping/default_scoping_test.rb - test/cases/associations_test.rb ``` $ cat Rakefile.test require "rake/testtask" ENV["ARCONN"] = "postgresql" Rake::TestTask.new do |t| t.libs << "test" t.test_files = %w( test/cases/scoping/default_scoping_test.rb test/cases/associations_test.rb ) end ``` a test will fail: ``` $ bundle exec rake test -f Rakefile.test /app/activesupport/lib/active_support/core_ext/enumerable.rb:20: warning: method redefined; discarding old sum Using postgresql Run options: --seed 11830 # Running: .........................................................................................F................ Finished in 6.939055s, 15.2759 runs/s, 27.9577 assertions/s. 1) Failure: AssociationProxyTest#test_save_on_parent_saves_children [/app/activerecord/test/cases/associations_test.rb:185]: Expected: 1 Actual: 2 106 runs, 194 assertions, 1 failures, 0 errors, 0 skips rake aborted! Command failed with status (1) /usr/local/bin/bundle:22:in `load' /usr/local/bin/bundle:22:in `<main>' Tasks: TOP => test (See full trace by running task with --trace) ``` In rails#28083, change `self.use_transactional_tests` to `false` but we forget to clean-up fixture. However we don't have to disable transaction except a few tests.

In Rails 5.1 transactional tests share the same connection id between the webserver and test runner. This removes the need for special cleanup strategies. This speeds up the tests significantly, before: ``` Finished in 3 minutes 30.3 seconds (files took 5.46 seconds to load) ``` After: ``` Finished in 1 minute 41.61 seconds (files took 5.45 seconds to load) ``` rails/rails#28083

bf4 · 2017-04-30T19:56:44Z

I'd be willing to make a PR to backport this to earlier versions of Rails if there's an interest.

eileencodes · 2017-05-01T18:43:13Z

@bf4 I consider this a feature, not a bug fix, so it won't be back ported. It changes expected behavior too much. Sometimes if a bug lives long enough it becomes a feature. That's the case for this change.

In Rails 5.1 transactional tests share the same connection id between the webserver and test runner. This removes the need for special cleanup strategies. This speeds up the tests significantly, before: ``` Finished in 3 minutes 30.3 seconds (files took 5.46 seconds to load) ``` After: ``` Finished in 1 minute 41.61 seconds (files took 5.45 seconds to load) ``` rails/rails#28083

tgxworld · 2017-09-07T00:24:55Z

activerecord/lib/active_record/connection_adapters/abstract/query_cache.rb

-              @query_cache[sql][binds] = yield
-            end
-          result.dup
+          @lock.synchronize do


This code only runs in the fixture setup and teardown so it does not
affect real production databases.

@eileencodes @matthewd Is this lock and the lock in abstract_adapter necessary outside of the test environment?

No.. my theory was that acquiring an uncontended lock wouldn't be noticeably slower than checking whether the lock was needed (given that we're about to perform IO anyway). I guess the fact you're asking suggests I was wrong?

Yea I started noticing it in our flamegraphs but the overhead doesn't contribute significantly when I tried to benchmark it.

moveson · 2017-10-19T07:48:00Z

I'm using Rails 5.1/Devise/RSpec/Capybara, and the test thread does not appear to be sharing a connection when running a selenium browser. Reference the following test:

RSpec.describe 'User logs in' do
  let!(:user) { create(:user, email: email, password: password, password_confirmation: password) }
  let(:email) { 'jane@example.com' }
  let(:password) { '12345678' }

  scenario 'with valid email and password' do
    visit new_user_session_path
    fill_in 'Email', with: email
    fill_in 'Password', with: password
    click_button 'Sign in'
    expect(page).to have_content('You are signed in.')
  end
end

This passes when driven_by :rack_test, but when driven_by :selenium it results in "Invalid email or password". It fails in the same way when using selenium-chrome and selenium-chrome-headless.

eileencodes · 2017-10-19T11:48:15Z

@moveson please open a new issue with a way to reproduce it and demonstrates the failure you're seeing.

moveson · 2017-10-19T20:32:45Z

@eileencodes This turned out to be a problem with puma running in cluster mode, which made the Capybara thread unable to see the database. The problem has been addressed in a Rails PR here and will hopefully see the light of day in Rails 5.1.5.

maschwenk · 2017-10-19T20:40:25Z

Glad to see it helped someone else!

moveson · 2017-10-19T21:15:09Z

It's a good fix. Also thanks to @twalpole for diagnosing the problem and pointing me to the solution.

Before rails#34953, when using the `:async` Active Job queue adapter, jobs enqueued in `db/seeds.rb`, such as Active Storage analysis jobs, would cause a hang (see rails#34939). Therefore, rails#34953 changed all jobs enqueued in `db/seeds.rb` to use the `:inline` queue adapter instead. (This behavior was later limited to only take effect when the `:async` adapter was configured, see rails#35905.) However, inline jobs in `db/seeds.rb` cleared `CurrentAttributes` values (see rails#37526). Therefore, rails#37568 changed the `:inline` adapter to wrap each job in its own thread, for isolation. However, wrapping a job in its own thread affects which database connection it uses. Thus inline jobs can no longer execute within the calling thread's database transaction, including seeing any uncommitted changes. Additionally, if the calling thread is not wrapped with the executor, the inline job thread (which is wrapped with the executor) can deadlock on the load interlock. And when testing (with `connection_pool.lock_thread = true`), the inline job thread can deadlock on one of the locks added by rails#28083. Therefore, this commit reverts the solutions of rails#34953 and rails#37568, and instead wraps evaluation of `db/seeds.rb` with the executor. This eliminates the original hang from rails#34939, which was also due to running multiple threads and not wrapping all of them with the executor. And, because nested calls to `executor.wrap` are ignored, any inline jobs in `db/seeds.rb` will not clear `CurrentAttributes` values. Alternative fix for rails#34939. Reverts rails#34953. Reverts rails#35905. Partially reverts rails#35896. Alternative fix for rails#37526. Reverts rails#37568. Fixes rails#40552.

Before rails#34953, when using the `:async` Active Job queue adapter, jobs enqueued in `db/seeds.rb`, such as Active Storage analysis jobs, would cause a hang (see rails#34939). Therefore, rails#34953 changed all jobs enqueued in `db/seeds.rb` to use the `:inline` queue adapter instead. (This behavior was later limited to only take effect when the `:async` adapter was configured, see rails#35905.) However, inline jobs in `db/seeds.rb` cleared `CurrentAttributes` values (see rails#37526). Therefore, rails#37568 changed the `:inline` adapter to wrap each job in its own thread, for isolation. However, wrapping a job in its own thread affects which database connection it uses. Thus inline jobs can no longer execute within the calling thread's database transaction, including seeing any uncommitted changes. Additionally, if the calling thread is not wrapped with the executor, the inline job thread (which is wrapped with the executor) can deadlock on the load interlock. And when testing (with `connection_pool.lock_thread = true`), the inline job thread can deadlock on one of the locks added by rails#28083. Therefore, this commit reverts the solutions of rails#34953 and rails#37568, and instead wraps evaluation of `db/seeds.rb` with the executor. This eliminates the original hang from rails#34939, which was also due to running multiple threads and not wrapping all of them with the executor. And, because nested calls to `executor.wrap` are ignored, any inline jobs in `db/seeds.rb` will not clear `CurrentAttributes` values. Alternative fix for rails#34939. Reverts rails#34953. Reverts rails#35905. Partially reverts rails#35896. Alternative fix for rails#37526. Reverts rails#37568. Fixes rails#40552. (cherry picked from commit 648da12)

eileencodes added the activerecord label Feb 20, 2017

eileencodes assigned matthewd Feb 20, 2017

eileencodes added this to the 5.1.0 milestone Feb 20, 2017

matthewd merged commit 0ce6418 into rails:master Feb 20, 2017

eileencodes mentioned this pull request Feb 21, 2017

WIP: Capybara Integration with Rails (AKA System Tests) #26703

Merged

13 tasks

kaspth mentioned this pull request Feb 21, 2017

5.1.0.beta1 release post rails/weblog#97

Merged

eileencodes deleted the ensure-test-threads-shared-db-conn branch February 23, 2017 20:21

samstickland mentioned this pull request Feb 24, 2017

Rails 5.1 shares database connections between threads.. I think! grosser/parallel_tests#554

Closed

wjordan mentioned this pull request Feb 27, 2017

Transactional test cases #28178

Closed

KeithP mentioned this pull request Feb 27, 2017

Rails 5.1.0.beta1: PG::SEInvalidSpecification: ERROR: no such savepoint #28197

Closed

This was referenced Feb 27, 2017

Deferred fixture enrolment causes over-eager connection #27581

Open

Don't create new connections during test transaction setup #28207

Closed

mtsmfm mentioned this pull request Mar 2, 2017

Fix random failure on system test with ajax #28223

Merged

mtsmfm mentioned this pull request Mar 15, 2017

Fix fragile test (AssociationProxyTest#test_save_on_parent_saves_children) #28426

Merged

JonRowe mentioned this pull request Apr 27, 2017

Rails 5.1 capybara integration rspec/rspec-rails#1808

Closed

samstickland mentioned this pull request May 24, 2017

Rails 5.1 compatibility rosenfeld/rspec_nested_transactions#2

Closed

wtfiwtz mentioned this pull request Aug 8, 2017

Que jobs running concurrently have interference on Rails 5. que-rb/que#166

Closed

tgxworld reviewed Sep 7, 2017

View reviewed changes

jhawthorn mentioned this pull request Oct 24, 2017

Use transactional fixtures in frontend and backend solidusio/solidus#2320

Merged

This was referenced Dec 26, 2018

BUG in 5.1 and 5.2 test: Mysql adapter count number of deleted rows outside of @lock.synchronize block #34798

Closed

Wrap Mysql count of deleted rows in lock block to avoid conflict in test #34800

Merged

trcarden mentioned this pull request Mar 29, 2019

Occasional deadlocks between Dependencies::Interlock and db adapter lock #34310

Open

palkan mentioned this pull request Jun 25, 2019

before_all transaction has been already rollbacked and could work incorrectly test-prof/test-prof#146

Closed

maerch mentioned this pull request Aug 15, 2019

Apartment is not thread-safe with postgres schemas and transactional system tests influitive/apartment#615

Open

bubaflub mentioned this pull request Feb 28, 2020

ActiveRecord Shared Connection exposes feature spec race condition in Rails <5? test-prof/test-prof#179

Closed

maschwenk mentioned this pull request Aug 27, 2020

Ensure Puma runs in 0:1 configuration #40116

Closed

eugeneius mentioned this pull request Nov 8, 2020

Use shared thread everywhere in ConnectionPool #40574

Closed

jonathanhefner mentioned this pull request Nov 15, 2020

Wrap evaluation of db/seeds.rb with the executor #40626

Merged

mockdeep mentioned this pull request Jan 31, 2021

Add includes to agency show query EBWiki/EBWiki#3892

Merged

bensheldon mentioned this pull request Aug 13, 2021

ActionMailer::MailDeliveryJob executing twice bensheldon/good_job#329

Closed

throwern mentioned this pull request Apr 20, 2022

Handle multiple threads in rails system tests rsim/oracle-enhanced#2287

Open

eileencodes mentioned this pull request Sep 12, 2022

LoadInterlockAwareMonitor deadlock when clearing cache (multiple databases, test) #45994

Closed

casperisfine mentioned this pull request Nov 18, 2022

AbstractAdapter: only synchronize when necessary #46519

Merged

texpert mentioned this pull request Dec 16, 2022

Remove Database Cleaner and share FactoryBot factories owen2345/camaleon-cms#1028

Merged

bensheldon mentioned this pull request Oct 26, 2023

GoodJob.on_thread_error not called in tests bensheldon/good_job#1102

Closed

snickell mentioned this pull request Aug 22, 2024

seth/rails-7.2 code-dot-org/code-dot-org#60437

Draft

4 tasks

Edouard-chin mentioned this pull request Apr 22, 2025

No savepoint created when other active savepoint in system tests #54956

Open

Conversation

eileencodes commented Feb 20, 2017

Uh oh!

sgrif commented Feb 24, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

matthewd commented Feb 24, 2017

Uh oh!

matthewd commented Feb 24, 2017

Uh oh!

sgrif commented Feb 24, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bf4 commented Apr 30, 2017

Uh oh!

eileencodes commented May 1, 2017

Uh oh!

tgxworld Sep 7, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matthewd Sep 7, 2017

Choose a reason for hiding this comment

Uh oh!

tgxworld Sep 11, 2017

Choose a reason for hiding this comment

Uh oh!

moveson commented Oct 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eileencodes commented Oct 19, 2017

Uh oh!

moveson commented Oct 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maschwenk commented Oct 19, 2017

Uh oh!

moveson commented Oct 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

sgrif commented Feb 24, 2017 •

edited

Loading

sgrif commented Feb 24, 2017 •

edited

Loading

tgxworld Sep 7, 2017 •

edited

Loading

moveson commented Oct 19, 2017 •

edited

Loading

moveson commented Oct 19, 2017 •

edited

Loading

moveson commented Oct 19, 2017 •

edited

Loading