storage: Introduce storage coordination thread and infrastructure for parallel next-generation IndexedDB implementation #40661
Conversation
High-level thing: I'm not sure it makes sense to check in a million lines of test expectations, especially considering that in Servo the CI is typically the only thing that can use expectations with any kind of consistency. For now, let's keep the expectations outside the repository until the CI can use them. Question for folks working on the IndexedDB backend: is there any way that this can land as-is? I mean, is it complete enough to replace the existing backend?
The work on this backend has to be incremental. One option is to create a feature branch to work on. That would allow for an as-is landing onto main.
If we do that, it'd be good to backport the renames/client storage thread over to main. |
I understand the concern about checking in a large number of test expectations. For local development, though, having the expectations available is quite practical and it works. The README in components/storage/ explains how they can be used effectively. That said, we can definitely explore alternative places to host or share these expectations if committing them directly to the repository isn’t desirable right now.
The goal here is to enable the redesign to move forward incrementally and collaboratively. The parallel experimental implementation doesn’t need to be complete before landing. The point is to create a structure where we can iterate on the redesign without risking regressions or blocking ongoing work on the current backend. This lets multiple people make progress in parallel while keeping Servo stable. Also, this work is not only about IndexedDB. It is part of implementing the Storage specification, especially storage buckets, and it provides a coordination layer for all storage endpoints. The new IndexedDB backend would be the first consumer of that coordination layer, but other storage endpoints can be implemented on top of it as well. So that part is not meant to be behind a feature flag and can eventually be used by people and, especially, tested by CI.
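(For illustration only: a rough sketch of what a message type for such a coordination layer could look like. Every name in it, CoordinationRequest, IndexedDbRequest, and the bucket fields, is hypothetical and not taken from this PR.)

```rust
// Hypothetical sketch of a coordination-layer message type; none of these
// names come from the PR itself.
use serde::{Deserialize, Serialize};

/// Requests handled by a single client-storage coordination thread, which
/// dispatches to per-endpoint backends (IndexedDB first, others later).
#[derive(Debug, Deserialize, Serialize)]
pub enum CoordinationRequest {
    /// Open (or create) a storage bucket for an origin, per the Storage spec.
    OpenBucket { origin: String, name: String },
    /// Forward an IndexedDB request to the IndexedDB backend.
    IndexedDb(IndexedDbRequest),
    /// Shut the coordination thread down cleanly.
    Exit,
}

/// Hypothetical per-endpoint request type routed through the coordinator.
#[derive(Debug, Deserialize, Serialize)]
pub enum IndexedDbRequest {
    OpenDatabase { origin: String, name: String, version: u64 },
    DeleteDatabase { origin: String, name: String },
    ListDatabases { origin: String },
}
```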
Should we land that in a separate PR to reduce the size of this one? I think it wouldn't be very difficult to hook it up to the current implementation as well, and localStorage should be even easier.
Ok, I discussed this with Martin, and I’m now talking it through with arihant2math. We might move to a hybrid model where the client storage work continues on main and the IndexedDB next work proceeds on a branch. Thanks everyone for the quick initial feedback!
Force-pushed from a757769 to 68e0cd4.
I also want to clarify that the parallel IndexedDB implementation is not meant to compete with the existing one. The goal is simply to enable experimentation with areas that are difficult to prototype in the current implementation, such as a new approach to database opening/deletion, listing existing databases, and trying out alternative threading and communication models.
FYI the "indexeddb-next" branch has been created. Also, can the
Yes, I asked @mrobinson to create it, and it was set up yesterday. And agreed, we need this PR both on main and on the branch. A backport or rebasing the branch would work, but let’s wait for reviews on this PR first.
mrobinson left a comment:
Looks good, with some minor naming nits:
components/shared/storage/lib.rs (outdated):
```rust
#[derive(Clone, Debug, Deserialize, Serialize)]
pub struct StorageThreads {
    web_storage_thread: GenericSender<WebStorageThreadMsg>,
    clientstorage_thread: GenericSender<ClientStorageThreadMsg>,
```
Suggested change:
```diff
-    clientstorage_thread: GenericSender<ClientStorageThreadMsg>,
+    client_storage_thread: GenericSender<ClientStorageThreadMsg>,
```
components/shared/storage/lib.rs (outdated):
```rust
impl StorageThreads {
    pub fn new(
        web_storage_thread: GenericSender<WebStorageThreadMsg>,
        clientstorage_thread: GenericSender<ClientStorageThreadMsg>,
```
Suggested change:
```diff
-        clientstorage_thread: GenericSender<ClientStorageThreadMsg>,
+        client_storage_thread: GenericSender<ClientStorageThreadMsg>,
```
components/shared/storage/lib.rs (outdated):
```rust
    ) -> StorageThreads {
        StorageThreads {
            web_storage_thread,
            clientstorage_thread,
```
Suggested change:
```diff
-            clientstorage_thread,
+            client_storage_thread,
```
```rust
use serde::{Deserialize, Serialize};

#[derive(Debug, Deserialize, Serialize)]
pub enum ClientStorageThreadMsg {
```
Some messages use "Msg" and some use "Message" in the name, so let's go with "Message":
```diff
-pub enum ClientStorageThreadMsg {
+pub enum ClientStorageThreadMessage {
```
Thanks, I’ll fix the naming inconsistencies shortly.
This patch updates the argument order passed to StorageThreads to match the actual creation order (IndexedDB first, WebStorage second). It also updates Constellation to exit the threads in the same order. These changes are cosmetic but improve consistency and readability. Testing: Unit tests continue to pass. Signed-off-by: Jan Varga <jvarga@igalia.com>
This patch updates the naming of the WebStorage thread exit sender/receiver in constellation code to reflect recent renaming of Storage to WebStorage. Testing: Unit tests continue to pass. Signed-off-by: Jan Varga <jvarga@igalia.com>
This patch ensures that unit tests for the newly separated storage_traits crate are included when running `mach test-unit`. Testing: Unit tests continue to pass. Signed-off-by: Jan Varga <jvarga@igalia.com>
This patch introduces a new thread responsible for coordinating all current and future storage endpoints. The prefix "client" was chosen to make it explicit that this functionality applies to client-side, locally persisted storage, and should not be confused with the localStorage API or the StorageManager interface. Testing: Unit tests continue to pass. Signed-off-by: Jan Varga <jvarga@igalia.com>
The extra suffix is not necessary, as the separation between the script side and the backend side of the IndexedDB implementation is already clear from imports (dom::indexeddb vs storage_traits::indexeddb). This patch renames indexeddb_thread.rs to indexeddb.rs and updates all imports accordingly. No functional changes are included. Testing: Unit tests continue to pass. Signed-off-by: Jan Varga <jvarga@igalia.com>
Renames components/script/indexed_db.rs to indexeddb.rs to maintain consistent naming across Servo. The codebase already uses the indexeddb form in most locations, so this change aligns the file name with the rest of the code. Testing: Unit tests continue to pass. Signed-off-by: Jan Varga <jvarga@igalia.com>
Drops the indexeddb path prefix from several imports for consistency with generated DOM bindings. This also simplifies introducing a parallel IndexedDB implementation. Testing: Unit tests continue to pass. Signed-off-by: Jan Varga <jvarga@igalia.com>
Renames clientstorage_thread to client_storage_thread and ClientStorageThreadMsg to ClientStorageThreadMessage. Testing: Unit tests continue to pass. Signed-off-by: Jan Varga <jvarga@igalia.com>
LGTM with one comment. I do wonder at this point whether a thread will be necessary at all. Maybe the coordinator could just be a struct on the constellation. Do you plan to consolidate other threads, like IndexedDBThreadFactory, into this one? If not, I doubt another thread is necessary, since running the basic storage spec logic should not take a long time. Good to keep these questions in mind.
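(A minimal sketch of the alternative floated in this comment, the coordinator as a plain struct owned by the constellation instead of a dedicated thread. All names below are hypothetical, not from this PR.)

```rust
// Hypothetical sketch only; the PR keeps a dedicated thread, this just shows
// the "struct on the constellation" alternative mentioned above.
use std::collections::HashMap;
use std::path::PathBuf;

struct ClientStorageCoordinator {
    config_dir: Option<PathBuf>,
    // Per-origin bookkeeping for storage buckets, filled in lazily.
    buckets: HashMap<String, Vec<String>>,
}

impl ClientStorageCoordinator {
    fn new(config_dir: Option<PathBuf>) -> Self {
        Self { config_dir, buckets: HashMap::new() }
    }

    // The constellation would call this directly from its event loop instead
    // of sending a message to a separate thread.
    fn handle_open_bucket(&mut self, origin: String, name: String) {
        self.buckets.entry(origin).or_default().push(name);
    }
}
```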
```rust
}

impl ClientStorageThreadFactory for GenericSender<ClientStorageThreadMessage> {
    fn new(config_dir: Option<PathBuf>) -> GenericSender<ClientStorageThreadMessage> {
```
Could you return a join handle as well, and have the constellation join on it as part of shutdown? We've made quite a lot of effort on that side in #30849 (this did not include joining on all threads, but if we are adding new ones it's worth doing if possible).
Aha, I guess that means that joining is better than just sending the Exit message from the constellation and synchronously waiting for a reply. Yeah, I can do that if that's the case.
Yes, that is the case (we didn't do it for the various storage threads either, but we should).
> Yes, that is the case (we didn't do it for the various storage threads either, but we should).
Ok, so let me add that. Thanks!
Yeah, I’ve been thinking about that too. It seems that in the long run we won’t need the StorageThreads at all, just the ClientStorageThread. In the very long term, all storage-related functionality might even move into a separate process, I guess.
I haven't seen any evidence of that, for example latency increasing for a specific workflow involving the constellation. Note that adding more messages to be handled by the constellation will, if anything, do the opposite of starving the thread: it will ensure it does a maximum of work within its allocated time slice, which might actually improve performance, since the CPU cache could be hot for a larger number of workflows.
That's fair, and I agree I don't have any hard evidence of potential starvation; it's more of a reflex from Gecko, where eventually all storage-related coordination had to be moved off the main thread in the parent process. That said, my gut feeling is still that storage coordination will benefit from having its own thread, and eventually its own process (with the process main thread acting as the coordination thread and spawning I/O workers, etc.). That could also help mitigate potential issues caused by running unsafe C code (like SQLite)? I don't expect the coordination thread to do a lot of CPU-intensive work, and it's good that you mentioned the CPU cache. For IndexedDB (and potentially other storage endpoints), I've been thinking about a communication and threading model where the script thread has a direct channel, for a given database, to a dedicated backend I/O thread for that specific context, avoiding unnecessary indirection and keeping backend operations for a given database on the same thread as much as possible. This is largely inspired by what we learned in Gecko: using a generic thread pool where work can migrate between threads can hurt performance, exactly for the CPU-cache reasons you mentioned (and I think we already talked about this in the past). I'll try to describe this more clearly and start drafting some initial PRs for the indexeddb-next branch. The whole idea is experimental for now.
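(A sketch of the per-database threading model described in this comment, with std::sync::mpsc standing in for Servo's IPC channels; the type and function names are hypothetical, not from the PR.)

```rust
// Illustrative sketch: when a database is opened, the coordination thread
// spawns a dedicated backend I/O thread for that database and hands the
// script side a direct channel to it, so subsequent requests skip the
// coordinator. Names are hypothetical and std::sync::mpsc stands in for
// Servo's IPC channels.
use std::sync::mpsc::{channel, Sender};
use std::thread;

enum DatabaseRequest {
    Get { key: String, reply: Sender<Option<Vec<u8>>> },
    Put { key: String, value: Vec<u8> },
    Close,
}

/// Hypothetically called by the coordination thread when handling an
/// "open database" request.
fn spawn_database_io_thread() -> Sender<DatabaseRequest> {
    let (sender, receiver) = channel();
    thread::spawn(move || {
        // All operations for this database stay on this one thread, keeping
        // its working set hot in the CPU cache.
        let mut store = std::collections::HashMap::<String, Vec<u8>>::new();
        while let Ok(request) = receiver.recv() {
            match request {
                DatabaseRequest::Get { key, reply } => {
                    let _ = reply.send(store.get(&key).cloned());
                },
                DatabaseRequest::Put { key, value } => {
                    store.insert(key, value);
                },
                DatabaseRequest::Close => break,
            }
        }
    });
    // The returned sender is forwarded to the script thread, giving it a
    // direct channel to this database's I/O thread.
    sender
}
```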
So I looked at how the join handles are implemented for some of the existing threads, and I have a rough plan:
The only question is whether this can be done in a follow-up PR.
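(A minimal sketch of the join-handle approach under discussion, assuming the thread exits after receiving an Exit message. std::sync::mpsc stands in for GenericSender, and the factory function shown here is hypothetical rather than the PR's ClientStorageThreadFactory.)

```rust
// Hypothetical sketch of the join-handle plan: the factory returns the
// thread's JoinHandle alongside the sender, and the constellation joins on
// it after sending Exit during shutdown.
use std::sync::mpsc::{channel, Sender};
use std::thread::{self, JoinHandle};

enum ClientStorageThreadMessage {
    Exit,
    // ...other messages elided in this sketch...
}

fn new_client_storage_thread() -> (Sender<ClientStorageThreadMessage>, JoinHandle<()>) {
    let (sender, receiver) = channel();
    let join_handle = thread::spawn(move || {
        while let Ok(message) = receiver.recv() {
            match message {
                ClientStorageThreadMessage::Exit => break,
            }
        }
    });
    (sender, join_handle)
}

// During shutdown, the constellation would do something like:
//     let _ = client_storage_sender.send(ClientStorageThreadMessage::Exit);
//     let _ = client_storage_join_handle.join();
```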
Sounds good.
This PR introduces a new storage coordination thread, intended to serve as the central point for managing all current and future storage endpoints in Servo.
In addition to the new coordination thread, this PR also puts in place the infrastructure required to develop a parallel, next-generation IndexedDB implementation under the indexeddb_next feature flag, living on a separate branch.
Testing: Unit and WPT tests continue to pass
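(A hedged sketch of how code paths gated on the indexeddb_next feature might be selected at compile time; only the feature name comes from the PR description, everything else is illustrative.)

```rust
// Illustrative only: the function names are hypothetical; only the
// indexeddb_next feature name comes from the PR description.
#[cfg(feature = "indexeddb_next")]
fn indexeddb_backend() -> &'static str {
    // Route IndexedDB requests through the new client-storage coordination
    // thread and the next-generation backend.
    "indexeddb-next backend"
}

#[cfg(not(feature = "indexeddb_next"))]
fn indexeddb_backend() -> &'static str {
    // Keep using the existing IndexedDB backend.
    "current IndexedDB backend"
}

fn main() {
    println!("Using: {}", indexeddb_backend());
}
```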