In the case where most or all the database is loaded in RAM, it would be useful to execute the query on multiple threads. Certain queries like full scan or index lookups could run in parallel, one per bucket.
Also some aggregation could be cpu intensive and could be performed in parallel.