[Context Aware Segments] Recovery failure in case CompositeIndexWriter is unable to obtain lock on active map during recovery #19965

@RS146BIJAY

Description

Is your feature request related to a problem? Please describe

During translog replay, if CompositeIndexWriter is unable to obtain the lock on the active map, recovery fails with the exception below:

Caused by: [logs-231998/RpxglvFRTgCpO7Exby9UBA][[logs-231998][0]] LookupMapLockAcquisitionException[Unable to obtain lock on the current Lookup map]
[2025-11-12T01:46:23,186][WARN ][o.o.i.c.IndicesClusterStateService][2331be4162107575f8651514dd901a8d] [logs-231998][0] marking and sending shard failed due to [shard failure, reason [failed to recover from translog]]
[logs-231998/RpxglvFRTgCpO7Exby9UBA][[logs-231998][0]] TranslogException[failed to recover from translog]; nested: LookupMapLockAcquisitionException[Unable to obtain lock on the current Lookup map];
        at org.opensearch.index.translog.InternalTranslogManager.recoverFromTranslogInternal(InternalTranslogManager.java:152)
        at org.opensearch.index.translog.InternalTranslogManager.recoverFromTranslog(InternalTranslogManager.java:132)
        at org.opensearch.index.shard.IndexShard.openEngineAndRecoverFromTranslog(IndexShard.java:2915)
        at org.opensearch.index.shard.IndexShard.openEngineAndRecoverFromTranslog(IndexShard.java:2878)
        at org.opensearch.index.shard.StoreRecovery.internalRecoverFromStore(StoreRecovery.java:770)
        at org.opensearch.index.shard.StoreRecovery.lambda$recoverFromStore$0(StoreRecovery.java:129)
        at org.opensearch.core.action.ActionListener.completeWith(ActionListener.java:344)
        at org.opensearch.index.shard.StoreRecovery.recoverFromStore(StoreRecovery.java:124)
        at org.opensearch.index.shard.IndexShard.recoverFromStore(IndexShard.java:3326)
        at org.opensearch.action.ActionRunnable$2.doRun(ActionRunnable.java:89)
        at org.opensearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:984)
        at org.opensearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:52)
        at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144)
        at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642)
        at java.base/java.lang.Thread.run(Thread.java:1583)
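
The failure mode can be illustrated with a minimal, self-contained sketch. It is not OpenSearch code: `LookupMapLockSketch`, `LockAcquisitionException`, `withLock`, and the attempt count are hypothetical names chosen for illustration, and it assumes the lock on the map is taken with a non-blocking try-lock and a bounded number of attempts. Under that assumption, a replay thread that finds the lock held by another thread exhausts its attempts and throws, which is the kind of exception that would surface as a translog recovery failure.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical illustration only; none of these names come from OpenSearch.
final class LookupMapLockSketch {

    // Stand-in for an exception like LookupMapLockAcquisitionException.
    static final class LockAcquisitionException extends RuntimeException {
        LockAcquisitionException(String message) {
            super(message);
        }
    }

    // Runs the action under the lock, giving up after maxAttempts failed tryLock calls.
    static void withLock(ReentrantLock lock, int maxAttempts, Runnable action) {
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            if (lock.tryLock()) {
                try {
                    action.run();
                    return;
                } finally {
                    lock.unlock();
                }
            }
        }
        throw new LockAcquisitionException("Unable to obtain lock on the current Lookup map");
    }

    // Simulates a replay thread contending with another thread that holds the lock.
    static boolean replayFailsWhileLockHeld() {
        ReentrantLock lock = new ReentrantLock();
        CountDownLatch held = new CountDownLatch(1);
        CountDownLatch release = new CountDownLatch(1);
        Thread holder = new Thread(() -> {
            lock.lock();
            try {
                held.countDown();   // signal that the lock is now held
                release.await();    // keep holding until the "replay" has finished
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            } finally {
                lock.unlock();
            }
        });
        holder.start();
        try {
            held.await();
            withLock(lock, 3, () -> { });
            return false;           // lock was acquired, no failure
        } catch (LockAcquisitionException e) {
            return true;            // bounded attempts exhausted while the lock was held
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } finally {
            release.countDown();
            try {
                holder.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }

    public static void main(String[] args) {
        System.out.println("replay failed: " + replayFailsWhileLockHeld());
    }
}
```

In this sketch the replay thread gives up immediately rather than waiting for the holder to finish, so a short period of contention is enough to fail the whole operation. Whether recovery should block, retry longer, or bypass the lock is left open by the issue.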

Describe the solution you'd like

TODO

Related component

Indexing

Describe alternatives you've considered

No response

Additional context

No response

Metadata

    Labels

    Indexing, enhancement