Skip to content

Periodical OOM incidents on Testnet storage nodes #1319

@cthulhu-rider

Description

@cthulhu-rider

Some NeoFS Testnet storage nodes (nagisa, ai, yu) are periodically killed by OOM signal from OS. All these nodes has ~2GB RAM. We need to detect the reason and try to prevent it.

Possible reasons:

  • incoming RPC spikes (server doesn't limit the incoming connections)
  • outgoing RPC spikes (internal routines with API communication)
  • internal work on some event (e.g. new epoch)
  • ???

Observations also show that memory consuming sometimes happens almost simultaneously on different nodes, which can hint either at an external load spike on the container, or global event processing.

Metadata

Metadata

Assignees

Labels

U3RegularbugSomething isn't workingneofs-storageStorage node application issues

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions