Skip to content

warm reboot failed due to LAG flapping #3630

@keboliu

Description

@keboliu

Description

Steps to reproduce the issue:

with latest master image, there is a possibility encounter into this issue when run warm reboot test.

Describe the results you received:

04:21:55 "2019-10-14 20:21:53 : Extracted from VM logs:",
04:21:55 "2019-10-14 20:21:53 : --------------------------------------------------",
04:21:55 "2019-10-14 20:21:53 : Extracted log info from 10.112.207.42",
04:21:55 "2019-10-14 20:21:53 : BGP IPv4 was down (seconds) : 169",
04:21:55 "2019-10-14 20:21:53 : BGP IPv4 was down (times) : 1",
04:21:55 "2019-10-14 20:21:53 : BGP IPv4 was gotten up after Po was up (seconds) : 31",
04:21:55 "2019-10-14 20:21:53 : BGP IPv6 was down (seconds) : 170",
04:21:55 "2019-10-14 20:21:53 : BGP IPv6 was down (times) : 1",
04:21:55 "2019-10-14 20:21:53 : BGP IPv6 was gotten up after Po was up (seconds) : 32",
04:21:55 "2019-10-14 20:21:53 : Interface Port-Channel1 was down (times) : 1",
04:21:55 "2019-10-14 20:21:53 : PortChannel was down (seconds) : 37",
04:21:55 "2019-10-14 20:21:53 : PortChannel went down after bgp session was down (seconds) : 101",
04:21:55 "2019-10-14 20:21:53 : defaultdict(<type 'list'>, {})",
04:21:55 "2019-10-14 20:21:53 : --------------------------------------------------",
04:21:55 "2019-10-14 20:21:53 : Extracted log info from 10.112.207.43",
04:21:55 "2019-10-14 20:21:53 : BGP IPv4 was down (seconds) : 171",
04:21:55 "2019-10-14 20:21:53 : BGP IPv4 was down (times) : 1",
04:21:55 "2019-10-14 20:21:53 : BGP IPv4 was gotten up afterFAIL",
04:21:55 "",
04:21:55 "======================================================================",
04:21:55 "FAIL: advanced-reboot.ReloadTest",
04:21:55 "----------------------------------------------------------------------",
04:21:55 "Traceback (most recent call last):",
04:21:55 " File \"ptftests/advanced-reboot.py\", line 894, in runTest",
04:21:55 " self.assertTrue(is_good, errors)",
04:21:55 "AssertionError: ",
04:21:55 "",
04:21:55 "Something went wrong. Please check output below:",
04:21:55 "",
04:21:55 "FAILED:10.112.207.45:LAG flapped 2 times on 10.112.207.45 after warm boot",
04:21:55 "FAILED:10.112.207.44:LAG flapped 2 times on 10.112.207.44 after warm boot",
04:21:55 "FAILED:10.112.207.43:LAG flapped 2 times on 10.112.207.43 after warm boot",
04:21:55 "FAILED:10.112.207.42:LAG flapped 2 times on 10.112.207.42 after warm boot",
04:21:55 "",
04:21:55 "",
04:21:55 "----------------------------------------------------------------------",
04:21:55 "Ran 1 test in 915.417s",
04:21:55 "",
04:21:55 "FAILED (failures=1)"
04:21:55 ]
04:21:55 }

Describe the results you expected:

warm reboot shall pass.
syslog.gz
syslog.1.gz

Additional information you deem important (e.g. issue happens only occasionally):

**Output of `show version`:**

```

"SONiC Software Version: SONiC.HEAD.99-2694e660",
"Distribution: Debian 9.11",
"Kernel: 4.9.0-9-2-amd64",
"Build commit: 2694e66",
"Build date: Fri Oct 11 06:24:58 UTC 2019",
"Built by: johnar@jenkins-worker-7",
```

**Attach debug file `sudo generate_dump`:**

```
(paste your output here)
```

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions