Preempt Loop Detected

cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Announcements
Please sign in to see details of an important advisory in our Customer Advisories area.

Preempt Loop Detected

Not applicable

Hi All,

I've implemented Active-Passive firewall with preempt options enabled as below :

Topologi.png

I use Path monitoring to IP B to detect failure. When I shutdown the bridge (device between A and B), the Active device switch to X and A status became non-functional (Path Down).

But after A became passive, immediately he request preempt to become active although the bridge didn't up yet. And this will process wil continued 3 times until A became suspended because preempt loop detected.

Is there something wrong on my configuration? What event that will be trigger the preempt process?

Thank you.

Andi

3 REPLIES 3

L7 Applicator

Hello Andi,

It's looks like the Y IP is not reachable from X, while X become Active. Or any monitoring object configured on the Firewall X already down. Hence it is unable to acquire the Acive status of the HA.

I would request you to take CLI access to  FW X during this activity and apply command > tail follow yes mp-log ha_agent.log. It would give you the exact reason for this.

Few related DOC for your reference:

Re: Next-hop Resilience (Shared IP) options for L3 Interfaces

Special cases in which HA failover doesn't happen

Logical Shutdown of an Interface Does Not Cause HA Failover

Gratuitous ARP in HA Failover

High Availability Failover Optimization

Hope this helps.

Thanks

Hi Hulk,

I ensured the X to Y is no problem, if I don't use preempt, and I simulate again everything going well, the traffic can go through to the X-Y link, and I think if the problem on the X-Y link the status error not the "preemted loop detected" but "non-functional loop detected".

Here is the complete log from first non functional status until suspended :

Jul 20 12:19:56 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Path group 'default' destination IP '172.16.0.10' is down

Jul 20 12:19:56 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Path group 'default' failure; one or more destination IPs are down

Jul 20 12:19:56 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Non-Functional

Jul 20 12:19:56 ha_state_start_monitor_holdup(src/ha_state.c:2276): Starting monitor holdup for group 1: 500ms

Jul 20 12:19:56 ha_state_monitor_holdup_callback(src/ha_state.c:2329): Going to degraded state

Jul 20 12:19:56 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Non-Functional

Jul 20 12:19:56 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Moved from state Active to state Non-Functional

Jul 20 12:19:56 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Non-Functional

Jul 20 12:19:56 ha_sysd_dev_alarm_update(src/ha_sysd.c:1494): Set dev alarm to on

Jul 20 12:19:56 ha_state_move_degraded(src/ha_state.c:1502): Group 1: Non-functional loop count updated to 3

Jul 20 12:19:56 ha_state_check_nonfunc_hold(src/ha_state.c:2376): Starting NonFunc holdtime for group 1

Jul 20 12:19:56 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Non-Functional (2)

priority : 10

cookie   : 64773

num tlvs : 3

  Printing out 3 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

  TLV[3]: type 50 (STATE_REASON); len 10; value:

    50617468 20646f77 6e00

Jul 20 12:19:56 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns

Jul 20 12:19:56 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Non-Functional (2)

priority : 10

cookie   : 64773

num tlvs : 3

  Printing out 3 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

  TLV[3]: type 50 (STATE_REASON); len 10; value:

    50617468 20646f77 6e00

Jul 20 12:19:56 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns

Jul 20 12:19:56 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection

Jul 20 12:19:56 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0x986f

flags   : 0x1 (req:)

length  : 52

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Active (5)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:19:56 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0xa057

flags   : 0x1 (req:)

length  : 52

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

   state    : Active (5)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:19:56 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)

Jul 20 12:19:56 ha_sysd_dev_peer_state_update(src/ha_sysd.c:1555): Set dev peer state to Active

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/1 up

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/1 down

Jul 20 12:19:56 Group 1: Link 'ethernet1/1' in link group 'Untrust_Link' state is going from up to down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/2 up

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/2 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/3 up

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/3 down

Jul 20 12:19:56 Group 1: Link 'ethernet1/3' in link group 'Untrust_Link' state is going from up to down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/4 up

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/4 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/5 up

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/5 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/6 up

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/6 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/7 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/8 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/9 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/10 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/11 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/12 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/13 down

Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/14 down

Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/15 down

Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/16 down

Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/17 down

Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/18 down

Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/19 down

Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/20 down

Jul 20 12:19:59 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/2 up

Jul 20 12:19:59 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/4 up

Jul 20 12:19:59 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/6 up

Jul 20 12:20:00 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/1 up

Jul 20 12:20:00 Group 1: Link 'ethernet1/1' in link group 'Untrust_Link' state is going from down to up

Jul 20 12:20:00 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/5 up

Jul 20 12:20:00 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/3 up

Jul 20 12:20:00 Group 1: Link 'ethernet1/3' in link group 'Untrust_Link' state is going from down to up

Jul 20 12:20:56 ha_state_nonfunc_hold_callback(src/ha_state.c:2412): Group 1: ending non-func hold

Jul 20 12:20:56 ha_state_nonfunc_hold_callback(src/ha_state.c:2422): Group 1: After non-func hold trying to move to state Passive

Jul 20 12:20:56 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Passive

Jul 20 12:20:56 ha_state_start_rt_sync_hold(src/ha_state.c:1853): Group 1: starting runtime state sync hold (0)

Jul 20 12:20:56 ha_state_rt_sync_hold_callback(src/ha_state.c:1882): Group 1: ending runtime state sync hold

Jul 20 12:20:56 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Non-Functional to Passive

Jul 20 12:20:56 HA Group 1: Moved from state Non-Functional to state Passive

Jul 20 12:20:56 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Passive

Jul 20 12:20:56 ha_sysd_dev_alarm_update(src/ha_sysd.c:1494): Set dev alarm to off

Jul 20 12:20:56 ha_state_clear_monitor_log_history(src/ha_state.c:3429): Clearing all monitoring log history

Jul 20 12:20:56 ha_state_start_monitor_hold(src/ha_state.c:1014): Starting initial monitor hold for group 1; linkmon not monitored

        Ignoring link and path monitoring failures due to an HA state transition

Jul 20 12:20:56 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Passive (4)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:20:56 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns

Jul 20 12:20:56 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Passive (4)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:20:56 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns

Jul 20 12:20:56 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection

Jul 20 12:20:56 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0xa05f

flags   : 0x1 (req:)

length  : 52

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Active (5)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:20:56 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)

Jul 20 12:20:56 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0x9877

flags   : 0x1 (req:)

length  : 52

  Hello Msg

    ---------

  flags    : 0x1 (preempt:)

  state    : Active (5)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:21:05 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0xa061

flags   : 0x1 (req:)

length  : 52

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Active (5)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 41 (RT_SYNC_DONE); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:21:56 ha_state_monitor_hold_callback(src/ha_state.c:1936): Group 1: ending initial monitor hold; no longer ignoring link and path monitoring failures due to an HA state transition

Jul 20 12:22:05 ha_peer_recv_preempt(src/ha_peer.c:4803): Group 1 (HA1-MAIN): Receiving preempt message

Msg Hdr

-------

version : 1

groupID : 1

type    : Preempt (4)

token   : 0xa069

flags   : 0x1 (req:)

length  : 24

  Preempt Msg

  -----------

  flags    : 0x1 (preempt:)

  priority : 100

  num tlvs : 0

Jul 20 12:22:05 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Active

Jul 20 12:22:05 ha_state_start_promotion_hold(src/ha_state.c:1961): Group 1: starting promotion hold

Jul 20 12:22:05 Group 1: Received preempt message for us to take over which we accepted

Jul 20 12:22:05 ha_peer_send_preempt_ack(src/ha_peer.c:428): Group 1 (HA1-MAIN): Sending preempt ack message

Ack Msg

-------

accept   : 1

status   : 0

num tlvs : 0

Jul 20 12:22:05 ha_state_promotion_hold_callback(src/ha_state.c:2029): Group 1: ending promotion hold

Jul 20 12:22:05 ha_state_start_monitor_hold(src/ha_state.c:1014): Starting initial monitor hold for group 1; linkmon not monitored

        Ignoring link and path monitoring failures due to an HA state transition

Jul 20 12:22:05 ha_state_promotion_hold_callback(src/ha_state.c:2066): Group 1: moving to Active due to preemption

Jul 20 12:22:05 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Passive to Active

Jul 20 12:22:05 HA Group 1: Moved from state Passive to state Active

Jul 20 12:22:05 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Active

Jul 20 12:22:05 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting

Jul 20 12:22:05 ha_state_start_monitor_hold(src/ha_state.c:1014): Starting initial monitor hold for group 1; linkmon not monitored

        Ignoring link and path monitoring failures due to an HA state transition

Jul 20 12:22:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:22:05 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns

Jul 20 12:22:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:22:05 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns

Jul 20 12:22:05 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection

Jul 20 12:22:05 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0x9880

flags   : 0x1 (req:)

length  : 52

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Passive (4)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

      TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:22:05 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0xa06a

flags   : 0x1 (req:)

length  : 52

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Passive (4)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:22:05 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)

Jul 20 12:22:05 ha_sysd_dev_peer_state_update(src/ha_sysd.c:1555): Set dev peer state to Passive

Jul 20 12:22:05 Group 1 peer state moved from Active to Passive

Jul 20 12:22:05 ha_state_eval_peer(src/ha_state.c:2870): Group 1: Goto active automatically

Jul 20 12:22:05 ha_state_peer_change(src/ha_state.c:440): Group 1: Peer change requests group move to Active state from Active

Jul 20 12:22:05 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Active

Jul 20 12:22:05 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Active

Jul 20 12:22:05 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting

Jul 20 12:22:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:22:05 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns

Jul 20 12:22:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:22:05 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns

Jul 20 12:22:05 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/1 up

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/2 up

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/3 up

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/4 up

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/5 up

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/6 up

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/7 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/8 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/9 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/10 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/11 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/12 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/13 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/14 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/15 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/16 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/17 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/18 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/19 down

Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/20 down

Jul 20 12:22:06 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0xa06b

flags   : 0x1 (req:)

length  : 70

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Non-Functional (2)

  priority : 100

  cookie   : 64773

  num tlvs : 3

    Printing out 3 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

    TLV[3]: type 50 (STATE_REASON); len 10; value:

      4c696e6b 20646f77 6e00

Jul 20 12:22:06 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)

Jul 20 12:22:06 ha_sysd_dev_peer_state_update(src/ha_sysd.c:1555): Set dev peer state to Non-Functional

Jul 20 12:22:06 Group 1 peer state moved from Passive to Non-Functional

Jul 20 12:22:06 ha_state_peer_change(src/ha_state.c:440): Group 1: Peer change requests group move to Active state from Active

Jul 20 12:22:06 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Active

Jul 20 12:22:06 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Active

Jul 20 12:22:06 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting

Jul 20 12:22:06 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:22:06 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns

Jul 20 12:22:06 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:22:06 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns

Jul 20 12:22:06 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection

Jul 20 12:22:06 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0x9881

flags   : 0x1 (req:)

length  : 70

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Non-Functional (2)

  priority : 100

  cookie   : 64773

  num tlvs : 3

    Printing out 3 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

    TLV[3]: type 50 (STATE_REASON); len 10; value:

      4c696e6b 20646f77 6e00

Jul 20 12:22:07 Group 1: Path 'default' destination ip '172.16.0.10' state is going from up to down

Jul 20 12:22:09 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0xa06c

flags   : 0x1 (req:)

length  : 52

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Passive (4)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:22:09 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)

Jul 20 12:22:09 ha_sysd_dev_peer_state_update(src/ha_sysd.c:1555): Set dev peer state to Passive

Jul 20 12:22:09 Group 1 peer state moved from Non-Functional to Passive

ul 20 12:22:09 ha_state_eval_peer(src/ha_state.c:2870): Group 1: Goto active automatically

Jul 20 12:22:09 ha_state_peer_change(src/ha_state.c:440): Group 1: Peer change requests group move to Active state from Active

Jul 20 12:22:09 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Active

Jul 20 12:22:09 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Active

Jul 20 12:22:09 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting

Jul 20 12:22:09 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:22:09 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns

Jul 20 12:22:09 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:22:09 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns

Jul 20 12:22:09 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection

Jul 20 12:22:09 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message

Msg Hdr

-------

version : 1

groupID : 1

type    : Hello (2)

token   : 0x9882

flags   : 0x1 (req:)

length  : 52

  Hello Msg

  ---------

  flags    : 0x1 (preempt:)

  state    : Passive (4)

  priority : 100

  cookie   : 64773

  num tlvs : 2

    Printing out 2 tlvs

    TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:

      00000001

    TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

      00000000

Jul 20 12:22:10 ha_state_pathmon_update(src/ha_state.c:525): Path 'default' destination ip '172.16.0.10' state is going from down to down

Jul 20 12:22:17 ha_state_runtime_insync(src/ha_state.c:607): runtime insync: YES

Jul 20 12:22:17 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting

Jul 20 12:22:17 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message

Hello Msg

---------

flags    : 0x1 (preempt:)

state    : Active (5)

priority : 10

cookie   : 64773

num tlvs : 2

  Printing out 2 tlvs

  TLV[1]: type 41 (RT_SYNC_DONE); len 4; value:

    00000001

  TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:

    00000000

Jul 20 12:23:05 ha_state_monitor_hold_callback(src/ha_state.c:1936): Group 1: ending initial monitor hold; no longer ignoring link and path monitoring failures due to an HA state transition

Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Path group 'default' destination IP '172.16.0.10' is down

Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Path group 'default' failure; one or more destination IPs are down

Jul 20 12:23:05 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Non-Functional

Jul 20 12:23:05 ha_state_start_monitor_holdup(src/ha_state.c:2276): Starting monitor holdup for group 1: 500ms

Jul 20 12:23:05 ha_state_monitor_holdup_callback(src/ha_state.c:2329): Going to degraded state

Jul 20 12:23:05 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Non-Functional

Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Moved from state Active to state Non-Functional

Jul 20 12:23:05 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Non-Functional

Jul 20 12:23:05 ha_sysd_dev_alarm_update(src/ha_sysd.c:1494): Set dev alarm to on

Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Going to Suspended state due to detection of a preemption loop after 3 loops

Jul 20 12:23:05 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Suspended

Jul 20 12:23:05 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Non-Functional to Suspended

Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Moved from state Non-Functional to state Suspended

Jul 20 12:23:05 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Suspended

Jul 20 12:23:05 ha_state_clear_monitor_log_history(src/ha_state.c:3429): Clearing all monitoring log history

Jul 20 12:23:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message

Thank you for your help.



Hello amappersona, 


those  3 flaps you observe seems to be as expected, this is a counter that you can set (Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47), from the logs it looks that the other device is the one not reaching 172.16.0.10, is there any other alternate path to reach this network via l2? I am curios to understand why pinging this IP, it looks like monitoring the Ethernet port will be a better option, based on the diagram you provide, I am assuming that those bridge devices are not stp aware are they, why there is not link between them?  I would be interested in seeing the complete ha_agent.log off the other unit (better both sides Smiley Happy)

just for grins, and if the issue is related to the flap, maybe setting the Passive Link State to Auto ( Device > High Availability then Edit Active/Passive Settings) could solve this issue?

just my 2 bits,

thanks

Victor

  • 5602 Views
  • 3 replies
  • 0 Likes
Like what you see?

Show your appreciation!

Click Like if a post is helpful to you or if you just want to show your support.

Click Accept as Solution to acknowledge that the answer to your question has been provided.

The button appears next to the replies on topics you’ve started. The member who gave the solution and all future visitors to this topic will appreciate it!

These simple actions take just seconds of your time, but go a long way in showing appreciation for community members and the LIVEcommunity as a whole!

The LIVEcommunity thanks you for your participation!