07-19-2014 09:30 PM
Hi All,
I've implemented an Active-Passive firewall pair with the preempt option enabled, as below:
I use path monitoring to IP B to detect failure. When I shut down the bridge (the device between A and B), the active role switches to X and A's status becomes non-functional (Path Down).
But right after A becomes passive, it requests preemption to become active again, although the bridge is not up yet. This process repeats 3 times until A is suspended because a preemption loop is detected.
Is there something wrong with my configuration? What event triggers the preempt process?
Thank you.
Andi
07-20-2014 12:04 AM
Hello Andi,
It looks like IP Y is not reachable from X while X is Active, or a monitored object configured on firewall X is already down. Hence X is unable to take over the Active state in the HA pair.
Please open a CLI session to FW X during this test and run the command > tail follow yes mp-log ha_agent.log. It should show you the exact reason for this.
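If you save that output to a file, a short script can reduce it to just the state transitions, which makes the failover sequence easier to follow. This is a minimal sketch, not a Palo Alto tool: the function name state_moves and the regular expression are my own, and they assume the ha_state_move lines look like the sample shown in the comment.

```python
import re

# Assumed line format (taken from ha_agent.log output):
# Jul 20 12:19:56 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Non-Functional
MOVE_RE = re.compile(
    r"^(\w{3}\s+\d+ \d{2}:\d{2}:\d{2}) ha_state_move\([^)]*\): "
    r"Group \d+: moving from state ([\w-]+) to ([\w-]+)"
)

def state_moves(lines):
    """Return (timestamp, old_state, new_state) for each real state change."""
    moves = []
    for line in lines:
        match = MOVE_RE.match(line.strip())
        if not match:
            continue
        stamp, old, new = match.groups()
        if old != new:  # skip no-op entries such as "Active to Active"
            moves.append((stamp, old, new))
    return moves
```

Pass it any iterable of log lines, for example an open file handle on your saved copy of the log, and print the returned tuples one per line.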
A few related docs for your reference:
Re: Next-hop Resilience (Shared IP) options for L3 Interfaces
Special cases in which HA failover doesn't happen
Logical Shutdown of an Interface Does Not Cause HA Failover
High Availability Failover Optimization
Hope this helps.
Thanks
07-20-2014 03:35 PM
Hi Hulk,
I verified that the X-to-Y link has no problem. If I don't use preempt and run the simulation again, everything goes well and traffic passes over the X-Y link. I also think that if the problem were on the X-Y link, the error would be "non-functional loop detected" rather than "preemption loop detected".
Here is the complete log from the first non-functional status until suspended:
Jul 20 12:19:56 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Path group 'default' destination IP '172.16.0.10' is down
Jul 20 12:19:56 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Path group 'default' failure; one or more destination IPs are down
Jul 20 12:19:56 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Non-Functional
Jul 20 12:19:56 ha_state_start_monitor_holdup(src/ha_state.c:2276): Starting monitor holdup for group 1: 500ms
Jul 20 12:19:56 ha_state_monitor_holdup_callback(src/ha_state.c:2329): Going to degraded state
Jul 20 12:19:56 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Non-Functional
Jul 20 12:19:56 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Moved from state Active to state Non-Functional
Jul 20 12:19:56 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Non-Functional
Jul 20 12:19:56 ha_sysd_dev_alarm_update(src/ha_sysd.c:1494): Set dev alarm to on
Jul 20 12:19:56 ha_state_move_degraded(src/ha_state.c:1502): Group 1: Non-functional loop count updated to 3
Jul 20 12:19:56 ha_state_check_nonfunc_hold(src/ha_state.c:2376): Starting NonFunc holdtime for group 1
Jul 20 12:19:56 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Non-Functional (2)
priority : 10
cookie : 64773
num tlvs : 3
Printing out 3 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
TLV[3]: type 50 (STATE_REASON); len 10; value:
50617468 20646f77 6e00
Jul 20 12:19:56 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns
Jul 20 12:19:56 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Non-Functional (2)
priority : 10
cookie : 64773
num tlvs : 3
Printing out 3 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
TLV[3]: type 50 (STATE_REASON); len 10; value:
50617468 20646f77 6e00
Jul 20 12:19:56 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns
Jul 20 12:19:56 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection
Jul 20 12:19:56 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0x986f
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:19:56 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0xa057
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:19:56 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)
Jul 20 12:19:56 ha_sysd_dev_peer_state_update(src/ha_sysd.c:1555): Set dev peer state to Active
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/1 up
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/1 down
Jul 20 12:19:56 Group 1: Link 'ethernet1/1' in link group 'Untrust_Link' state is going from up to down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/2 up
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/2 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/3 up
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/3 down
Jul 20 12:19:56 Group 1: Link 'ethernet1/3' in link group 'Untrust_Link' state is going from up to down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/4 up
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/4 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/5 up
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/5 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/6 up
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/6 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/7 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/8 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/9 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/10 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/11 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/12 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/13 down
Jul 20 12:19:56 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/14 down
Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/15 down
Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/16 down
Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/17 down
Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/18 down
Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/19 down
Jul 20 12:19:57 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/20 down
Jul 20 12:19:59 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/2 up
Jul 20 12:19:59 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/4 up
Jul 20 12:19:59 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/6 up
Jul 20 12:20:00 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/1 up
Jul 20 12:20:00 Group 1: Link 'ethernet1/1' in link group 'Untrust_Link' state is going from down to up
Jul 20 12:20:00 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/5 up
Jul 20 12:20:00 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/3 up
Jul 20 12:20:00 Group 1: Link 'ethernet1/3' in link group 'Untrust_Link' state is going from down to up
Jul 20 12:20:56 ha_state_nonfunc_hold_callback(src/ha_state.c:2412): Group 1: ending non-func hold
Jul 20 12:20:56 ha_state_nonfunc_hold_callback(src/ha_state.c:2422): Group 1: After non-func hold trying to move to state Passive
Jul 20 12:20:56 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Passive
Jul 20 12:20:56 ha_state_start_rt_sync_hold(src/ha_state.c:1853): Group 1: starting runtime state sync hold (0)
Jul 20 12:20:56 ha_state_rt_sync_hold_callback(src/ha_state.c:1882): Group 1: ending runtime state sync hold
Jul 20 12:20:56 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Non-Functional to Passive
Jul 20 12:20:56 HA Group 1: Moved from state Non-Functional to state Passive
Jul 20 12:20:56 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Passive
Jul 20 12:20:56 ha_sysd_dev_alarm_update(src/ha_sysd.c:1494): Set dev alarm to off
Jul 20 12:20:56 ha_state_clear_monitor_log_history(src/ha_state.c:3429): Clearing all monitoring log history
Jul 20 12:20:56 ha_state_start_monitor_hold(src/ha_state.c:1014): Starting initial monitor hold for group 1; linkmon not monitored
Ignoring link and path monitoring failures due to an HA state transition
Jul 20 12:20:56 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Passive (4)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:20:56 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns
Jul 20 12:20:56 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Passive (4)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:20:56 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns
Jul 20 12:20:56 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection
Jul 20 12:20:56 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0xa05f
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:20:56 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)
Jul 20 12:20:56 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0x9877
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:21:05 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0xa061
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 41 (RT_SYNC_DONE); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:21:56 ha_state_monitor_hold_callback(src/ha_state.c:1936): Group 1: ending initial monitor hold; no longer ignoring link and path monitoring failures due to an HA state transition
Jul 20 12:22:05 ha_peer_recv_preempt(src/ha_peer.c:4803): Group 1 (HA1-MAIN): Receiving preempt message
Msg Hdr
-------
version : 1
groupID : 1
type : Preempt (4)
token : 0xa069
flags : 0x1 (req:)
length : 24
Preempt Msg
-----------
flags : 0x1 (preempt:)
priority : 100
num tlvs : 0
Jul 20 12:22:05 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Active
Jul 20 12:22:05 ha_state_start_promotion_hold(src/ha_state.c:1961): Group 1: starting promotion hold
Jul 20 12:22:05 Group 1: Received preempt message for us to take over which we accepted
Jul 20 12:22:05 ha_peer_send_preempt_ack(src/ha_peer.c:428): Group 1 (HA1-MAIN): Sending preempt ack message
Ack Msg
-------
accept : 1
status : 0
num tlvs : 0
Jul 20 12:22:05 ha_state_promotion_hold_callback(src/ha_state.c:2029): Group 1: ending promotion hold
Jul 20 12:22:05 ha_state_start_monitor_hold(src/ha_state.c:1014): Starting initial monitor hold for group 1; linkmon not monitored
Ignoring link and path monitoring failures due to an HA state transition
Jul 20 12:22:05 ha_state_promotion_hold_callback(src/ha_state.c:2066): Group 1: moving to Active due to preemption
Jul 20 12:22:05 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Passive to Active
Jul 20 12:22:05 HA Group 1: Moved from state Passive to state Active
Jul 20 12:22:05 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Active
Jul 20 12:22:05 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting
Jul 20 12:22:05 ha_state_start_monitor_hold(src/ha_state.c:1014): Starting initial monitor hold for group 1; linkmon not monitored
Ignoring link and path monitoring failures due to an HA state transition
Jul 20 12:22:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:05 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns
Jul 20 12:22:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:05 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns
Jul 20 12:22:05 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection
Jul 20 12:22:05 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0x9880
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Passive (4)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:05 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0xa06a
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Passive (4)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:05 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)
Jul 20 12:22:05 ha_sysd_dev_peer_state_update(src/ha_sysd.c:1555): Set dev peer state to Passive
Jul 20 12:22:05 Group 1 peer state moved from Active to Passive
Jul 20 12:22:05 ha_state_eval_peer(src/ha_state.c:2870): Group 1: Goto active automatically
Jul 20 12:22:05 ha_state_peer_change(src/ha_state.c:440): Group 1: Peer change requests group move to Active state from Active
Jul 20 12:22:05 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Active
Jul 20 12:22:05 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Active
Jul 20 12:22:05 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting
Jul 20 12:22:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:05 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns
Jul 20 12:22:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:05 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns
Jul 20 12:22:05 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/1 up
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/2 up
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/3 up
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/4 up
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/5 up
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/6 up
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/7 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/8 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/9 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/10 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/11 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/12 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/13 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/14 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/15 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/16 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/17 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/18 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/19 down
Jul 20 12:22:05 ha_sysd_linkmon_link_change(src/ha_sysd.c:4021): Link 1/20 down
Jul 20 12:22:06 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0xa06b
flags : 0x1 (req:)
length : 70
Hello Msg
---------
flags : 0x1 (preempt:)
state : Non-Functional (2)
priority : 100
cookie : 64773
num tlvs : 3
Printing out 3 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
TLV[3]: type 50 (STATE_REASON); len 10; value:
4c696e6b 20646f77 6e00
Jul 20 12:22:06 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)
Jul 20 12:22:06 ha_sysd_dev_peer_state_update(src/ha_sysd.c:1555): Set dev peer state to Non-Functional
Jul 20 12:22:06 Group 1 peer state moved from Passive to Non-Functional
Jul 20 12:22:06 ha_state_peer_change(src/ha_state.c:440): Group 1: Peer change requests group move to Active state from Active
Jul 20 12:22:06 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Active
Jul 20 12:22:06 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Active
Jul 20 12:22:06 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting
Jul 20 12:22:06 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:06 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns
Jul 20 12:22:06 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:06 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns
Jul 20 12:22:06 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection
Jul 20 12:22:06 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0x9881
flags : 0x1 (req:)
length : 70
Hello Msg
---------
flags : 0x1 (preempt:)
state : Non-Functional (2)
priority : 100
cookie : 64773
num tlvs : 3
Printing out 3 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
TLV[3]: type 50 (STATE_REASON); len 10; value:
4c696e6b 20646f77 6e00
Jul 20 12:22:07 Group 1: Path 'default' destination ip '172.16.0.10' state is going from up to down
Jul 20 12:22:09 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-MAIN): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0xa06c
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Passive (4)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:09 ha_peer_recv_tlv(src/ha_peer.c:3553): true: Cfg Sync compat from peer set to Group 1 (HA1-MAIN)
Jul 20 12:22:09 ha_sysd_dev_peer_state_update(src/ha_sysd.c:1555): Set dev peer state to Passive
Jul 20 12:22:09 Group 1 peer state moved from Non-Functional to Passive
Jul 20 12:22:09 ha_state_eval_peer(src/ha_state.c:2870): Group 1: Goto active automatically
Jul 20 12:22:09 ha_state_peer_change(src/ha_state.c:440): Group 1: Peer change requests group move to Active state from Active
Jul 20 12:22:09 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Active
Jul 20 12:22:09 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Active
Jul 20 12:22:09 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting
Jul 20 12:22:09 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:09 Group 1 (HA1-MAIN): Starting hello with timeout: 8s/0ns
Jul 20 12:22:09 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-BKUP): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:09 Group 1 (HA1-BKUP): Starting hello with timeout: 8s/0ns
Jul 20 12:22:09 ha_peer_start_hello(src/ha_peer.c:1003): Group 1 (HA1-MGMT): can't start hello, no connection
Jul 20 12:22:09 ha_peer_recv_hello(src/ha_peer.c:4682): Group 1 (HA1-BKUP): Receiving hello message
Msg Hdr
-------
version : 1
groupID : 1
type : Hello (2)
token : 0x9882
flags : 0x1 (req:)
length : 52
Hello Msg
---------
flags : 0x1 (preempt:)
state : Passive (4)
priority : 100
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 8 (CFG_SYNC_COMPAT); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:22:10 ha_state_pathmon_update(src/ha_state.c:525): Path 'default' destination ip '172.16.0.10' state is going from down to down
Jul 20 12:22:17 ha_state_runtime_insync(src/ha_state.c:607): runtime insync: YES
Jul 20 12:22:17 ha_state_start_preemption_hold(src/ha_state.c:2101): Group 1: no need for preemption waiting
Jul 20 12:22:17 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message
Hello Msg
---------
flags : 0x1 (preempt:)
state : Active (5)
priority : 10
cookie : 64773
num tlvs : 2
Printing out 2 tlvs
TLV[1]: type 41 (RT_SYNC_DONE); len 4; value:
00000001
TLV[2]: type 11 (SYSD_PEER_DOWN); len 4; value:
00000000
Jul 20 12:23:05 ha_state_monitor_hold_callback(src/ha_state.c:1936): Group 1: ending initial monitor hold; no longer ignoring link and path monitoring failures due to an HA state transition
Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Path group 'default' destination IP '172.16.0.10' is down
Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Path group 'default' failure; one or more destination IPs are down
Jul 20 12:23:05 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Non-Functional
Jul 20 12:23:05 ha_state_start_monitor_holdup(src/ha_state.c:2276): Starting monitor holdup for group 1: 500ms
Jul 20 12:23:05 ha_state_monitor_holdup_callback(src/ha_state.c:2329): Going to degraded state
Jul 20 12:23:05 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Active to Non-Functional
Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Moved from state Active to state Non-Functional
Jul 20 12:23:05 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Non-Functional
Jul 20 12:23:05 ha_sysd_dev_alarm_update(src/ha_sysd.c:1494): Set dev alarm to on
Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Going to Suspended state due to detection of a preemption loop after 3 loops
Jul 20 12:23:05 ha_state_transition(src/ha_state.c:1116): Group 1: transition to state Suspended
Jul 20 12:23:05 ha_state_move(src/ha_state.c:1202): Group 1: moving from state Non-Functional to Suspended
Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47): HA Group 1: Moved from state Non-Functional to state Suspended
Jul 20 12:23:05 ha_sysd_dev_state_update(src/ha_sysd.c:1522): Set dev state to Suspended
Jul 20 12:23:05 ha_state_clear_monitor_log_history(src/ha_state.c:3429): Clearing all monitoring log history
Jul 20 12:23:05 ha_peer_send_hello(src/ha_peer.c:4629): Group 1 (HA1-MAIN): Sending hello message
Thank you for your help.
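A note for anyone reading the hello dumps above: the STATE_REASON TLV values appear to be plain ASCII text printed as hex words. The sketch below decodes them, assuming the value is a NUL-terminated ASCII string (the len 10 and the trailing 00 suggest this, but it is my reading of the dump, not documented behavior). The function name decode_state_reason is made up for illustration.

```python
def decode_state_reason(hex_words):
    """Decode a TLV value such as '50617468 20646f77 6e00' to text.

    Assumption: the payload is ASCII terminated by a NUL byte.
    """
    raw = bytes.fromhex(hex_words)  # fromhex ignores the spaces between words
    text, _, _ = raw.partition(b"\x00")  # drop the terminator and anything after
    return text.decode("ascii")

# Values copied from the hello messages in the log
print(decode_state_reason("50617468 20646f77 6e00"))  # Path down
print(decode_state_reason("4c696e6b 20646f77 6e00"))  # Link down
```

So this unit reports "Path down" as its reason at 12:19:56, while the peer's Non-Functional hello at 12:22:06 carries "Link down".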
03-07-2015 05:36 PM
Hello amappersona,
The 3 flaps you observe seem to be expected; this is a counter that you can set (see the Jul 20 12:23:05 Warning: ha_event_log(src/ha_event.c:47) entry). From the logs it looks like the other device is the one not reaching 172.16.0.10. Is there an alternate path to reach this network via L2? I am curious why you are pinging this IP; monitoring the Ethernet port looks like a better option. Based on the diagram you provided, I am assuming those bridge devices are not STP-aware. Are they? Why is there no link between them? I would be interested in seeing the complete ha_agent.log from the other unit (better yet, both sides).
Just for grins, and if the issue is related to the flap, maybe setting the Passive Link State to Auto (Device > High Availability, then edit Active/Passive Settings) could solve this issue?
just my 2 bits,
thanks
Victor