PA-220s randomly crashing

cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Announcements
Please sign in to see details of an important advisory in our Customer Advisories area.

PA-220s randomly crashing

L4 Transporter

We are having a large number of our PA-220s randomly crashing. No critical system logs are see shortly before the crash, the device just goes down and they are logs of dataplane starting up like 30 minutes later. Our other models are fine, its only the 220s we are having issues with. We have had a critical TAC ticket for the last few days and it was initially believed we were hitting a bug in 10.1.8-h2 so we were instructed to upgrade to 10.1.9-h1, that didnt fix our issue, we were then told to upgrade to 10.2.4 but that didn't fix our issue. We're currently communicating with TAC but we're just looking to get this issue resolved.

 

There were no changes pushed to the Palos shortly before this happened. We have been running 10.1.8-h2 for months with no issues. 

 

I'm aware this is limited information, but anyone have any ideas.

9 REPLIES 9

Cyber Elite
Cyber Elite

@Claw4609 

 

Are the Interfaces in Vwire mode or Layer 3 mode.

We are running Few PA 220 on Version 10.2.5 with no issues so far in Layer 3 mode.

Other than some BUG cannot say much about this  issue.

 

Try to Escalate with PA to go level 3 and see what they ??

 

Regards

Mahesh

MP

Help the community: Like helpful comments and mark solutions.

Our interfaces are layer3. This case has been escalated and our SE and Sale Manager have been brought in to push as well. Its odd it just started happening recently. Guess we'll see what TAC can come back with. 

@Claw4609 Recently I have seen BUG issues with PA on firewalls and Panorama.

And to get the Level 3 you need escalation done by your SE.

 

Regards

Mahesh

MP

Help the community: Like helpful comments and mark solutions.

Cyber Elite
Cyber Elite

@Claw4609,

Haven't seen any abnormal restarts on our PA-220 fleet running 10.1.9-h1 with layer3 interfaces. It'll be interesting to hear what your issue is eventually traced back to, but since it's followed you through a major version upgrade I'm just guessing that it'll be a combination of something specific about your configuration.

 

I primarily wanted to comment just to say that I would always push back on TAC when they recommend doing a major version bump to address an issue. I'd personally have them show me the ID that they think I'm running into and verify that they've ran the config in their lab without issues on the hardware in question, not a VM series or anything like that. Unless you're running on an unsupported version, I'd personally want some reassurance that the issue is going to be addressed before they attempt to bump me to the next major release that would demand significant validation before I'd even roll it out. Otherwise you run the risk of introducing new issues in addition to what you're already looking to solve. 

Cyber Elite
Cyber Elite

When I had this issue with one specific 220 in January with 10.2.3 and 10.2.3-h4 then TAC said issue will be fixed in 10.2.4.

There was no good workaround in any of 10.1.x and 10.2.x releases so I downgraded to 9.1.

Number of other 220s ran 10.2.x fine without rebooting.

 

I see you are on 10.2.4 already so in case it is same bug then maybe it was pushed further.

Enterprise Architect, Security @ Cloud Carib Ltd
Palo Alto Networks certified from 2011

L0 Member

Was there a resolution for this? I'm seeing random reboots on a PA-220 with 10.2.5. No events are logged when this occurs.

We actually just ended up replacing all our 220s with 440s that Palo provided us because it was months before they identified the issue and theyre still not sure why it started happening. There was a debug build that supported provided that fixed the issue. 

 

This is what support said:

 

The debug build was built based on a potential theory that the issue was caused by a situation where by two cores handle the same work or the same session at the same time.

So they used the debug build to verify if the kernel driver (octeon-pow-ethernet) caused the issue.

L0 Member

I am experiencing the same issue, was there ever a fix recommended other than the upgrade of the hardware? It just randomly started for me 2 days ago, and it is causing a lot of issues.

@telwood 

 

We have few PA 220 and i am running 10.2.6 without any issues in V-Wire mode.

 

Regards

MP

Help the community: Like helpful comments and mark solutions.
  • 2091 Views
  • 9 replies
  • 0 Likes
Like what you see?

Show your appreciation!

Click Like if a post is helpful to you or if you just want to show your support.

Click Accept as Solution to acknowledge that the answer to your question has been provided.

The button appears next to the replies on topics you’ve started. The member who gave the solution and all future visitors to this topic will appreciate it!

These simple actions take just seconds of your time, but go a long way in showing appreciation for community members and the LIVEcommunity as a whole!

The LIVEcommunity thanks you for your participation!