09-01-2023 02:56 AM
I don't think the following bot useragent is acceptable:
Expanse, a Palo Alto Networks company, searches across the global IPv4 space multiple times per day to identify customers' presences on the Internet. If you would like to be excluded from our scans, please send IP addresses/domains to: scaninfo@paloaltonetworks.com
It is excessively long and reads more like an advert. A legitimate organisation should not be doing this, IMO.
Site admins should have to opt in, as is good practice for accepting cookies, for example, rather than opt out.
Does the Palo bot abide by robots.txt or a similar mechanism?
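For reference, a crawler that honours robots.txt would run a check along these lines before fetching a page. This is only a minimal sketch using Python's standard `urllib.robotparser`; the `ExampleScanner` token, the rules, and the `is_allowed` helper are all hypothetical, and nothing here describes how the Expanse scanner actually behaves.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt a site admin might publish.
# "ExampleScanner" is an invented token, not a real bot name.
ROBOTS_TXT = """\
User-agent: ExampleScanner
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file content as a list of lines, so no network
    # request is needed for this illustration.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    print(is_allowed(ROBOTS_TXT, "ExampleScanner", "https://example.com/"))
    print(is_allowed(ROBOTS_TXT, "OtherBot", "https://example.com/index.html"))
    print(is_allowed(ROBOTS_TXT, "OtherBot", "https://example.com/private/x"))
```

With these rules, the hypothetical scanner is refused everywhere, while other bots are only kept out of `/private/`. Note that robots.txt is purely advisory: it only helps with bots that choose to honour it.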
09-06-2023 03:54 AM
Hi @chaeron ,
This was discussed before here:
https://live.paloaltonetworks.com/t5/general-topics/high-loads-by-scanner/td-p/503445
As suggested there, to be excluded, contact the email address shown in the access log and request that your IP address be removed from the scans.
Kind regards,
-Kim.
09-06-2023 05:25 AM
Thank you for your response.
Although I understand site owners can request that the Palo bot does not scan their IP address(es), this sets a precedent. Site owners should not be expected to "opt out" via email from bots sent by legitimate companies; that would be a full-time job 🙂
At a minimum, the robots.txt file should be respected.
Non-legitimate organisations would ignore the request and possibly spam harder. (How does a site owner know Palo is legit? Again, researching this would be part of the full-time job.)
I think my main point is that the site logs get populated with overly long "useragents" that read like advertisements, something a malicious actor might do, but not a legitimate organisation.
Looking at the multitude of useragents, malicious and legit, the Palo bot stands out as not really being a useragent at all.
Although not a scientific test, a sample of 10,000 useragents gives me an average string length of 78 characters, whereas the Palo bot's is 269 characters long.
IMO, legitimate companies should "play nicely" and follow convention.
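For anyone wanting to repeat the length comparison on their own logs, something like the following would do it. It is a rough sketch that assumes the access log uses the common "combined" format, where the useragent is the last double-quoted field on each line; the function names and the 200-character threshold are my own choices for illustration.

```python
import re
from statistics import mean

# Assumption: combined log format, where the useragent is the last
# double-quoted field on the line.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')


def user_agent_lengths(lines):
    """Yield the useragent length for each log line that can be parsed."""
    for line in lines:
        match = UA_PATTERN.search(line)
        if match:
            yield len(match.group(1))


def summarize(lines, threshold=200):
    """Return the count, mean length, and number of useragents over threshold."""
    lengths = list(user_agent_lengths(lines))
    if not lengths:
        return {"count": 0, "mean": 0.0, "over_threshold": 0}
    return {
        "count": len(lengths),
        "mean": mean(lengths),
        "over_threshold": sum(1 for n in lengths if n > threshold),
    }
```

Calling `summarize(open("access.log"))` (the path is a placeholder) gives the average length and the number of outliers. Lines that do not end in a quoted field are skipped rather than counted.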