<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Excessively long useragent in General Topics</title>
    <link>https://live.paloaltonetworks.com/t5/general-topics/excessively-long-useragent/m-p/556740#M113002</link>
    <description>&lt;P&gt;Thank you for your response.&lt;BR /&gt;Although I understand that site owners can request that the Palo bot not scan their IP address(es), this sets a precedent: site owners should not be expected to "opt out" by email from bots sent by legitimate companies. That would be a full-time job &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;At a minimum, the robots.txt file should be respected.&lt;BR /&gt;Non-legitimate organisations would ignore an opt-out request and possibly spam harder. (How does a site owner know Palo is legit? Again, researching this would be part of the full-time job.)&lt;BR /&gt;I think my main point is that the site logs get populated with overly long "useragents" that read like advertisements, something a malicious actor might do, but not a legitimate organisation.&lt;BR /&gt;Looking at the multitude of useragents, malicious and legit, the Palo bot stands out as not really being a useragent at all.&lt;BR /&gt;Although this is not a scientific test, a sample of 10,000 useragents gives me an average string length of 78 characters, whereas the Palo bot's is 269 characters long.&lt;BR /&gt;IMO legitimate companies should "play nicely" and follow convention.&lt;/P&gt;</description>
    <pubDate>Wed, 06 Sep 2023 12:25:55 GMT</pubDate>
    <dc:creator>chaeron</dc:creator>
    <dc:date>2023-09-06T12:25:55Z</dc:date>
    <item>
      <title>Excessively long useragent</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/excessively-long-useragent/m-p/556116#M112922</link>
      <description>&lt;P&gt;I don't think the following bot useragent is acceptable:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Expanse, a Palo Alto Networks company, searches across the global IPv4 space multiple times per day to identify customers&amp;amp;#39; presences on the Internet. If you would like to be excluded from our scans, please send IP addresses/domains to: &lt;A href="mailto:scaninfo@paloaltonetworks.com" target="_blank"&gt;scaninfo@paloaltonetworks.com&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;It is excessively long and reads more like an advert. IMO a legitimate organisation should not be doing this.&lt;/P&gt;
&lt;P&gt;Site admins should have to opt in, as is good practice for cookie consent, for example, rather than opt out.&lt;BR /&gt;&lt;BR /&gt;Does the Palo bot abide by robots.txt or a similar mechanism?&lt;/P&gt;</description>
      <pubDate>Fri, 01 Sep 2023 09:56:24 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/excessively-long-useragent/m-p/556116#M112922</guid>
      <dc:creator>chaeron</dc:creator>
      <dc:date>2023-09-01T09:56:24Z</dc:date>
    </item>
    <item>
      <title>Re: Excessively long useragent</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/excessively-long-useragent/m-p/556726#M112998</link>
      <description>&lt;P&gt;Hi &lt;a href="https://live.paloaltonetworks.com/t5/user/viewprofilepage/user-id/313150"&gt;@chaeron&lt;/a&gt; ,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This was discussed before here:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://live.paloaltonetworks.com/t5/general-topics/high-loads-by-scanner/td-p/503445" target="_blank"&gt;https://live.paloaltonetworks.com/t5/general-topics/high-loads-by-scanner/td-p/503445&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As suggested there, to be excluded, email the address shown in the access log entry and ask for your IP address to be removed from the scans.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Kind regards,&lt;/P&gt;
&lt;P&gt;-Kim.&lt;/P&gt;</description>
      <pubDate>Wed, 06 Sep 2023 10:54:16 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/excessively-long-useragent/m-p/556726#M112998</guid>
      <dc:creator>kiwi</dc:creator>
      <dc:date>2023-09-06T10:54:16Z</dc:date>
    </item>
    <item>
      <title>Re: Excessively long useragent</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/excessively-long-useragent/m-p/556740#M113002</link>
      <description>&lt;P&gt;Thank you for your response.&lt;BR /&gt;Although I understand that site owners can request that the Palo bot not scan their IP address(es), this sets a precedent: site owners should not be expected to "opt out" by email from bots sent by legitimate companies. That would be a full-time job &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;At a minimum, the robots.txt file should be respected.&lt;BR /&gt;Non-legitimate organisations would ignore an opt-out request and possibly spam harder. (How does a site owner know Palo is legit? Again, researching this would be part of the full-time job.)&lt;BR /&gt;I think my main point is that the site logs get populated with overly long "useragents" that read like advertisements, something a malicious actor might do, but not a legitimate organisation.&lt;BR /&gt;Looking at the multitude of useragents, malicious and legit, the Palo bot stands out as not really being a useragent at all.&lt;BR /&gt;Although this is not a scientific test, a sample of 10,000 useragents gives me an average string length of 78 characters, whereas the Palo bot's is 269 characters long.&lt;BR /&gt;IMO legitimate companies should "play nicely" and follow convention.&lt;/P&gt;</description>
      <pubDate>Wed, 06 Sep 2023 12:25:55 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/excessively-long-useragent/m-p/556740#M113002</guid>
      <dc:creator>chaeron</dc:creator>
      <dc:date>2023-09-06T12:25:55Z</dc:date>
    </item>
  </channel>
</rss>

