<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Spurious hits from the Expanse webcrawler... in General Topics</title>
    <link>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447675#M100733</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;Yeah sounds a bit off to me as well. Definitely try to get someone on the phone, since you are not a customer:&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Also an email? In the meantime, set your firewall to drop the traffic?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Regards,&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 15 Nov 2021 16:02:08 GMT</pubDate>
    <dc:creator>OtakarKlier</dc:creator>
    <dc:date>2021-11-15T16:02:08Z</dc:date>
    <item>
      <title>Spurious hits from the Expanse webcrawler...</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447239#M100665</link>
      <description>&lt;P&gt;Much the same as the issue in this post:&amp;nbsp;&lt;A href="https://live.paloaltonetworks.com/t5/general-topics/incoming-traffic-from-palo-alto-ip-address/td-p/196099," target="_blank" rel="noopener"&gt;https://live.paloaltonetworks.com/t5/general-topics/incoming-traffic-from-palo-alto-ip-address/td-p/196099&lt;/A&gt;&amp;nbsp;only with a different set of IP addresses (34.77.162.0 -&amp;nbsp;34.96.130.0).&lt;BR /&gt;&lt;BR /&gt;Telling me that &lt;EM&gt;"we crawl on a regular basis"&lt;/EM&gt; is decidedly &lt;STRONG&gt;NOT&lt;/STRONG&gt; an answer!&lt;BR /&gt;&lt;BR /&gt;One, I am not a client of Palo Alto or Expanse, Inc, so your crawling is of no benefit to myself.&lt;BR /&gt;Two, your "regular crawling" is disrupting my network majorly. Is that really your end goal?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What EXACTLY are you looking for on my PRIVATE network?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Partial log:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;MISS|403|1636727367297|2668|604509|34.86.35.0|-|&lt;A href="http://cdnotherworlds.b-cdn.net/|" target="_blank" rel="noopener"&gt;http://cdnotherworlds.b-cdn.net/|&lt;/A&gt;IL|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|efd0e52587a035b40b4f0568448b0507|US&lt;BR /&gt;MISS|403|1636718015879|2668|604509|34.77.162.0|-|&lt;A href="http://cdnotherworlds.b-cdn.net/|" target="_blank" rel="noopener"&gt;http://cdnotherworlds.b-cdn.net/|&lt;/A&gt;TX|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|38ec458cac0e000c52a54a3fdc02b407|US&lt;BR /&gt;MISS|403|1636711019886|2668|604509|34.77.162.0|-|&lt;A href="https://cdn.otherworlds.tv/|" target="_blank" rel="noopener"&gt;https://cdn.otherworlds.tv/|&lt;/A&gt;MI|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|00963a032da5da4b32c7382d1e6bc10e|US&lt;BR /&gt;MISS|403|1636710780963|2668|604509|34.86.35.0|-|&lt;A href="https://cdn.otherworlds.tv/|" target="_blank" rel="noopener"&gt;https://cdn.otherworlds.tv/|&lt;/A&gt;DE|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|87b25183aaa7f90ea9074e09ef68e9e7|US&lt;BR /&gt;MISS|403|1636709293552|2668|604509|34.96.130.0|-|&lt;A href="http://cdnotherworlds.b-cdn.net/|" target="_blank" rel="noopener"&gt;http://cdnotherworlds.b-cdn.net/|&lt;/A&gt;DE|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|3e6c123ef8b8cbd65bd938d4decad824|US&lt;BR /&gt;MISS|403|1636697076710|2668|604509|34.96.130.0|-|&lt;A href="https://cdn.otherworlds.tv/|" target="_blank" rel="noopener"&gt;https://cdn.otherworlds.tv/|&lt;/A&gt;LA|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|199bcf329b04eed7cd8112c05f1d75ee|US&lt;BR /&gt;MISS|403|1636693708796|2668|604509|34.77.162.0|-|&lt;A href="http://cdn.otherworlds.tv/|" target="_blank" rel="noopener"&gt;http://cdn.otherworlds.tv/|&lt;/A&gt;LA|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|95c210d97d6db0993d277bffecace1d2|US&lt;BR /&gt;MISS|403|1636682456854|2668|604509|34.77.162.0|-|&lt;A href="https://cdnotherworlds.b-cdn.net/|" target="_blank" rel="noopener"&gt;https://cdnotherworlds.b-cdn.net/|&lt;/A&gt;DE|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|aaf73b49b6c569d067c921a8a88a730c|US&lt;BR /&gt;MISS|403|1636681389575|2668|604509|34.86.35.0|-|&lt;A href="https://cdnotherworlds.b-cdn.net/|" target="_blank" rel="noopener"&gt;https://cdnotherworlds.b-cdn.net/|&lt;/A&gt;MI|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|e4d4cbaeb257d24a6dcfc9b4ca7787cd|US&lt;BR /&gt;MISS|403|1636673213470|2668|604509|34.77.162.0|-|&lt;A href="http://cdn.otherworlds.tv/|" target="_blank" rel="noopener"&gt;http://cdn.otherworlds.tv/|&lt;/A&gt;DE|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|800a4e888d49bd473492fd4dcd234ff7|US&lt;BR /&gt;MISS|403|1636672374657|2668|604509|34.77.162.0|-|&lt;A href="https://cdnotherworlds.b-cdn.net/|" target="_blank" rel="noopener"&gt;https://cdnotherworlds.b-cdn.net/|&lt;/A&gt;IL|Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com|ed1cc263aadda33463bcecc10ee7b9c3|US&lt;/P&gt;</description>
      <pubDate>Fri, 12 Nov 2021 17:49:07 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447239#M100665</guid>
      <dc:creator>OtherWorldsTV</dc:creator>
      <dc:date>2021-11-12T17:49:07Z</dc:date>
    </item>
    <item>
      <title>Re: Spurious hits from the Expanse webcrawler...</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447273#M100666</link>
      <description>&lt;P&gt;Hi &lt;a href="https://live.paloaltonetworks.com/t5/user/viewprofilepage/user-id/199799"&gt;@OtherWorldsTV&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;- It may be not beneficial to you, but it is definately beneficial to others. If you read the all posts from the link you found you will understand that Palo Alto - like any other vendor with URL filtering product - is keeping a "database" of websites/domains/urls and try to categorise them. That way Palo Alto products can be used as URL filtering solution and control access to websites, based on their content/category. One way to keep such database is to use automated crawlers. This crawlers will try to crawl any domain that is currently not categorized - "unknown" and give it proper category. So you see why it is beneficial for other to crawl your domain - so it can be categorized and Palo Alto product users to safely access your domain.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;- I am not sure what you mean by "private network". If your domain is publically resolvable and there is no access control (any public IP can access it), that it is obviosly not a private network. If it ment to be private, than you are responsible to put the proper access control and restrict the access. Web crawling is very common think these days, how do you think web searches like Google, Bing etc are indexing the whole internet? Do you think Palo Alto Networks crawlers are the only one crawling your domain?&lt;/P&gt;
&lt;P&gt;- What are your concerns? How does those crawlers disrupt your network?&lt;/P&gt;
&lt;P&gt;- Have you tried to contact the email from the user-agent and explain your problem?&lt;/P&gt;</description>
      <pubDate>Fri, 12 Nov 2021 18:35:01 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447273#M100666</guid>
      <dc:creator>aleksandar.astardzhiev</dc:creator>
      <dc:date>2021-11-12T18:35:01Z</dc:date>
    </item>
    <item>
      <title>Re: Spurious hits from the Expanse webcrawler...</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447284#M100669</link>
      <description>&lt;P&gt;I'm sorry, but rather than making excuses for their bad behavior, you should be asking WHY their crawler is hitting my server 20,000 times since Midnight causing a Denial of Service.&lt;BR /&gt;&lt;BR /&gt;The server in question is not a web server, but a file server for a Roku channel, which has zero use for any Palo Alto product user.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Other crawlers (Google, Bing, etc) all follow the directives set by our robots.txt and do NOT try to index our servers. Why should Palo Alto be any different?&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;As for contacting them by email, I've tried that. And got an immediate bounce message saying my email was "undeliverable."&lt;BR /&gt;&lt;BR /&gt;Honestly, a single scan and move on would be fine. But continually hitting my server this excessively shows malicious intent on the part of Palo Alto Networks.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Nov 2021 19:13:37 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447284#M100669</guid>
      <dc:creator>OtherWorldsTV</dc:creator>
      <dc:date>2021-11-12T19:13:37Z</dc:date>
    </item>
    <item>
      <title>Re: Spurious hits from the Expanse webcrawler...</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447287#M100670</link>
      <description>&lt;P&gt;Their phone support is equally impressive. I've only been hung up on 3 times, and spent a little over 2 hours on hold so far.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Nov 2021 19:28:56 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447287#M100670</guid>
      <dc:creator>OtherWorldsTV</dc:creator>
      <dc:date>2021-11-12T19:28:56Z</dc:date>
    </item>
    <item>
      <title>Re: Spurious hits from the Expanse webcrawler...</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447675#M100733</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;Yeah sounds a bit off to me as well. Definitely try to get someone on the phone, since you are not a customer:&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: scaninfo@expanseinc.com&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Also an email? In the meantime, set your firewall to drop the traffic?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Regards,&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 15 Nov 2021 16:02:08 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447675#M100733</guid>
      <dc:creator>OtakarKlier</dc:creator>
      <dc:date>2021-11-15T16:02:08Z</dc:date>
    </item>
    <item>
      <title>Re: Spurious hits from the Expanse webcrawler...</title>
      <link>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447732#M100741</link>
      <description>&lt;P&gt;By default we already drop any traffic that doesn't meet specific patterns. But it's annoying watching the intrusion logs fill up with Expanse over and over.&lt;BR /&gt;&lt;BR /&gt;As I said, their phone support lives up to everything else I've seen from them so far. 2 hour hold times, repeated hang ups, and one particular call of being taken off hold to be asked "Are you sure you aren't a customer" FIVE times before being hung up on.&lt;BR /&gt;&lt;BR /&gt;Honestly, I think they should be investigated for criminal intent. Their "research" is doing nothing to benefit anyone.&lt;/P&gt;</description>
      <pubDate>Mon, 15 Nov 2021 19:31:38 GMT</pubDate>
      <guid>https://live.paloaltonetworks.com/t5/general-topics/spurious-hits-from-the-expanse-webcrawler/m-p/447732#M100741</guid>
      <dc:creator>OtherWorldsTV</dc:creator>
      <dc:date>2021-11-15T19:31:38Z</dc:date>
    </item>
  </channel>
</rss>

