Crawler

Web-Crawler in general

We have implemented an automatic crawler (similar to those used by common search engines) which visits every .at domain.
Because there are a great many .at domains, we currently retrieve only the start page of each domain. This might change if we expand the service.

The crawler identifies itself as CERT.at-Crawler/0.9 (+http://www.cert.at/about/crawler/content.html).

How does it work?

Currently the crawler tries to retrieve the following files:

robots.txt (this states what automatic crawlers are allowed to retrieve, see http://en.wikipedia.org/wiki/Robots.txt for details).
a sitemap, if it was mentioned in robots.txt and not forbidden
the start-page, if not forbidden in robots.txt

Common hostnames such as "www" and "shop" are also tried.
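The retrieval order described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the crawler's actual code: the function names candidate_hosts and plan_fetches are hypothetical, "example.at" is a placeholder domain, and the standard-library robots.txt parser stands in for whatever parser the crawler really uses. No network access takes place; the robots.txt body is passed in as text.

```python
from urllib.robotparser import RobotFileParser

USER_AGENT = "CERT.at-Crawler/0.9"   # product token from the text above
COMMON_HOSTNAMES = ("www", "shop")   # the two examples named in the text


def candidate_hosts(domain):
    """Return the bare domain plus the common hostnames to try (hypothetical helper)."""
    return [domain] + [f"{name}.{domain}" for name in COMMON_HOSTNAMES]


def plan_fetches(host, robots_txt):
    """Return the URLs that may be fetched for one host, given its robots.txt body."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())

    plan = []
    # Sitemaps are only considered if robots.txt mentions them and does not forbid them.
    for sitemap in parser.site_maps() or []:
        if parser.can_fetch(USER_AGENT, sitemap):
            plan.append(sitemap)

    # The start page is fetched only if robots.txt does not forbid it.
    start_page = f"http://{host}/"
    if parser.can_fetch(USER_AGENT, start_page):
        plan.append(start_page)
    return plan
```

For example, calling plan_fetches with a robots.txt body that contains "Disallow: /" for all user agents yields an empty plan, so nothing beyond robots.txt itself would be requested from that host. Note that site_maps() requires Python 3.8 or later.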

Web-Crawler for MS14-066 (WinShock)

CERT.at, the National Computer Emergency Response Team of Austria, is searching for vulnerable software installations within our constituency. The intent is to help those responsible close the security holes. To achieve this, we have implemented an automatic crawler (similar to those used by common search engines) which visits IP addresses within Austrian IP ranges.

The crawler identifies itself as CERT.at-Crawler/1.0 (+http://www.cert.at/about/crawlerms14066/content.html).

How does it work?

Currently the crawler tries to retrieve/verify the following:

(retrieve) Index-/Entry-Page
(verify) whether the server runs Internet Information Services (IIS) for Windows Server
(verify) which SSL/TLS ciphers are supported
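How such checks might look is sketched below in Python. The text does not say how the crawler performs them, so everything here is an assumption: looks_like_iis and accepts_cipher are hypothetical names, the IIS check relies on the HTTP Server header (which an administrator can change or remove, so it is a hint at best), and the cipher probe simply attempts a handshake that offers a single cipher suite.

```python
import socket
import ssl


def looks_like_iis(headers):
    """Guess from HTTP response headers whether the server is IIS (assumed heuristic)."""
    for name, value in headers.items():
        if name.lower() == "server":
            # IIS normally announces itself as "Microsoft-IIS/<version>".
            return value.strip().lower().startswith("microsoft-iis")
    return False


def accepts_cipher(host, cipher, port=443, timeout=5.0):
    """Return True if a TLS handshake offering only `cipher` (OpenSSL name) succeeds."""
    context = ssl.SSLContext(ssl.PROTOCOL_TLS_CLIENT)
    # Only cipher support is of interest here, so certificate checks are disabled.
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE
    # set_ciphers() does not control TLS 1.3 suites, so cap the version at TLS 1.2.
    context.maximum_version = ssl.TLSVersion.TLSv1_2
    try:
        context.set_ciphers(cipher)
        with socket.create_connection((host, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=host):
                return True
    except OSError:  # also covers ssl.SSLError and timeouts
        return False
```

A header check such as looks_like_iis({"Server": "Microsoft-IIS/8.5"}) needs no network access, whereas accepts_cipher opens a real connection and should only be run against hosts you are permitted to test.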

Frequently asked questions

How did the crawler find my domain? I only just registered it.

NIC.at, the registry for .at-domains, kindly provides us with a list of all .at-domains. This information is confidential, of course, and we will not give it to third parties.

In case we find problems, we'll work with the contacts provided during registration (admin-c, tech-c) and the owner to fix them.

I don't want CERT.at to access my domain. How can I prevent that?

The best option is to add an appropriate entry to the site's robots.txt file, such as:

User-agent: CERT.at-Crawler*
Disallow: /

For details, please see http://en.wikipedia.org/wiki/Robots.txt.

There's a technical problem with your crawler, help!

Don't hesitate to contact us! See our contact page for details.