# robots.txt for http://www.w3.org/ # exclude some access-controlled areas User-agent: *