robots.txt parser
HTTP /api/v1/http/robotsFetch and parse robots.txt — User-agent groups, Disallow/Allow, Sitemap, Crawl-delay.
https://www.google.com/robots.txt
200
6519 bytes
6 User-agent groups
- /imgres
- /search
- /groups
- /hosted/images/
- /m/
- /imgres
- /search
- /groups
- /hosted/images/
- /m/
- /imgres
- /search
- /groups
- /hosted/images/
- /m/
- /imgres
- /search
- /groups
- /hosted/images/
- /m/
- /imgres
- /search
- /groups
- /hosted/images/
- /m/
- /imgres
- /search
- /groups
- /hosted/images/
- /m/
Raw robots.txt
User-agent: * User-agent: Yandex Disallow: /search Allow: /search/about Allow: /search/howsearchworks Disallow: /sdch Disallow: /groups Disallow: /index.html? Disallow: /? Allow: /?hl= Disallow: /?hl=*& Allow: /?hl=*&gws_rd=ssl$ Disallow: /?hl=*&*&gws_rd=ssl Allow: /?gws_rd=ssl$ Allow: /?pt1=true$ Disallow: /imgres Disallow: /u/ Disallow: /setprefs Disallow: /m? Disallow: /m/ Allow: /m/finance Disallow: /wml? Disallow: /wml/? Disallow: /wml/search? Disallow: /xhtml? Disallow: /xhtml/? Disallow: /xhtml/search? Disallow: /xml? Disallow: /imode? Disallow: /imode/? Disallow: /imode/search? Disallow: /jsky? Disallow: /jsky/? Disallow: /jsky/search? Disallow: /pda? Disallow: /pda/? Disallow: /pda/search? Disallow: /local? Disallow: /local_url Disallow: /products? Disallow: /product_ Disallow: /products_ Disallow: /products; Disallow: /print Disallow: /books/ Disallow: /bkshp?*dq= Disallow: /bkshp?*q= Disallow: /books?*dq= Disallow: /books?*q= Disallow: /books?*qtid= Disallow: /books?*output= Disallow: /books?*pg= Disallow: /books?*jtp= Disallow: /books?*jscmd= Disallow: /books?*buy= Disallow: /books?*zoom= Allow: /books/about Allow: /books?*zoom=1 Allow: /books?*zoom=5 Allow: /books/content?*zoom=1 Allow: /books/content?*zoom=5 Disallow: /patents? Disallow: /patents/download/ Disallow: /patents/pdf/ Disallow: /patents/related/ Disallow: /scholar Disallow: /citations? Allow: /citations?user= Allow: /citations?view_op=new_profile Allow: /citations?view_op=top_venues Allow: /scholar_share Disallow: /s? Disallow: /maps? Allow: /maps?daddr= Allow: /maps?entry=wc Allow: /maps?f= Allow: /maps?hl= Allow: /maps?q= Allow: /maps?saddr= Allow: /maps?sid= Allow: /maps?*output=classic Allow: /maps?*file= Disallow: /mapslt? Disallow: /maphp? Disallow: /maps/ Allow: /maps/$ Allow: /maps/@ Allow: /maps/?daddr= Allow: /maps/?entry=wc Allow: /maps/?f= Allow: /maps/?hl= Allow: /maps/?q= Allow: /maps/?saddr= Allow: /maps/?sid= Allow: /maps/search/ Allow: /maps/sitemap.xml Allow: /maps/sitemaps/ Allow: /maps/dir/ Allow: /maps/d/ Allow: /maps/reserve Allow: /maps/about Allow: /maps/contrib/ Allow: /maps/match Allow: /maps/ms? Allow: /maps/place/ Allow: /maps/_/ Allow: /search?*tbm=map Allow: /maps/vt? Allow: /maps/preview Disallow: /maps/api/js/ Allow: /maps/api/js Disallow: /mld? Disallow: /staticmap? Disallow: /help/maps/streetview/partners/welcome/ Disallow: /help/maps/indoormaps/partners/ Disallow: /lochp? Disallow: /ie? Disallow: /uds/ Disallow: /transit? Disallow: /trends? Disallow: /trends/music? Disallow: /trends/hottrends? Disallow: /trends/viz? Disallow: /trends/embed.js? Disallow: /trends/fetchComponent? Disallow: /trends/beta Disallow: /trends/topics Disallow: /trends/explore? Disallow: /trends/api Disallow: /musica Disallow: /musicl Disallow: /musics Disallow: /urchin_test/ Disallow: /movies? Disallow: /wapsearch? Disallow: /reviews/search? Disallow: /cbk Disallow: /profiles/me Disallow: /s2/profiles/me Allow: /s2/profiles Allow: /s2/oz Allow: /s2/photos Allow: /s2/search/social Allow: /s2/static Disallow: /s2 Disallow: /transconsole/portal/ Disallow: /aclk Disallow: /tbproxy/ Disallow: /support/forum/search? Disallow: /reviews/polls/ Disallow: /hosted/images/ Disallow: /accounts/ClientLogin Disallow: /accounts/ClientAuth Disallow: /accounts/o8 Allow: /accounts/o8/id Disallow: /quality_form? Disallow: /labs/popgadget/search Disallow: /compressiontest/ Disallow: /analytics/feeds/ Disallow: /analytics/partners/comments/ Disallow: /analytics/portal/ Disallow: /analytics/uploads/ Allow: /alerts/manage Allow: /alerts/remove Disallow: /alerts/ Allow: /alerts/$ Disallow: /phone/compare/? Disallow: /travel/clk Disallow: /travel/entity Disallow: /travel/search Disallow: /travel/flights/booking Disallow: /travel/flights/s/ Disallow: /travel/flights/search Disallow: /travel/hotels/stories Disallow: /travel/hotels/*/stories Disallow: /travel/story Disallow: /hotelfinder/rpc Disallow: /hotels/rpc Disallow: /evaluation/ Disallow: /forms/perks/ Disallow: /shopping/suppliers/search Disallow: /edu/cs4hs/ Disallow: /trustedstores/s/ Disallow: /trustedstores/tm2 Disallow: /trustedstores/verify Disallow: /shopping? Disallow: /shopping/product/ Disallow: /shopping/seller Disallow: /shopping/ratings/account/metrics Disallow: /shopping/ratings/merchant/immersivedetails Disallow: /shopping/reviewer Disallow: /shopping/search Disallow: /shopping/deals Allow: /shopping?udm=28$ Disallow: /storefront Disallow: /storepicker Disallow: /about/careers/applications/candidate-prep Disallow: /about/careers/applications/connect-with-a-googler Disallow: /about/careers/applications/jobs/results?page= Disallow: /about/careers/applications/jobs/results/?page= Disallow: /about/careers/applications/jobs/results?*&page= Disallow: /about/careers/applications/jobs/results/?*&page= Disallow: /landing/signout.html Disallow: /gallery/ Disallow: /landing/now/ontap/ Allow: /maps/reserve Allow: /maps/reserve/partners Disallow: /maps/reserve/api/ Disallow: /maps/reserve/search Disallow: /maps/reserve/bookings Disallow: /maps/reserve/settings Disallow: /maps/reserve/manage Disallow: /maps/reserve/payment Disallow: /maps/reserve/receipt Disallow: /maps/reserve/sellersignup Disallow: /maps/reserve/feedback Disallow: /maps/reserve/terms Disallow: /maps/reserve/m/ Disallow: /maps/reserve/b/ Disallow: /maps/reserve/partner-dashboard Disallow: /local/cars Disallow: /local/dealership/ Disallow: /local/dining/ Disallow: /local/place/products/ Disallow: /local/place/reviews/ Disallow: /local/place/rap/ Disallow: /local/tab/ Disallow: /localservices/ Disallow: /nonprofits/account/ Disallow: /uviewer Disallow: /landing/cmsnext-root/ # AdsBot User-agent: AdsBot-Google Disallow: /maps/api/js/ Allow: /maps/api/js Disallow: /maps/api/place/js/ Disallow: /maps/api/staticmap Disallow: /maps/api/streetview # New user agent groups must also have a user agent reference in the global (*) # group. See "Order of precedence" section in # https://goo.gle/rep#order-of-precedence-for-user-agents User-agent: Yandex Disallow: /about/careers/applications/jobs/results Disallow: /about/careers/applications-a/jobs/results # Crawlers of certain social media sites are allowed to access page markup when # google.com/imgres* links are shared. To learn more, please contact # [email protected]. User-agent: facebookexternalhit User-agent: Twitterbot Allow: /imgres Allow: /search Disallow: /groups Disallow: /hosted/images/ Disallow: /m/ Sitemap: https://www.google.com/sitemap.xml
-
1
Paste your input
Enter the value at the top — domain, IP, URL, email, ASN, hash, whatever fits this tool. The smart input auto-detects type.
-
2
Click "Inspect"
host.tools issues real probes (DNS, HTTP, TCP, TLS, WHOIS where applicable) and renders the result in milliseconds.
-
3
Open the API tab
Every web tool has a sibling /api/v1/http/robots JSON endpoint with the same payload. One copy-as-curl click and you're scripting it.
Headers are how the modern web declares its security posture. Auditing them is the highest-ROI thing you can do this week.
/api/v1/http/robots?q=https%3A%2F%2Fwww.google.com%2Ffinance%2Fsitemap.xml
curl -s '/api/v1/http/robots?q=https%3A%2F%2Fwww.google.com%2Ffinance%2Fsitemap.xml'
<iframe src="/http/robots?q={INPUT}&embed=1"
width="100%" height="600" frameborder="0"></iframe>
Drop into any HTML page. The embed=1 flag hides nav and footer.
Upgrade to Pro for $19/mo. Cancel anytime. Works with the same API you already use.
Common questions
Is robots.txt parser free?
Where does the data come from?
How fresh are the results?
?nocache=1 for a forced refresh.Can I run this from the command line?
host.tools http robots YOUR_INPUT.Can I monitor results over time?
Run robots.txt parser on a schedule. Get pinged when it changes.
Pro gets you bulk lookups, monitors, webhook alerts, history, exports and 10,000 API calls/day. $19/mo.
- ✓Schedule any tool — every 1, 5, 15, 60 min
- ✓Diff against last run, alert on change
- ✓Webhook + email + Slack + PagerDuty + OpsGenie
- ✓Bulk CSV upload, 1,000 inputs per job
- ✓Export results as CSV / NDJSON / Excel
- ✓90-day history, comparison view