User-agent: Googlebot
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: GoogleOther
Allow: /

User-agent: Bingbot
Allow: /

User-agent: GPTBot
Allow: /

User-agent: DeepSeekBot
Allow: /

User-agent: Grok
Allow: /

User-agent: Perplexity
Allow: /

User-agent: Llama
Allow: /

User-agent: Claude
Allow: /

User-agent: FacebookBot
Allow: /

User-agent: Applebot-Extended
Allow: /

# Common rules for all other crawlers (unofficial or unauthorized)
User-agent: *
Disallow: /lp/
Disallow: /feedback/
Disallow: /langtest/

# Block URLs carrying session ID, sort, or filter parameters to prevent duplicate crawling
Disallow: /*?sessionid=
Disallow: /*?sort=
Disallow: /*&filter=

# Block admin, login, and user-privacy paths
Disallow: /admin/
Disallow: /login/
Disallow: /user/

# Allow crawling of all other site content
Allow: /

# Sitemap location, to help search engines discover the site structure
Sitemap: https://www.halalive.com/google_sitemap_index.xml