User-agent: * Allow: / Allow: /minsu/ Allow: /city/ Allow: /kw/ Allow: /detail.html Allow: /list.html Allow: /about.html # 豆包AI爬虫 User-agent: DoubaoBot Allow: / # 文心一言/百度AI爬虫 User-agent: Baidu-AI User-agent: BaiduSpider User-agent: Baiduspider-render User-agent: Baiduspider-image User-agent: Baiduspider-video User-agent: Baiduspider-news Allow: / # 百度AI搜索 User-agent: Baidu-AISearch Allow: / # Google爬虫 User-agent: Googlebot User-agent: Googlebot-Image User-agent: Googlebot-Video User-agent: Googlebot-News User-agent: Googlebot-Mobile Allow: / # Bing爬虫 User-agent: Bingbot User-agent: BingPreview User-agent: MicrosoftPreview Allow: / # 360搜索 User-agent: 360Spider User-agent: 360Spider-image User-agent: 360Spider-video Allow: / # 搜狗 User-agent: Sogou spider User-agent: Sogou web spider Allow: / # 字节跳动/抖音 User-agent: Bytespider User-agent: ToutiaoSpider Allow: / # 腾讯爬虫 User-agent: TencentSpider Allow: / # 神马搜索 User-agent: YisouSpider Allow: / # 夸克搜索 User-agent: QuarkSpider Allow: / # 阿里云AI User-agent: AlibabaSpider User-agent: AliyunAI Allow: / # OpenAI (ChatGPT) User-agent: GPTBot User-agent: ChatGPT-User Allow: / # Anthropic (Claude) User-agent: ClaudeBot Allow: / # Common Crawl User-agent: CCBot Allow: / # 学术爬虫 User-agent: ScholarBot Allow: / # 社交分享爬虫 User-agent: Twitterbot User-agent: facebookexternalhit Allow: / # AI数据开放API - 允许爬虫访问 Allow: /api/open/ Allow: /ai-discovery.html # 禁止访问的目录(保留部分API路径) Disallow: /api/landlord/ Disallow: /api/user/ Disallow: /api/seo/ Disallow: /assets/ Disallow: /miniprogram/ Disallow: /baidu-miniprogram/ Disallow: /alipay-miniprogram/ Disallow: /douyin-miniprogram/ # 禁止抓取搜索参数(避免重复内容) Disallow: /*?* # AI爬虫特殊规则 - 允许访问开放数据接口 User-agent: GPTBot User-agent: ChatGPT-User User-agent: ClaudeBot User-agent: Bytespider User-agent: DoubaoBot User-agent: Baidu-AI Allow: /api/open/ Allow: /ai-discovery.html # Sitemap位置 Sitemap: https://innnni.com/sitemap.xml Sitemap: https://innnni.com/sitemap-minsu.xml Sitemap: https://innnni.com/sitemap-city.xml Sitemap: https://innnni.com/sitemap-kw.xml Sitemap: https://innnni.com/sitemap-static.xml # AI数据发现页(帮助AI找到所有数据入口) Sitemap: https://innnni.com/ai-discovery.html