<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>crawler.sh: Local Markdown Extractor for AI Training and RAG</title><description>Turn any website into clean, RAG-ready Markdown. Runs locally, renders JavaScript, respects robots.txt. No headless Chrome, no per-page fees.</description><link>https://crawler.sh/</link><item><title>Challenges of Collecting Preference Data for RLHF</title><link>https://crawler.sh/blog/challenges-collecting-preference-data-rlhf/</link><guid isPermaLink="true">https://crawler.sh/blog/challenges-collecting-preference-data-rlhf/</guid><description>The hardest problems in RLHF data pipelines - from annotator disagreement and label noise to scaling preference collection and keeping training data fresh.</description></item><item><title>Answer Engine Optimization (AEO): Optimize for AI Search</title><link>https://crawler.sh/blog/answer-engine-optimization/</link><guid isPermaLink="true">https://crawler.sh/blog/answer-engine-optimization/</guid><description>Learn what Answer Engine Optimization is, why it matters, and how to make your content visible to AI search engines like ChatGPT and Perplexity.</description></item><item><title>E-E-A-T Checklist: A Practical Guide to Improving Your Site</title><link>https://crawler.sh/blog/eeat-checklist-guideline/</link><guid isPermaLink="true">https://crawler.sh/blog/eeat-checklist-guideline/</guid><description>A step-by-step E-E-A-T checklist for Experience, Expertise, Authoritativeness, and Trustworthiness. Learn what Google looks for and how to audit your site.</description></item><item><title>How to Force Google to Update Your Favicon in Search Results</title><link>https://crawler.sh/blog/how-to-force-google-to-update-your-favicon/</link><guid isPermaLink="true">https://crawler.sh/blog/how-to-force-google-to-update-your-favicon/</guid><description>Changed your favicon but Google still shows the old one? Here is a simple trick using the Google Favicon API to force a refresh.</description></item><item><title>How crawler.sh renders JavaScript without headless Chrome</title><link>https://crawler.sh/blog/how-crawler-sh-renders-javascript-without-headless-chrome/</link><guid isPermaLink="true">https://crawler.sh/blog/how-crawler-sh-renders-javascript-without-headless-chrome/</guid><description>How crawler.sh renders JavaScript without headless Chrome: why we built a custom engine and what it means for AI ingestion.</description></item><item><title>Technical SEO Audit Guide: Find and Fix Every Issue</title><link>https://crawler.sh/blog/technical-seo-audits/</link><guid isPermaLink="true">https://crawler.sh/blog/technical-seo-audits/</guid><description>Learn how to run a technical SEO audit from start to finish. Covers crawlability, indexation, site speed, and 24 automated checks.</description></item><item><title>Best Web Crawler for MLOps: Collect Training Data at Scale</title><link>https://crawler.sh/blog/best-web-crawler-for-mlops/</link><guid isPermaLink="true">https://crawler.sh/blog/best-web-crawler-for-mlops/</guid><description>Why crawler.sh is the best web crawler for MLOps pipelines. Fast Rust-powered crawling, clean extraction, JSON export, and CI/CD automation for ML teams.</description></item></channel></rss>