Skip to content

A test page designed for validating web crawlers, scrapers, and content extraction algorithms. Built with modern web standards including semantic HTML5, JSON-LD structured data, and proper meta tags.

License

Notifications You must be signed in to change notification settings

CameronWhiteside/crawl-test

Repository files navigation

CrawlTest.com

A test page designed for validating web crawlers, scrapers, and content extraction algorithms. Built with modern web standards including semantic HTML5, JSON-LD structured data, and proper meta tags.

Features

  • Semantic HTML5 markup
  • JSON-LD structured data
  • OpenGraph & Twitter meta tags
  • robots.txt & sitemap.xml

Usage

Visit crawltest.com to test your web crawler or scraper. The page is designed to be crawled once and provides a reliable baseline for testing parsing capabilities.

Development

This is a site built with React and Remix. To run locally:

npm install
npm run build
npm run preview

License

MIT License - see LICENSE file for details.

Contributing

Feel free to submit issues, feature requests, or pull requests. This project is open source and welcomes contributions from the community.

About

A test page designed for validating web crawlers, scrapers, and content extraction algorithms. Built with modern web standards including semantic HTML5, JSON-LD structured data, and proper meta tags.

Resources

License

Stars

Watchers

Forks

Contributors 2

  •  
  •