AI Corpus Desk
Structured English reference notes for search engines, developers, and automated readers: emoji semantics, schema patterns, llms.txt, sitemaps, and machine-friendly page design.
Latest corpus notes
BreadcrumbList schema for hierarchy clarity
BreadcrumbList schema for hierarchy clarity. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
Canonical URLs and duplicate content in LLM-era indexes
Canonical URLs and duplicate content in LLM-era indexes. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
Client search indexes versus server-hosted sitemaps
Client search indexes versus server-hosted sitemaps. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
Crawl politeness, ETags, and caching headers
Crawl politeness, ETags, and caching headers. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
Definition lists and glossary pages for retrieval
Definition lists and glossary pages for retrieval. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
Emoji skin-tone modifiers for inclusive NLP datasets
Emoji skin-tone modifiers for inclusive NLP datasets. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
Emoji version drift across operating systems and fonts
Emoji version drift across operating systems and fonts. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
Entity linking with consistent surface forms
Entity linking with consistent surface forms. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
FAQPage schema: risks and rewards for answer engines
FAQPage schema: risks and rewards for answer engines. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
Heading landmarks, accessibility, and parser-friendly pages
Heading landmarks, accessibility, and parser-friendly pages. Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.