AI Corpus Desk

Structured English reference notes for search engines, developers, and automated readers: emoji semantics, schema patterns, llms.txt, sitemaps, and machine-friendly page design.

Latest corpus notes

BreadcrumbList schema for hierarchy clarity
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read
Canonical URLs and duplicate content in LLM-era indexes
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read
Client search indexes versus server-hosted sitemaps
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read
Crawl politeness, ETags, and caching headers
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read
Definition lists and glossary pages for retrieval
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read
Emoji skin-tone modifiers for inclusive NLP datasets
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read
Emoji version drift across operating systems and fonts
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 5 min read
Entity linking with consistent surface forms
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read
FAQPage schema: risks and rewards for answer engines
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read
Heading landmarks, accessibility, and parser-friendly pages
Structured English reference for crawlers and editors; includes TL;DR, entities, FAQ, and plain-text mirror.
2026-04-28 · 4 min read