Source
CBC News British Columbia
Indexed Articles (2)
Indexing details: last successful crawl May 15, 2026 12:00 pm.
Public broadcaster (CBC). National scope; we crawl the BC RSS feed and URL-slug filter to keep only Kamloops & Thompson-Cariboo region articles. robots.txt allows public crawling. [BLOCKED 2026-05-15: CBCs CDN (Akamai) rejects all bot-identifying User-Agent strings (000 response). robots.txt explicitly allows /webfeed/rss/, but CDN-level anti-bot is stricter than the documented policy. Options: (a) use a Firefox UA via per-source override; (b) skip CBC. Inactive until decision.] [UA override 2026-05-15]: Firefox UA because CBC CDN rejects bot UAs. robots.txt allows /webfeed/rss/. TODO: contact CBC audience services after 6-12 months.