HKIA Content Aggregation System - Complete content scraping and markdown generation for 5 sources (WordPress, MailChimp RSS, Podcast RSS, YouTube, Instagram)
Find a file
Ben Reed c1831d3a52 feat: Implement YouTube scraper with humanized behavior
- YouTube channel scraper using yt-dlp
- Authentication and session persistence via cookies
- Humanized delays and rate limiting (2-5 seconds between requests)
- User agent rotation for stealth
- Incremental updates via state management
- Support for videos, shorts, and live streams detection
- All 11 tests passing

🤖 Generated with Claude Code

Co-Authored-By: Claude <noreply@anthropic.com>
2025-08-18 12:39:49 -03:00
docs Initial commit: Project foundation with base scraper and tests 2025-08-18 12:15:17 -03:00
src feat: Implement YouTube scraper with humanized behavior 2025-08-18 12:39:49 -03:00
tests feat: Implement YouTube scraper with humanized behavior 2025-08-18 12:39:49 -03:00
.gitignore Initial commit: Project foundation with base scraper and tests 2025-08-18 12:15:17 -03:00
.python-version Initial commit: Project foundation with base scraper and tests 2025-08-18 12:15:17 -03:00
claude.md Initial commit: Project foundation with base scraper and tests 2025-08-18 12:15:17 -03:00
main.py Initial commit: Project foundation with base scraper and tests 2025-08-18 12:15:17 -03:00
pyproject.toml Initial commit: Project foundation with base scraper and tests 2025-08-18 12:15:17 -03:00
status.md Initial commit: Project foundation with base scraper and tests 2025-08-18 12:15:17 -03:00
uv.lock Initial commit: Project foundation with base scraper and tests 2025-08-18 12:15:17 -03:00