feat: add job_scout — automated CH job-board scraper for target companies

Pulls fresh postings from the quarterly target-company list, filters by Swiss
location / EU-remote eligibility, scores fit against profile keywords, tracks
seen jobs, and writes a markdown report.

Adapters (13 companies automated):
- Workday: NVIDIA, Novartis
- Ashby: Kraken, OpenAI
- Greenhouse: Anthropic
- PCSX (Eightfold): Microsoft
- WordPress AJAX: Sygnum
- Playwright (headless + stealth): Google, Apple, Meta, Roche, Cisco, IBM

4 companies (Sonova, Coinbase, AMINA, Bitcoin Suisse) resist automation
(server-side bot blocks / JS widgets) and surface as a manual-check checklist.

venv, reports/, and state/ are gitignored; only scout.py + requirements.txt
are tracked.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This commit is contained in:
2026-05-22 10:12:27 +02:00
parent 1fde4c6b34
commit eeec77b1e3
4 changed files with 741 additions and 1 deletions
+9
View File
@@ -20,6 +20,15 @@ Thumbs.db
__pycache__/
*.pyc
# job_scout runtime artifacts (keep scout.py + requirements.txt only)
job_scout/.venv/
job_scout/reports/
job_scout/state/
# One-off job-board data pulls (debug artifacts)
*_jd.json
*_jobs*.json
# Editor
*.swp
*.swo