Din Solr, toate companiile cu URL anofm
Vezi ce ai trimis azi
Workflow: opencode_scraper_to_solr.yml
~27 companii -> commit -> continua
Sterge run-urile completed
Noua zi = reextrage + reincepe
curl -s -u solr:SolrRocks "https://solr.pevitor.ro/solr/job/select?q=url:*anofm*&rows=0&facet=true&facet.field=company&facet.limit=10000"
gh api "repos/peviitor-ro/peviitor_opencode_AI_scrapers/actions/runs?per_page=100" -q '.workflow_runs[] | select(.created_at >= "2026-04-11T20:00:00Z") | .id' | wc -l
Workflow: .github/workflows/opencode_scraper_to_solr.yml din peviitor-ro/peviitor_opencode_AI_scrapers
gh workflow run .github/workflows/opencode_scraper_to_solr.yml -f company='NUME_COMPANIE'
3a. Trimite ~27 companii
3b. Salveaza scraped_today.json + commit
3c. Sterge 1 pagina de completed runs
3d. Repeat
# Batch + commit node scrape_remaining.js git add scraped_today.json && git commit -m "Track X companies" && git push # Curata 1 pagina completed gh api ".../actions/runs?per_page=100" -q '[.workflow_runs[] | select(.status == "completed")] | .[] | .id' | while read id; do gh api -X DELETE ".../actions/runs/$id"; done
# Stergem run-urile completed (nu cele active/queued) gh api "repos/peviitor-ro/peviitor_opencode_AI_scrapers/actions/runs?per_page=100" -q '[.workflow_runs[] | select(.status == "completed")] | .[] | .id' | while read id; do gh api -X DELETE "repos/peviitor-ro/peviitor_opencode_AI_scrapers/actions/runs/$id"; done
Stergem scraped_today.json - cream fisier NOU - reextragem din Solr