Remove strip_nikkud from all pipeline files; use ktiv_male directly.
Fix case-insensitive binyan matching in detail scraper (og:description uses UPPERCASE).
Fix integration test slugs and test limits.
Delete legacy CSVs, stale .apkg, and dead scripts from git.
Add vulture to pre-commit hook.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
60 lines
1.1 KiB
Text
archive
nikkud.csv
practice.py
cardinal_one_to_ten.*
*.swp
bin**
lib**
include**
lib64**
pyvenv.cfg
venv/
__pycache__/
*.pyc
.pytest_cache/

# Large generated cache files (rebuild locally)
data/benyehuda_index.json

# Audio directories (large; rebuild locally)
data/audio/
data/audio_conj/

# Image cache and downloads (rebuild with image_fetch.py)
data/image_cache.json
data/images/

# Output .apkg files (generated by pipeline)
output/

# Internal / private files (not for public repo)
ANKIWEB_DESCRIPTION.md
PROJECT_NOTES.md
PROJECTS.md
SPRINT_LOG.md
CLAUDE.md
RECOMMENDATIONS.md

# Intermediate scrape progress files
data/ktiv_male_forms.json.partial
data/ktiv_male_forms_partial.json
data/ktiv_scrape_progress.json
data/noun_slug_map_progress.json
data/top_verbs_to_scrape.json

# EPUB source files (large; user-specific)
data/epubs/

# Stray deck files
Everything__*.apkg
*.apkg

# Legacy CSV files (replaced by data/words.json)
*.csv
data/*.csv

# Dead whitelist files
vulture_whitelist.py

# Release artifacts: distributed via Forgejo releases, not committed to tree
releases/