Static data mirror for SansadSaar. Holds the document-centric corpora: DRSC committee reports, CAG audits, Bills, Law Commission reports, Financial Committee reports.
Served at sansadsaar-data.naklitechie.com.
Upstream sources don't permit cross-origin browser requests. GitHub Actions scrapes each corpus on a schedule and re-publishes JSON + text at a CORS-friendly endpoint the SansadSaar app reads directly — no backend.
- SansadSaar — the app
- sansadsaar-proceedings-data — Debates, Questions
- sansadsaar-gazettes — Central Gazette
- sansadsaar-lc — Law Commission PDF archive
DRSC scraper foundation (scraper.py, pdf_utils.py, config.py) is vendored from pranaykotas/parliamentwatch. Thanks to Pranay Kotasthane for the original work. CAG, Bills, LC, and FC scrapers are independent additions.