NakliTechie/parliamentwatch-data

parliamentwatch-data

Static data mirror for SansadSaar. It holds the document-centric corpora: DRSC committee reports, CAG audits, Bills, Law Commission reports, and Financial Committee reports.

Served at sansadsaar-data.naklitechie.com.

Why a separate repo?

Upstream sources don't permit cross-origin browser requests. GitHub Actions scrapes each corpus on a schedule and re-publishes JSON + text at a CORS-friendly endpoint that the SansadSaar app reads directly, with no backend.
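The re-publish step described above could look roughly like the following. This is a minimal sketch, not the repo's actual code: the function name `publish_corpus`, the record fields (`id`, `title`, `text`), and the `index.json` / `text/<id>.txt` layout are all assumptions made for illustration.

```python
import json
from pathlib import Path


def publish_corpus(records, out_dir, corpus):
    """Write one corpus as static files (hypothetical layout).

    Produces <out_dir>/<corpus>/index.json plus one
    <out_dir>/<corpus>/text/<id>.txt file per record.
    """
    corpus_dir = Path(out_dir) / corpus
    text_dir = corpus_dir / "text"
    text_dir.mkdir(parents=True, exist_ok=True)

    index = []
    for rec in records:
        # Extracted text goes in its own file so the index stays small.
        text_path = text_dir / f"{rec['id']}.txt"
        text_path.write_text(rec["text"], encoding="utf-8")
        index.append({
            "id": rec["id"],
            "title": rec["title"],
            # Relative URL, resolved by the app against the static host.
            "text_url": f"{corpus}/text/{rec['id']}.txt",
        })

    index_path = corpus_dir / "index.json"
    index_path.write_text(
        json.dumps(index, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return index_path
```

Because the output is plain files, any static host that serves permissive CORS headers can publish the directory as-is, and the browser app only needs to fetch the index and then the text files it wants.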

Family

Credits

The DRSC scraper foundation (scraper.py, pdf_utils.py, config.py) is vendored from pranaykotas/parliamentwatch. Thanks to Pranay Kotasthane for the original work. The CAG, Bills, Law Commission (LC), and Financial Committee (FC) scrapers are independent additions.

About

Static mirror of Indian Parliamentary Committee reports from sansad.in. Daily-updated JSON + extracted text on GH Pages. Powers SansadLocal.
