Below are instructions on how you can generate your own plots.
Make sure to get the latest versions of the required software.
- git.
-
Obtain a copy of the following files from this repository:
clean_raw_csvs.pyenroll_data_cleaner.pyfix_inconsistent_csv.pylist_all_files.ps1plot.pyrun.ps1
-
Download, or clone, one of the data repositories. You can find a link to each data repository in the main README file in this repository.
-
Next, put all the files mentioned in the first step, except the
run.ps1file, in the root directory of the data repository folder. Put therun.ps1file outside of the data repository folder.After completing this step, you should have a file structure that looks something like this:
Desktop ├─ run.ps1 └─ 2022Fall (Data Repository) ├─ clean_raw_csvs.py ├─ enroll_data_cleaner.py ├─ fix_inconsistent_csv.py ├─ list_all_files.ps1 ├─ plot.py | (remaining items are from the repository) ├─ raw ├─ overall ├─ section . . . └─ plotconfig.txt -
Install the relevant Python dependencies (
pandas,seaborn,matplotlib). Arequirements.txtfile is provided in this directory that you can use to help you install those dependencies. -
Make the appropriate modifications to the files.
- In
plot.py, you should modify the value forPROCESS_COUNTif you're planning on running this program on a significantly weaker system (e.g., instead of10, use a value like5). If you're runningplot.pyin an environment that doesn't support running multiple processes, you will need to make the appropriate modifications to the code. - Make adjustments to the
plotconfig.txtfile as needed.
- In
-
Finally, run
run.ps1to start the cleaning & processing & plotting process.
These instructions are not comprehensive, nor were they meant to be; thus, if you need any help, please create an issue here.