Department of Education Civil Rights Data Collection

Some potential uses of this dataset:

  • Explore equity between different groups of students

  • Show distribution of Advance Placement courses across schools

Content

crdc.zip (853 MB) is a 1.3 GB (unzipped) dataset from http://ocrdata.ed.gov/

crdc-csv.zip (184 MB) is a 1.6 GB (unzipped) version of the same dataset in which every Excel workbook has been extracted into its constituent CSV files. Note that:

  • The name of each resultant CSV is in the form WORKBOOK-TAB.csv

  • For example, if the original workbook is named 01-LEA Form.xlsx with two tabs named "Definitions" and "Suppressed Data", then the resultant CSVs are named 01-LEA Form-Definitions.csv and 01-LEA Form-Suppressed Data.csv

  • The transformation from Excel workbook into its constituent CSVs is performed by a small Python script named xlsx_sheets.py found at https://github.com/boscomonkey/crdc-utils

crdc-csv-dc.zip (185 MB) has DC-only CSV files in addition to the ones in crdc-csv.zip. Their names all have the form *-DC.csv.

History

The dataset was downloaded by Chris from Dept of Ed onto his laptop.

Bosco copied it for the Teacher Data project - https://hackpad.com/Education-Project-Tinkering-List-vntRvjoQ9j6 - for the Nov 12, 2014 HackNight at CodeForDC meetup; then uploaded to Internet Archive - https://archive.org/details/DoEd_CRDC - so folks don't have to bug Chris.

To Do

  • Extract a subset that's just Washington DC data

Data and Resources

Additional Info

Field Value
Last Updated October 4, 2017, 14:37 (UTC)
Created September 3, 2016, 13:38 (UTC)