{
"$type": "site.standard.document",
"canonicalUrl": "https://frankhecker.com/2015/03/01/useful-datasets-for-howard-county-election-analysis/",
"path": "/2015/03/01/useful-datasets-for-howard-county-election-analysis/",
"publishedAt": "2015-03-01T12:00:17.000Z",
"site": "at://did:plc:77mn3ult3b72tpvtqqva6tat/site.standard.publication/3mpfmfpu4u72n",
"tags": [
"datascience",
"hocodata",
"howardcounty",
"politics"
],
"textContent": "_tl;dr: I release two useful Howard County election datasets in preparation for future posts._\n\nIn the coming days and weeks I’ll be posting some analyses of Howard County election results. Unfortunately the data released by the [Howard County Board of Elections][Howard] and the [Maryland State Board of Elections][Marylan] is not always in the most useful form for analysis. In particular I was looking for per-precinct turnout statistics for the 2014 general election in Howard County, along with some way to match up precincts with the county council district of which they’re a part. That data is available in the [2014 general election results per precinct/district][2014 ge] published by the Howard County Board of Elections, but unfortunately that document is a PDF document.\n\nPDF files are great for reading by humans, but lousy for reading by machines. They violate guideline 8 in the [Open Data Policy Guidelines][Open Da] published by the [Sunlight Foundation][Sunligh]:\n\n> For maximal access, data must be released in formats that lend themselves to easy and efficient reuse via technology. … This means releasing information in open formats (or “open standards”), in machine-readable formats, that are structured (or machine-processable) appropriately. … While formats such as HTML and PDF are easily opened for most computer users, these formats are difficult to convert the information to new uses.\n\nSince the data I wanted wasn’t in a format I could use, I manually extracted the data from the PDF document and converted it into a useful format (Comma Separated Value or CSV format) myself. Then since someone else might find a use for them, I published the files online in a [datasets area][dataset] of my [Github hocodata repository][Github]. The first two files are as follows:\n\n- [hocomd-2014-precinct-council.csv][hocomd]. This dataset maps the 118 Howard County election precincts to the county council districts in which those precincts are included.\n- [hocomd-2014-general-election-turnout.csv][hocomd02]. This dataset contains turnout statistics for each of the 118 Howard County precincts in the 2014 general election, including the number of registered voters and ballots cast in each precinct on election day.\n\nStay tuned for some interesting ways to use this data.\n\n[Howard]: http://www.howardcountymd.gov/Departments.aspx?id=4294968268\n[Marylan]: http://www.elections.state.md.us\n[2014 ge]: http://www.howardcountymd.gov/WorkArea/linkit.aspx?LinkIdentifier=id&ItemID=6442477038&libID=6442477030\n[Open Da]: http://sunlightfoundation.com/opendataguidelines/\n[Sunligh]: http://sunlightfoundation.com/about/\n[dataset]: https://github.com/frankhecker/hocodata/tree/master/datasets\n[Github]: https://github.com/frankhecker/hocodata\n[hocomd]: https://raw.githubusercontent.com/frankhecker/hocodata/master/datasets/hocomd-2014-precinct-council.csv\n[hocomd02]: https://raw.githubusercontent.com/frankhecker/hocodata/master/datasets/hocomd-2014-general-election-turnout-by-precinct.csv\n\n *\n\nWalter Carson (wcarson@columbiaunion.net) - 2015-03-01 14:38{#268ded72-001}\n\nThank you. As always, of interest. How might such data be used to look at the state legislative districts, if at all? Best wishes. WEC Sent from my iPhone\n\nhecker - 2015-03-01 19:50{#268ded72-002}\n\nSee my future posts for some ideas on how this data might be used. Probably the first thing I'll do is look at different county council districts to see if there seems to be any real difference in 2014 general election turnout between the districts. A similar analysis could be done for legislative districts, or at least those portions of the districts within Howard County. (A more complete analysis would need data from Carroll County, Baltimore County, etc.)",
"title": "Useful datasets for Howard County election analysis"
}