Here are our notes from our most recent meeting/hack night on March 15, 2017.
What we’re doing/Did:
We’re processing the campaign finance contributions of Cuyahoga County
into an open, machine-readable format.
Contributions to politicians in Cuyahoga County are available at http://boe.cuyahogacounty.us/en-US/campaign-finance-reports.aspx
However, the reports are not machine-readable,
which greatly impedes any analysis and research.
Open Cleveland is converting the reports from their current state into a machine-readable format.
At our meeting on the 15th, we:
OCR’ed the text from the PDFs using Acrobat,
then used Tabula (http://tabula.technology/) to extract the text from the PDFs and transform it into CSV files,
then uploaded the CSVs into Google Sheets, which is flexible, machine-readable, and easily viewable, and
lastly, reviewed each year’s data with the original PDFs, correcting any spelling mistakes or formatting errors that were introduced in the ETL process.
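Some of the formatting errors mentioned above can be caught before the manual review. The following is only a minimal sketch, not the process we used at the meeting: it assumes a CSV with hypothetical column names `contributor` and `amount` (the real Tabula output may be laid out differently), collapses stray whitespace, and turns dollar strings into numbers.

```python
import csv
import io
from decimal import Decimal


def parse_amount(text):
    """Turn a string such as '$1,250.00' into a Decimal."""
    cleaned = text.replace("$", "").replace(",", "").strip()
    return Decimal(cleaned)


def clean_rows(csv_text):
    """Read CSV text and return a list of dicts with tidied values."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = []
    for raw in reader:
        # Collapse the runs of spaces that OCR tends to leave behind.
        row = {key.strip(): " ".join(value.split()) for key, value in raw.items()}
        # 'amount' is a hypothetical column name used for illustration.
        row["amount"] = parse_amount(row["amount"])
        rows.append(row)
    return rows


# Invented sample data, not taken from the real reports.
sample = 'contributor,amount\n"  Jane   Doe ","$1,250.00"\nExample PAC,500\n'
for row in clean_rows(sample):
    print(row["contributor"], row["amount"])
```

A pass like this does not replace comparing each entry with the original PDF; it only removes the most mechanical errors first.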
With the data in machine-readable format, you can begin to analyze and research the campaign contributions. Questions that can be answered with the data: How many people who live outside of Cleveland gave money to a particular politician? Who are the biggest campaign contributors? How many PACs have given the politician money? Where do most of the donations come from: the west side, the south side, or the east side?
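As a rough illustration of how two of these questions could be answered once the data is clean, here is a small sketch. The field names `contributor`, `city`, and `amount` are assumptions for the example, and the sample rows are invented rather than taken from the spreadsheets.

```python
from collections import defaultdict
from decimal import Decimal


def top_contributors(rows, n=3):
    """Return the n contributors with the largest total, biggest first."""
    totals = defaultdict(Decimal)  # Decimal() starts each total at 0
    for row in rows:
        totals[row["contributor"]] += row["amount"]
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)[:n]


def count_outside_city(rows, city="Cleveland"):
    """Count distinct contributors whose listed city is not the given one."""
    outside = {
        row["contributor"]
        for row in rows
        if row["city"].strip().lower() != city.lower()
    }
    return len(outside)


# Invented sample rows for illustration only.
rows = [
    {"contributor": "Example PAC", "city": "Cleveland", "amount": Decimal("1000")},
    {"contributor": "Jane Doe", "city": "Lakewood", "amount": Decimal("250")},
    {"contributor": "Example PAC", "city": "Cleveland", "amount": Decimal("500")},
]

print(top_contributors(rows, n=1))
print(count_outside_city(rows))
```

Counting PACs or grouping by side of town would follow the same pattern, but would need a column (or a geocoded location) that identifies them.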
Frank Jackson’s 2013-2016 Campaign Finance Contributions in Machine Readable format:
Will added the 2013 documents; this does not include 2013_F_Jackson_SemiAnnual.pdf
Ron reviewed the 2014 entries, which are available at
Kevin and Rob reviewed the 2015 entries;
2016 needs to be reviewed
(Want to help? Download the 2016 campaign finance reports for Jackson, F at http://boe.cuyahogacounty.us/en-US/campaign-finance-reports.aspx
then compare each in the spreadsheet to the
(Need more direction or help? Email us at email@example.com)
Need to geocode? Geocoding is the process of converting addresses into latitude and longitude, which is needed for any geospatial analysis and to display locations on web maps. Try https://geoservices.tamu.edu/
(free signup required)
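Before sending anything to a geocoder, it helps to reduce the contributions to a list of unique addresses so that each one is looked up only once. The sketch below does only that preparation step; it does not call the geoservices.tamu.edu service, and the field names `street`, `city`, `state`, and `zip` are assumptions about how the spreadsheet might be laid out.

```python
def unique_addresses(rows):
    """Return each distinct address once, in first-seen order."""
    seen = set()
    result = []
    for row in rows:
        # Hypothetical field names; missing fields are treated as empty.
        parts = [row.get(field, "").strip() for field in ("street", "city", "state", "zip")]
        address = ", ".join(part for part in parts if part)
        key = address.lower()  # compare case-insensitively
        if address and key not in seen:
            seen.add(key)
            result.append(address)
    return result


# Invented sample rows for illustration only.
rows = [
    {"street": "123 Example St", "city": "Cleveland", "state": "OH", "zip": "44114"},
    {"street": "123 EXAMPLE ST ", "city": "Cleveland", "state": "OH", "zip": "44114"},
    {"street": "9 Sample Ave", "city": "Lakewood", "state": "OH"},
]

for address in unique_addresses(rows):
    print(address)
```

The resulting list can then be saved and geocoded with whichever service you choose.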
Other news, data sources:
Ron J has access to data collected by the NEO Sewer District from its 29 rain stations, which record how much rain falls (not sure if they also collect winter precipitation).
It was suggested that we request the data from the NEO Sewer District.
http://www.meetup.com/open-cleveland - Meetup group where our meetings are scheduled.
https://opencleveland.slack.com - Slack team where our discussions happen between meetings (need an invite? Email opencleveland at gmail dot com)
https://github.com/orgs/opencleveland - code repository