Found a Github repository that curates links to “awesome public datasets”:
https://github.com/caesar0301/awesome-public-datasets
Judging from the fact that the repo has > 6000 stars, I might be a little late to the party …
Tons of potential here but just to spitball a few ideas:
- Shiny app with choropleth map tracking spread of Ebola
- Classification algorithm for “age-appropriateness” of a piece of text based on modeled blog data
- Visualization of proportion of work on view (versus not on view) at the Minneapolis Institute of Art by artist nationality