From Objects to Data

How we mangle the N.Y. Times datasets…
…to say something meaningful.

About this project


All humanities disciplines are confronted with more and more digital material. This digital material can be studied as individual objects or descriptions of events, but from a digital perspective they can also be considered as data points. Digital methods allow researchers to study relations between objects form a different perspective and on a larger scale.


During this course we look at the digital data sources from the N.Y. Times and at the same time work with, and look at, tools that are relevant to the humanities from a programmatic perspective. Think of AWK, sed and the CLI and how they can be used to parse data, write small programmes and help with the analyses of the available datasets.

Our Goal

Mangle the N.Y. Times datasets and say something meaningful about our findings.

Research Question

The research question that will be the focus of our mangling sessions: What could be the most likely explanation(s) for the fact that the humanities stopped being featured within front-page articles from the 1990’s and onwards?


  • S. Bergmans
  • J. Lam
  • W. Nanninga
  • C. Sleutel
  • S. Wisselink

Our Findings

You can read our findings by clicking
this link

Download the source code

You can download the source code from GitHub if you want to have a look.

Download Source

Contact Us

Feel free to e-mail us to provide some feedback on our work, give us suggestions to improve our code, wish us luck or to just say hello!