zlacker

1. awiese+(OP) 2023-07-29 09:38:02
> I have a small personal project of cataloging all the movies that played on television in the 90s.

Any plans to publish this list? Would surely make a super interesting git repo for example...

replies(1): >>Leonar+iy
2. Leonar+iy 2023-07-29 14:24:00
>>awiese+(OP)
I've been working on it for about a year. At the moment it exists as a git repo (as you say), but for it to be of use you need at the very least the corresponding SQLite DB (~70MB), and for it to be really fun (and work with the frontend) you need the listing pages themselves (high-res, ~50GB). Neither of those is in the git repo; I keep them separate.

If you message me privately I'd be happy to share the data. The git repos are:

https://github.com/patsmad/nyt-listings
https://github.com/patsmad/nyt-listings-app

I use them for curation at the moment so the READMEs leave ... something to be desired. I hope to have a read-only version up and running by the end of August, although without a Wikipedia-like effort I don't see how I would curate it fully, so it'll probably always be a little touch and go as to what data is available.

The stats I have from curating are: 369345 individual movie "listing boxes" (I would guess around 98% accuracy, although if I were to hazard a guess the actual number of boxes should probably be around 400K), of which 321308 are matched to a movie, and 296941 of those are for sure unique. Overall, 202203 have channel + time + duration matched up using the VCR listings (which the New York Times conveniently published from around November 20th 1990, and the Internet Archive very nicely has the program the VCRs used to encode/decode those codes). There are 21530 unique movies at the moment.
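
To put those counts in proportion, here is a minimal Python sketch that turns them into rates. The numbers are the ones quoted above; the variable names, the `share` helper, and the choice of denominators (matched out of all boxes, unique out of matched, VCR-matched out of all boxes) are assumptions made for illustration, not something taken from the repos.

```python
# Counts quoted in the comment above; names are illustrative only.
listing_boxes = 369_345      # individual movie "listing boxes"
matched_to_movie = 321_308   # boxes matched to a movie
surely_unique = 296_941      # matched boxes that are for sure unique
vcr_matched = 202_203        # boxes with channel + time + duration matched
unique_movies = 21_530       # distinct movies so far


def share(part: int, whole: int) -> float:
    """Return part / whole as a fraction, rejecting a non-positive whole."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return part / whole


# Assumed denominators: see the note above the code block.
rates = {
    "matched to a movie (of all boxes)": share(matched_to_movie, listing_boxes),
    "for sure unique (of matched)": share(surely_unique, matched_to_movie),
    "VCR-matched (of all boxes)": share(vcr_matched, listing_boxes),
}

# Average number of matched listings per distinct movie.
listings_per_movie = matched_to_movie / unique_movies

for label, value in rates.items():
    print(f"{label}: {value:.1%}")
print(f"matched listings per unique movie: {listings_per_movie:.1f}")
```

Under those assumptions, roughly 87% of boxes are matched to a movie and a bit over half have full channel, time, and duration information.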

If I understand the New York Times correctly, then none of this can be commercialized since I scraped the core data (the pages themselves) from the TimesMachine, so this really is a personal project, which I'm happy to share. I've made a few Letterboxd lists from the corresponding data, for example a series of lists with all of the movies (and play times) for films playing on September 1 in particular, e.g. https://letterboxd.com/patsmad/list/television-films-septemb... The count is rather consistent at around 100 films a day: for 1990-1999 it was 106, 118, 74, 74, 89, 99, 98, 110, 97, 93. As is obvious, I can talk about this for days.
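
As a quick check on the "around 100 films a day" figure, the ten September 1 counts can be averaged in a few lines of Python. This is only a sketch: the counts are the ones listed above, and the variable names are made up for illustration.

```python
from statistics import mean

# September 1 film counts for 1990 through 1999, in the order given above.
counts = [106, 118, 74, 74, 89, 99, 98, 110, 97, 93]
films_on_sept_1 = dict(zip(range(1990, 2000), counts))

average = mean(films_on_sept_1.values())
low = min(films_on_sept_1.values())
high = max(films_on_sept_1.values())

print(f"average: {average:.1f} films, range: {low} to {high}")
```

The average comes out a little under 100, with the lowest years in the mid-70s and the highest just under 120.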

I'm not sure of the best way to do private messages. My email is associated with this account, but I have no idea if you can see that. I usually just lurk on HN.

replies(1): >>awiese+Wb1
3. awiese+Wb1 2023-07-29 18:29:42
>>Leonar+iy
Wow, that sounds awesome! Definitely do a "Show HN" when you feel it's right!