zlacker

[parent] [thread] 7 comments
1. Scound+(OP)[view] [source] 2022-02-18 15:18:03
Lynx is the best reader for the economist.
replies(4): >>rahimn+z2 >>titano+Je >>networ+vp >>bduers+PS
2. rahimn+z2[view] [source] 2022-02-18 15:28:36
>>Scound+(OP)
Back in 2016, The Economist used to block access from lynx. You'd get an error like this (unless you spoofed the user agent to be something other than lynx):

Error 403 You are banned from this site. Please contact via a different client configuration if you believe that this is a mistake.

You are banned from this site. Please contact via a different client configuration if you believe that this is a mistake.

  Guru Meditation:

   XID: 84740260
     __________________________________________________________________

   Varnish cache server
replies(1): >>Scound+G5
◧◩
3. Scound+G5[view] [source] [discussion] 2022-02-18 15:41:10
>>rahimn+z2
Tbh, I was probably running some clone like bobcat.
4. titano+Je[view] [source] 2022-02-18 16:22:58
>>Scound+(OP)
I don't love reading long articles in fixed-width fonts.
replies(1): >>dredmo+bL
5. networ+vp[view] [source] 2022-02-18 17:14:04
>>Scound+(OP)
W3M is fine too.
◧◩
6. dredmo+bL[view] [source] [discussion] 2022-02-18 19:00:36
>>titano+Je
Then pipeline to a PS/PDF generator.

For most modern Web publishing, this is mostly a matter of finding and extracting the <article> block, as well as metadata (title, byline, dateline).

html-xml-tools is quite useful for this.

I'd created a WaPo extractor that reduced pagesize by about 95%, stripped the nags and paywalls, etc. Endpoint was HTML, but that could just as easily have generated PDF or ePub if I'd wanted.

replies(1): >>titano+VMj
7. bduers+PS[view] [source] 2022-02-18 19:42:56
>>Scound+(OP)
Outline works well too:

https://outline.com/jtdYRj

◧◩◪
8. titano+VMj[view] [source] [discussion] 2022-02-25 01:57:28
>>dredmo+bL
I applaud people who take advantage of the fact that the internet is still largely machine-readable and hackable.

I am much lazier, but I use "reader mode" to similar effect.

[go to top]