zlacker

[parent] [thread] 11 comments
1. ur-wha+(OP)[view] [source] 2022-02-18 13:26:48
https://archive.is/GKvGU
replies(2): >>flerch+S8 >>concin+we
2. flerch+S8[view] [source] 2022-02-18 14:20:33
>>ur-wha+(OP)
Disable javascript and the entire article loads on the economist.
replies(1): >>Scound+fm
3. concin+we[view] [source] 2022-02-18 14:44:22
>>ur-wha+(OP)
This extension is effective, and by default restricts its access only asks for permissions for supported news sites (instead of having permission to access "All site data" like too many browser extensions):

https://gitlab.com/magnolia1234/bypass-paywalls-chrome-clean

replies(1): >>Tijdre+IZ
◧◩
4. Scound+fm[view] [source] [discussion] 2022-02-18 15:18:03
>>flerch+S8
Lynx is the best reader for the economist.
replies(4): >>rahimn+Oo >>titano+YA >>networ+KL >>bduers+4f1
◧◩◪
5. rahimn+Oo[view] [source] [discussion] 2022-02-18 15:28:36
>>Scound+fm
Back in 2016, The Economist used to block access from lynx. You'd get an error like this (unless you spoofed the user agent to be something other than lynx):

Error 403 You are banned from this site. Please contact via a different client configuration if you believe that this is a mistake.

You are banned from this site. Please contact via a different client configuration if you believe that this is a mistake.

  Guru Meditation:

   XID: 84740260
     __________________________________________________________________

   Varnish cache server
replies(1): >>Scound+Vr
◧◩◪◨
6. Scound+Vr[view] [source] [discussion] 2022-02-18 15:41:10
>>rahimn+Oo
Tbh, I was probably running some clone like bobcat.
◧◩◪
7. titano+YA[view] [source] [discussion] 2022-02-18 16:22:58
>>Scound+fm
I don't love reading long articles in fixed-width fonts.
replies(1): >>dredmo+q71
◧◩◪
8. networ+KL[view] [source] [discussion] 2022-02-18 17:14:04
>>Scound+fm
W3M is fine too.
◧◩
9. Tijdre+IZ[view] [source] [discussion] 2022-02-18 18:24:39
>>concin+we
Here's the Firefox version: https://gitlab.com/magnolia1234/bypass-paywalls-firefox-clea...
◧◩◪◨
10. dredmo+q71[view] [source] [discussion] 2022-02-18 19:00:36
>>titano+YA
Then pipeline to a PS/PDF generator.

For most modern Web publishing, this is mostly a matter of finding and extracting the <article> block, as well as metadata (title, byline, dateline).

html-xml-tools is quite useful for this.

I'd created a WaPo extractor that reduced pagesize by about 95%, stripped the nags and paywalls, etc. Endpoint was HTML, but that could just as easily have generated PDF or ePub if I'd wanted.

replies(1): >>titano+a9k
◧◩◪
11. bduers+4f1[view] [source] [discussion] 2022-02-18 19:42:56
>>Scound+fm
Outline works well too:

https://outline.com/jtdYRj

◧◩◪◨⬒
12. titano+a9k[view] [source] [discussion] 2022-02-25 01:57:28
>>dredmo+q71
I applaud people who take advantage of the fact that the internet is still largely machine-readable and hackable.

I am much lazier, but I use "reader mode" to similar effect.

[go to top]