zlacker

What's up with all those equals signs anyway?

submitted by todsac+(OP) on 2026-02-03 09:37:40 | 683 points 189 comments
[view article] [source] [go to bottom]

NOTE: showing posts with links only show all posts
2. jojomo+T2[view] [source] 2026-02-03 10:00:56
>>todsac+(OP)
https://web.archive.org/web/20260203094902/https://lars.inge...

Did the site get the HN kiss of death?

◧◩
3. AlphaA+r3[view] [source] [discussion] 2026-02-03 10:05:23
>>lordna+T1
Same here. I did notice what I think was an actual error on someone's part, there was a chart in the files comparing black to white IQ distributions, and well, just look at it:

https://nitter.net/AFpost/status/2017415163763429779?s=201

Something clearly went wrong in the process.

◧◩
8. OJFord+h7[view] [source] [discussion] 2026-02-03 10:36:38
>>quibon+n5
I haven't seen them other than in the submission - but if the length matches up it may be that they were processed from raw email, the RFC defines a length to wrap at.

Edit: yes I think that's most likely what it is (and it's SHOULD 78ch; MUST 998ch) - I was forgetting that it also specifies the CRLF usage, it's not (necessarily) related to Windows at all here as described in TFA.

Here it is in my 'notmuch-more' email lib: https://github.com/OJFord/amail/blob/8904c91de6dfb5cba2b279f...

◧◩◪
14. debugn+E8[view] [source] [discussion] 2026-02-03 10:50:06
>>netsha+u5
Unicode labels U+000A as all of "LINE FEED (LF)", "new line (NL)" and "end of line (EOL)". I'm guessing different names were imported from slightly different character sets, although I understand the all-uppercase name to be the main/official one.

https://www.unicode.org/charts/PDF/U0000.pdf

◧◩
26. RHSeeg+Fc[view] [source] [discussion] 2026-02-03 11:23:10
>>ccppur+6c
It's not the character, its the way / context in which it's used

https://en.wikipedia.org/wiki/Metal_umlaut

54. maarti+an[view] [source] 2026-02-03 12:29:31
>>todsac+(OP)
Fun how the archive.today article near the top has this exact issue

https://pastes.io/correspond

>>46843805

◧◩
56. lvncel+Ko[view] [source] [discussion] 2026-02-03 12:40:16
>>ruhith+Vh
> It's the same class of bug as manually parsing HTML with regex, it works right up until it doesn't

I'm sure you already know this one, but for anyone else reading this I can share my favourite StackOverflow answer of all time: https://stackoverflow.com/a/1732454

◧◩◪◨
63. topaz0+Jt[view] [source] [discussion] 2026-02-03 13:17:51
>>db_adm+q8
https://en.wikipedia.org/wiki/Metal_umlaut

The writer presumably knows that umlauts and other non-ascii characters are functional in many languages. "rock döts" is poking fun at the trend in a certain tranche of anglophone rock/metal to use them in a purely aesthetic way in band names etc.

◧◩◪◨
72. MrGilb+lw[view] [source] [discussion] 2026-02-03 13:33:03
>>josefx+br
For anyone wondering about the railroad switch post: https://stackoverflow.com/questions/11227809/why-is-processi...
◧◩◪
74. grimgr+yx[view] [source] [discussion] 2026-02-03 13:40:41
>>SCdF+9m
you presume cor=rect

https://www.jmail.world/thread/EFTA02512824?view=person

76. JKCalh+gy[view] [source] 2026-02-03 13:44:32
>>todsac+(OP)
(The title of the blog reminded me the late Bob Pease [1] who had the signature, "What's all this XXX stuff, anyhow?" [2] where XXX might be "noise gain", "capacitor leakage"…)

[1] https://en.wikipedia.org/wiki/Bob_Pease

[2] https://www.qsl.net/n9zia/pease/index.html

◧◩
90. jcynix+8G[view] [source] [discussion] 2026-02-03 14:26:38
>>tibors+Qa
"BITNET was a co-operative university computer network in the United States founded in 1981 by Ira Fuchs at the City University of New York (CUNY) and Greydon Freeman at Yale University."

https://en.wikipedia.org/wiki/BITNET

BITNET connected mainframes, had gateways to the Unix world and was still active in the 90s. And limited line lengths … some may remember SYSIN DD DATA … oh my goodness …

https://www.ibm.com/docs/en/zos/2.1.0?topic=execution-systsi...

◧◩◪
91. jcynix+DG[view] [source] [discussion] 2026-02-03 14:29:51
>>Pinus+Uc
EBCDIC wasn't the problem, this was (part of) the problem:

https://www.ibm.com/docs/en/zos/2.1.0?topic=execution-systsi...

And BITNET …

◧◩
98. rsynno+XS[view] [source] [discussion] 2026-02-03 15:27:31
>>noduer+Tc
But... pigs _do_ have cloven hooves! The issue is that they're not ruminant.

That said, there is a _possibly_ kosher pig: https://en.wikipedia.org/wiki/Babirusa#Relationship_with_hum...

106. kstrau+341[view] [source] 2026-02-03 16:11:30
>>todsac+(OP)
For context, this is the Lars Ingebrigtsen who wrote the manual for Gnus[0], a common Emacs package for reading email and Usenet. It’s clever, funny, and wildly informative. Lars has probably forgotten more about email parsing than 99% of us here will ever have learned.

The manual itself says[1]:

> Often when I read the manual, I think that we should take a collection up to have Lars psycho-analysed.

0: https://www.gnu.org/software/emacs/manual/html_mono/gnus.htm...

1: https://www.gnus.org/manual.html

◧◩◪◨⬒⬓⬔
148. kstrau+0K1[view] [source] [discussion] 2026-02-03 18:52:56
>>saila+sz1
Ugh, I didn't realize it was still that bad.

How far can you get with setting core.autocrlf on your machine? See https://git-scm.com/book/en/v2/Customizing-Git-Git-Configura...

◧◩◪◨⬒⬓⬔⧯
156. floren+1X1[view] [source] [discussion] 2026-02-03 19:48:48
>>kstrau+lJ1
I've seen this explanation multiple times through the years, but as I said it's entirely possible it was just a post-hoc thing somebody came up with. But as you said, it's fun to argue/think about, so here's some more. I'm talking about the ASR-33 because they're the archetypal printing terminal in my mind.

If you look at the schematics for an ASR-33, there's just 2 transistors in the whole thing (https://drive.google.com/file/d/1acB3nhXU1Bb7YhQZcCb5jBA8cer...). Even the serial decoding is done electromechanically (per https://www.pdp8online.com/asr33/asr33.shtml), and the only "flow control" was that if you sent XON, the teletype would start the paper tape reader -- there was no way, as far as I can tell, for the teletype to ask the sender to pause while it processes a CR.

These things ran at 110 baud. If you can't do flow control, your only option if CR takes more than 1/10th of a second is to buffer... but if you can't do flow control, and the computer continues to send you stuff at 110 baud, you can't get that buffer emptied until the computer stops sending, so each subsequent CR will fill your buffer just a little bit more until you're screwed. You need the character following CR (which presumably takes about 2/10ths of a second) to be a non-printing character... so splitting out LF as its own thing gives you that and allows for the occasional case where doing a linefeed without a carriage return is desirable.

Curious Marc (https://www.curiousmarc.com/mechanical/teletype-asr-33) built a current loop adapter for his ASR-33, and you'll note that one of the features is "Pin #32: Send extra NUL character after CR (helps to not loose first char of new line)" -- so I'd guess that on his old and probably worn-out machine, even sending LR after CR doesn't buy enough time and the next character sometimes gets "lost" unless you send a filler NUL.

Now, I haven't really used serial communications in anger for over a decade, and I've never used a printing terminal, so somebody with actual experience is welcome to come in and tell me I'm wrong.

◧◩◪◨
170. jd3+Gr2[view] [source] [discussion] 2026-02-03 22:21:11
>>rurban+p72
I discovered X-Face[0] through gmane! Such a blast from the past.

[0]: https://en.wikipedia.org/wiki/X-Face

◧◩◪
172. btown+uD2[view] [source] [discussion] 2026-02-03 23:26:16
>>jandre+X02
Email is one of the very few success cases of the xkcd Standards meme: https://xkcd.com/927/ - and it's due to practicality and ingenuity on the part of people who made very creative parsers and placed real-world understanding behind every word of the early RFCs.

Without a unified email standard, the world would look incredibly different today, especially as it bootstrapped open communication between different countries and institutions in developing every protocol since.

◧◩◪◨
179. mxfh+5d3[view] [source] [discussion] 2026-02-04 03:29:10
>>grimgr+yx
Yet somehow there is always a version of the same thread that's not mangled https://www.jmail.world/thread/EFTA02512795?view=inbox

Can only assume DOJ overpaid the law firms like 5x by not specifying deliveries need to be deduplicated first.

◧◩◪◨⬒⬓⬔
184. Democr+H84[view] [source] [discussion] 2026-02-04 11:45:56
>>kstrau+QT1
I don't think your comment assumes the right givens. I just tried in Vivaldi (i.e. Chrome) and this snippet:

    <!doctype html>
    A<!—- Don't count <hr> this! -—> but do count <hr> that -->Z
gets fixed and rendered as

    <!DOCTYPE html>
    <html><head></head><body>A<!--—- Don't count <hr--> this! -—&gt; but do count <hr> that --&gt;Z</body></html>
Another surprise is that

    <!doctype html>
    A<!—- Don't count this! -— but do count that -->Z
gets rewritten to

    <!DOCTYPE html>
    <html><head></head><body>A<!--—- Don't count this! -— but do count that ---->Z</body></html>
Note the insertion of extra `--` minus-hyphens.

This is what MDN (https://developer.mozilla.org/en-US/docs/Web/HTML/Guides/Com...) has to say:

Comments start with the string `<!--` and end with the string `-->`, generally with text in between. This text cannot start with the string `>` or `->`, cannot contain the strings `-->` or `--!>`, nor end with the string `<!-`, though `<!` is allowed. [...] The above is true for XML comments as well. In addition, in XML, such as in SVG or MathML markup, a comment cannot contain the character sequence `--`.

Meaning that you can recognize HTML comments with (one branch of) a RegEx—you start wherever you see `<!--` and consume everything up to one of the listed alternatives. No nesting required.

Be it said that I find the precise rules too convoluted for what they do. Especially XML's prohibition on `--` in comments is ridiculous taken on its own. First you tell me that a comment ends with three characters `-->`, and then you tell me I can't use the specific substring `--`, either? And why can't I use `--!>`?

An interesting bit here is that AFAIK the `<!` syntax was used in SGML as one of the alternatives to write a 'lone tag', so instead of `<hr></hr>` or `<hr/>` (XHTML) or `<hr>` (HTML) you could write `<!hr>` to denote a tag with no content. We should have kept this IMO.

*EDIT* On the quoted HTML source you see things like `-—` (hyphen-minus, em-dash). This is how the Vivaldi DevTools render it; my text editor and HN comment system did not alter these characters. I have no idea whether Chrome's rendering engine internally uses these em-dashes or whether it's just a quirk in DevTool text output.

[go to top]