zlacker

[return to "Which programming languages are most token-efficient?"]
1. janals+Xj[view] [source] 2026-01-12 04:03:50
>>tehnub+(OP)
This is kind of just a measurement of how representative a language is in the distribution of the tokenizer training. You could have a single token equal to “public static void main”.
◧◩
2. make3+rq[view] [source] 2026-01-12 05:17:31
>>janals+Xj
If you look at the list, you'll see that you're incorrect, as C and JavaScript are not at the top.

Seeing all the C languages and JavaScript at the bottom like this makes me wonder if it's not just that Curly brackets take a lot of tokens.

[go to top]