zlacker

[return to "Which programming languages are most token-efficient?"]
1. janals+Xj[view] [source] 2026-01-12 04:03:50
>>tehnub+(OP)
This is kind of just a measurement of how representative a language is in the distribution of the tokenizer training. You could have a single token equal to “public static void main”.
◧◩
2. make3+rq[view] [source] 2026-01-12 05:17:31
>>janals+Xj
If you look at the list, you'll see that you're incorrect, as C and JavaScript are not at the top.

Seeing all the C languages and JavaScript at the bottom like this makes me wonder if it's not just that Curly brackets take a lot of tokens.

◧◩◪
3. xigoi+7A[view] [source] 2026-01-12 06:49:57
>>make3+rq
I imagine that having to write

  for (int index = 0; index < size; ++index)
instead of

  for index in 0...size
eats up a lot of tokens, especially in C where you also need this construct for iterating over arrays.
[go to top]