zlacker
[parent]
[thread]
1 comments
1. attemp+(OP)
[view]
[source]
2025-06-03 20:20:04
I checked the BlEU-Score and Perplexity of popular models and both have stagnated around 2021. As a disclaimer this was a cursory check and I didn't dive into the details of how individuals scores were evaluated.
replies(1):
>>mounta+ra2
◧
2. mounta+ra2
[view]
[source]
2025-06-04 16:37:03
>>attemp+(OP)
on what benchmarks? pretty much every major one is linear improvement
[go to top]