zlacker
[parent]
[thread]
2 comments
1. mounta+(OP)
[view]
[source]
2025-06-03 17:04:48
check the benchmarks or make one of your own
replies(1):
>>attemp+Ax
◧
2. attemp+Ax
[view]
[source]
2025-06-03 20:20:04
>>mounta+(OP)
I checked the BlEU-Score and Perplexity of popular models and both have stagnated around 2021. As a disclaimer this was a cursory check and I didn't dive into the details of how individuals scores were evaluated.
replies(1):
>>mounta+1I2
◧◩
3. mounta+1I2
[view]
[source]
[discussion]
2025-06-04 16:37:03
>>attemp+Ax
on what benchmarks? pretty much every major one is linear improvement
[go to top]