
I don't know, but at least older Qwen models were a bit confused about which words belong to which languages, and recent ones seem noticeably less sure about ja-JP in general. Maybe it vaguely relates to Hanzi/Kanji characters being more coarse-grained than Latin alphabets, so that there aren't enough characters in a given text to tell the languages apart, or something.

