Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The simplest and quickest benchmark is to do a rap battle between GPT-4 and the local models. Copy paste the responses between them to enable the cross-model battle.

It is instantly clear how strong the model is relative to GPT-4.



Have you tried it? How did it do?


Someone here did Bard v GPT-4 a few days ago, and GPT-4 mopped the floor with Bard: https://news.ycombinator.com/item?id=35252612


You're talking to the model right now.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: