Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yes, but bigger models are still more capable. Models shrinking (iso-performance) just means that people will train and use more capable models with a longer context.


Of course they are! Both are important and will be around and used for different reasons




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: