Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I disagree. I'd include overfitting for LLMs as creating unreasonably strong connections to individual sequences used for training, whereas a good mix of that and connections between chunks of those sequences are required.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: