Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
qcnguy
2 days ago
|
parent
|
context
|
favorite
| on:
'Attention is all you need' coauthor says he's 'si...
OpenAI have talked about it. The neural architecture needs to let the model handle the case where there's nothing worth attending to, as softmax requires attention to be allocated to all tokens but sometimes there's nothing worth it.
Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: