Hacker News
jychang | 15 days ago | on: Claude Haiku 4.5
It’s not instantly fast, though. The context (KV cache) is probably ~20 GB of VRAM at max context size, and that’s going to take some time to read back from SSD no matter what.
TTFT (time to first token) will get slower if you export the KV cache to SSD.
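To put a rough number on that, here is a back-of-the-envelope sketch in Python. It takes the ~20 GB estimate from above and divides it by a sustained read bandwidth. The bandwidth figures are assumed round numbers for illustration, not measurements of any particular drive, and the function and variable names are made up for this example. It also ignores everything except the raw transfer (no PCIe copy to the GPU, no filesystem overhead), so treat the results as lower bounds.

```python
def load_seconds(size_gb: float, bandwidth_gb_per_s: float) -> float:
    """Lower bound on transfer time: size divided by sustained read bandwidth."""
    if bandwidth_gb_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    return size_gb / bandwidth_gb_per_s


# Rough KV cache size at max context, taken from the estimate above.
KV_CACHE_GB = 20.0

# Assumed sequential-read bandwidths in GB/s (illustrative ballparks only).
ASSUMED_BANDWIDTHS_GB_S = {
    "SATA SSD": 0.5,
    "PCIe 3.0 NVMe": 3.0,
    "PCIe 4.0 NVMe": 7.0,
}


def estimate_all(size_gb: float = KV_CACHE_GB) -> dict[str, float]:
    """Return the estimated load time in seconds for each assumed drive class."""
    return {
        name: load_seconds(size_gb, bw)
        for name, bw in ASSUMED_BANDWIDTHS_GB_S.items()
    }


if __name__ == "__main__":
    for name, seconds in estimate_all().items():
        print(f"{name}: at least {seconds:.1f} s added before the first token")
```

Under these assumptions the reload alone adds somewhere between about 3 and 40 seconds, which lands directly on TTFT since nothing can be generated until the cache is back in VRAM.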