
We understand exactly how it works. It just works in such a way that we cannot predict the outcome, which makes it pretty bad for many applications. Just because you can't explain how it works doesn't mean it isn't understood how it works.


We really don't know why LLMs have most of their emergent capabilities. Much of the GPT-3 paper was about being surprised by them.


Maybe we don't disagree then.

We know how LLMs work: parameters, training data, a random number generator, that sort of thing.

We can't predict exactly what it will output, because sampling runs through an RNG, and RNGs are unpredictable by design. We know that. So individual outputs surprise us, but that surprise is itself unsurprising.
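To make the point concrete, here is a minimal sketch of the sampling step, assuming hypothetical logits for a few candidate tokens (the token names and values are made up for illustration). The model's forward pass is deterministic; the run-to-run variation enters only at the final RNG draw:

```python
import math
import random

# Hypothetical logits for three candidate next tokens (illustrative values only).
logits = {"cat": 2.0, "dog": 1.5, "rug": 0.1}

def softmax(scores):
    # Convert raw scores into a probability distribution.
    exps = {tok: math.exp(s) for tok, s in scores.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}

def sample_token(scores, rng):
    # Draw the next token; this RNG call is where unpredictability enters.
    probs = softmax(scores)
    tokens = list(probs)
    return rng.choices(tokens, weights=[probs[t] for t in tokens], k=1)[0]

# Same "model", different seeds: the sampled token can differ between runs.
print(sample_token(logits, random.Random(0)))
print(sample_token(logits, random.Random(1)))
```

With a fixed seed the output is reproducible, which is why "unpredictable" here means "not predictable without running it", not "nondeterministic in principle".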




