Why not a much shorter list with adjectives? I think the last time I looked at this, a few disjoint lists of 256 words, plus a few 16 word adjective lists gets down to ~1m accuracy. Dropping adjectives just reduces accuracy.
> big red plate, small fuzzy ball, wavy green brick
With three disjoint short lists, and two disjoint adjective lists you get permutation robustness.
One word from a list of 4096 words is 12 bits. Four such words would be 48 bits.
"A few disjoint lists" are the same thing as one big list if they are used identically in your generation process. There is no difference between "flip a coin, and pick from this 256-word list or that 256-word list according to the result" and "pick from this 512-word list".
If your system is to pick from a 16-word list of adjectives, then another 16-word list of adjectives, and then from one of three 256-word lists of nouns, you are generating codes of 17.6 bits, which is a bit of a downgrade from 48 bits.
This is a really interesting idea. The big question is whether needing to use so many more words is worth it. If your adjectives are simple and memorable enough, then maybe it is.
> big red plate, small fuzzy ball, wavy green brick
With three disjoint short lists, and two disjoint adjective lists you get permutation robustness.