You must log in or register to comment.
slash inference costs by doing a bunch of useless inferences in the hope that the one the user actually wanted happened to be one of them.
It cannot be more efficient than just waiting for the input and inferring once based on that.
i mean…
your brain essentially does this it’s just that compute and memory are one system and it is as physically optimized as possible in brain systems.
this strategy is less stupid than it sounds if you abandon von neumann purism imo
it can be more efficient than just waiting for the input and inferring once based on that… you are an example of this in real life.
faster != more efficient. And you cannot compare brains to computers. Speculative execution improves speed in the cpu, at the cost of efficiency