I’m surprised, but not surprised, by the Chinese efficiency with compute. They have found a way to optimize system level inference prompting with Nvidia GPU’s. Finding efficiency in prompting is much easier than finding efficiency in training. If the frontier labs can find the inference strategy that the Chinese are using, then their models can be unreachable.
I’m surprised, but not surprised, by the Chinese efficiency with compute. They have found a way to optimize system level inference prompting with Nvidia GPU’s. Finding efficiency in prompting is much easier than finding efficiency in training. If the frontier labs can find the inference strategy that the Chinese are using, then their models can be unreachable.