They should design their own hardware, then. Somehow the other companies seem to be able to produce fast-enough models.
They made a deal with Cerebras for fast inference.