I can't imagine they are. What would the training data of those models be? Why would you train the model when the user sent a request? Why would you wait responding to the request until the model is trained?
Sure, but that's not done with the kind of model this thread is about (separate training and inference). You're talking about classical ML models with continuous updates, which you wouldn't run on this kind of GPU infrastructure.
Sucks to wait for the sun to come out to make Bing answer though. "Disclaimer: Answer dependent on cloud cover or night time".
Do you seriously think that Bing trains an AI model when you send a request? Why would they do that?
Oh, they're working on it. It's dumb, but it's happening.
I can't imagine they are. What would the training data of those models be? Why would you train the model when the user sent a request? Why would you wait responding to the request until the model is trained?
Often, these models are a feedback loop. The input from one search query is itself training data that affects the result of the next query.
Sure, but that's not done with the kind of model this thread is about (separate training and inference). You're talking about classical ML models with continuous updates, which you wouldn't run on this kind of GPU infrastructure.
You'd get used to it awfully fast though.