Finding signal on Twitter is more difficult than it used to be. We curate the best tweets on topics like AI, startups, and product development every weekday so you can focus on what matters.

Space Data Centers and GPU Failure Rates for Training

Adaption Labs CEO @sarahookr says data centers in space are "pretty bonkers", and explains what the current consensus gets wrong: "Most co-located hardware is pretty much for training. I think that's why you care [about Space], otherwise you can distribute. Inference compute, which is where everything's moving towards, you can spread that compute more easily. You can have multiple data centers [on Earth]." "So if you care about space, you probably only care about training compute. I think people underestimate the amount of failures that happen, and you don't want to get your training job interrupted." "The real issue is that GPUs still have failure rates. The 2% of GPUs that are just considered done every year, that's really your cost. It's how quickly you can replace those [in space] and what that looks like."

Video thumbnail
View
12
1
0
5

Topics

Read the stories that matter.

Save hours a day in 5 minutes