Cost reduction strategies for AI model deployment
Press Space for next Tweet
.@AnkythShukla on the 25x cost trap killing AI products in production: "Most of the PMs want to do is they think that if in the prototype I have used maybe the most advanced model, I should use the same model in the production as well. And then because of cost considerations, the management has to go ahead and pull the plug." "But now there is a good possibility, which is that maybe another model, which is a cheaper model, can go ahead and produce a similar kind of output." "You can see that the best model GPT 5.1, which is mostly going to be used in your prototype because it's very intelligent. The output is $10 per million token. But for maybe a model such as GPT nano, the output is maybe 0.4, right, which is only 40 cents." "You will only get the confidence of using this when you have created the right kind of evaluations." From the @aakashgupta podcast.
Topics
Read the stories that matter.The stories and ideas that actually matter.
Save hours a day in 5 minutesTurn hours of scrolling into a five minute read.