Oh yeah, this whole thing is very, very convenient for Nvidia and data center companies lmao.
I still think they're incentivized to make the models more efficient, since they could then squeeze out even more profit. It's just a property of the technology itself that it doesn't really work well until you have bajillions of parameters.
Are these percentages referring to total biomass or population count?