The 2nd order effect that not a lot of people talk about is price: the fact that...

samuelknight · 2025-09-29T13:24:28 1759152268

You are vastly underestimating the price decline. To cherry-pick one article: in the first two years since GPT-3.5, inference price for the same amount of intelligence has decreased 10x per year according to a study by Andreessen Horowitz https://a16z.com/llmflation-llm-inference-cost/. So even in a stark slowdown scenario (about 4x per year instead of 10x), we could still see a 1000x decrease in the next 5 years.
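To make the compounding concrete, here is a rough sketch of the arithmetic. The function names are made up for illustration, and the 10x and 1000x figures are just the ones quoted above, not independent data.

```python
def cumulative_decline(factor_per_year: float, years: float) -> float:
    """Total price reduction factor if the price drops by factor_per_year every year."""
    return factor_per_year ** years


def implied_annual_factor(total_factor: float, years: float) -> float:
    """Constant yearly reduction factor that compounds to total_factor over the given years."""
    return total_factor ** (1.0 / years)


# If the 10x/year trend simply continued for 5 years:
print(cumulative_decline(10, 5))  # 100000

# Yearly rate implied by "1000x in 5 years":
print(round(implied_annual_factor(1000, 5), 2))  # 3.98
```

So 1000x over 5 years only needs roughly 4x per year, which is well below the 10x per year pace cited in the article.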

Price deflation is not tied to Moore's law right now because much of the performance gain comes from model optimization, high-bandwidth memory supply chains, and electrical capacity buildout, not FLOP density.

awongh · 2025-09-29T13:31:46 1759152706

True! I just know that model optimization gains are much less guaranteed than say, FLOP density, even though model optimization has so far provided way more gains than hardware advancements.

Part of me is optimistic that when the AI bubble bursts the excess data center capacity is going to be another force driving the cost of inference down.

naasking · 2025-09-29T18:59:09 1759172349

> I just know that model optimization gains are much less guaranteed than say, FLOP density, even though model optimization has so far provided way more gains than hardware advancements.

Performance gained from model improvements has outpaced performance gained from hardware improvements for decades.

NemoNobody · 2025-09-29T16:56:04 1759164964

Haha, I love how delusional everyone is about AI.

Yeppers, when that bubble bursts - that's hilarious. This is the kinda stuff grandkids won't believe someday.

throwaway314155 · 2025-09-29T20:12:08 1759176728

> has decreased 10x per year according to a study by Andreessen Horowitz

I believe you but that's not exactly an unbiased source of information.

Alex_1729 · 2025-09-30T08:08:00 1759219680

We are heading into the future of very low-cost AI inference. It's a good thing, and expected.