linky (known as griftgate lmao)
text of tweet
The Stargate Project is a new company which intends to invest $500 billion over the next four years building new AI infrastructure for OpenAI in the United States. We will begin deploying $100 billion immediately. This infrastructure will secure American leadership in AI, create hundreds of thousands of American jobs, and generate massive economic benefit for the entire world. This project will not only support the re-industrialization of the United States but also provide a strategic capability to protect the national security of America and its allies.
The initial equity funders in Stargate are SoftBank, OpenAI, Oracle, and MGX. SoftBank and OpenAI are the lead partners for Stargate, with SoftBank having financial responsibility and OpenAI having operational responsibility. Masayoshi Son will be the chairman.
Arm, Microsoft, NVIDIA, Oracle, and OpenAI are the key initial technology partners. The buildout is currently underway, starting in Texas, and we are evaluating potential sites across the country for more campuses as we finalize definitive agreements.
As part of Stargate, Oracle, NVIDIA, and OpenAI will closely collaborate to build and operate this computing system. This builds on a deep collaboration between OpenAI and NVIDIA going back to 2016 and a newer partnership between OpenAI and Oracle.
This also builds on the existing OpenAI partnership with Microsoft. OpenAI will continue to increase its consumption of Azure as OpenAI continues its work with Microsoft with this additional compute to train leading models and deliver great products and services.
All of us look forward to continuing to build and develop AI—and in particular AGI—for the benefit of all of humanity. We believe that this new step is critical on the path, and will enable creative people to figure out how to use AI to elevate humanity.
At current prices of money, this project has to return 20 billion in profit, and likely operate on 50 billion revenue at least to cover the energy costs
Lmao literally the paper in the link you shared says (emphasis mine) “DeepSeek- R1 achieves performance comparable to OpenAI-o1-1217 on reasoning tasks” next to a graph that shows that the benchmark performance is roughly the same as OpenAI – marginally better on some benchmarks and marginally worse on others.
It is the same stuff as “Google’s botched AI” but now partially open-sourced and slightly more efficient, the latter being the reason why it “upended the entire industry”.
Yes, comparable performance to a multi billion dollar company for a fraction of the cost for training and inference. It cost $5 million to train their prior model. LLama 3 took an estimated 750 million to 1 billion to train and it’s not as good.
These reductions in costs must be exciting to investors (you know, the 60 year old dipshits from the original comment you replied to) but they do not substantiate your claim that “this tech is legit and currently improving at an alarming rate” or any other beautiful epithet you used. LLMs remain hallucinating state machines with a probability attached to any word they spit out, and marginal efficiency improvements don’t make them any more exciting to the end user.
Cost means energy use in addition to hardware, so it should matter to climate concerned folks, not just investors.
Also these reasoning models are way less likely to hallucinate given time to think (test time compute). Think applications where it matters like augmenting or replacing existing labor, which will start to happen more.
The thing is, the ‘efficiency’ of AI isn’t the problem. See Jevons Paradox. Much like our history with fossil fuels, as energy efficiency goes up, total energy consumed also goes up (because you get more for your buck now).
So the more efficient AI becomes, the more energy we will waste on pointless shit. I actually agree that anti-LLM sentiment is overly high on the left, but that argument has to be made that it has very useful applications for very specific circumstances, not that “it’s getting better and that will solve all the problems”. Because it getting better is actually also a problem in many ways.
I’m still convinced that these models will never be more useful than they are now. I sometimes use them to help with coding but honestly I could easily live without them, none of them can handle complex tasks with good accuracy.
Thinking they will somehow significantly improve on this is techbro shit. I’m certain there’s more to cognition and intelligence than stochastic parroting.