Nvidia Unveils New AI Chip Set to Cut Cost of Running LLMs

by Mayowa Adebajo · 3 min read
Photo: NVIDIA / Facebook

GPUs have recently become the first-choice chips for large AI models that support generative AI software.

Nvidia has turned up the heat on its competitors in the artificial intelligence (AI) hardware space, unveiling its latest AI chip on Tuesday in a move that may set it further apart from rivals such as AMD, Amazon, and Google.

According to a BBC report, Nvidia currently dominates the AI chip market with a share that tops 80%. That dominance is largely down to its strength in making graphics processing units (GPUs), which have recently become the first-choice chips for the large AI models that support generative AI software, such as OpenAI’s ChatGPT and Google’s Bard.

With huge demand from tech giants, cloud providers, and startups alike, Nvidia’s chips have been in short supply. It is against this backdrop that the company is rolling out its new chip, the GH200.

Nvidia Shares Details About the New GH200 AI Chip

According to the company, the new chip has a GPU similar to that of the H100, currently its most expensive AI chip, but improves on both memory and the processor. The GH200 will come with 141 gigabytes of memory, up from the H100’s 80GB, along with a 72-core ARM central processor, says Nvidia.

Speaking about the new chip at a conference on Tuesday, Nvidia CEO Jensen Huang noted that “the processor is designed for the scale-out of the world’s data centres.” In other words, the company built the chip for large-scale deployment from the outset.

Meanwhile, working with AI models typically involves two stages: training and inference. A model must first be trained extensively, a process that can take months and tie up thousands of GPUs. The trained model is then used in software to generate content, which is the inference stage.
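To make the distinction concrete, below is a minimal, illustrative Python sketch of the inference stage, using the Hugging Face transformers library and the small GPT-2 model as stand-ins (neither is mentioned in the article); the expensive training stage is assumed to have already happened.

```python
# Minimal inference sketch (illustrative; the article does not
# name any specific software or model). The costly training stage
# is assumed done: we simply load an already-trained model.
from transformers import pipeline

# GPT-2 stands in here for a much larger LLM.
generator = pipeline("text-generation", model="gpt2")

# Inference: each call generates content from a prompt. This
# per-request workload is what chips like the GH200 would serve.
output = generator("AI chips are", max_new_tokens=20)
print(output[0]["generated_text"])
```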

Inference, however, is expensive and requires a lot of processing power, and it is this running cost that Nvidia aims to cut with the new GH200, Huang disclosed. Given the chip’s larger memory capacity, it is safe to conclude that it has been designed with inference in mind. Huang notes:

“You can take pretty much any large language model (LLM) you want and put it in this and it will inference like crazy.”
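A rough back-of-the-envelope calculation suggests why the extra memory matters for inference (the model size and precision below are illustrative assumptions, not figures from the article or Nvidia):

```python
# Why memory capacity matters for inference: a model's weights
# must fit in the accelerator's memory. The 70B-parameter size
# and fp16 precision are assumptions chosen for illustration.
params = 70e9          # assumed LLM size: 70 billion parameters
bytes_per_param = 2    # fp16: 2 bytes per parameter

weights_gb = params * bytes_per_param / 1e9
print(f"~{weights_gb:.0f} GB of weights")  # ~140 GB

# ~140 GB fits in the GH200's 141 GB but not the H100's 80 GB,
# so a single GH200 could hold a model that a single H100 could not.
```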

The CEO expects the cost of running inference on LLMs to drop significantly as a result. The GH200 will be available from Nvidia’s distributors in the second quarter of 2024, though Huang projects that the chip may already be available for sampling by the end of this year.

As of publication, Nvidia has yet to announce an official price for the new chip.
