Living on the Edge: Generative AI and Large Language Models Will Not Affect Edge Computing Demand All That Much

Generative AI and large language models, as applied to many, if not most applications, will boost needs for advanced processors, electricity and cooling solutions.

But it might not affect demand for edge computing in any significant way. Nor will it have measurable effect on data center or computing architectures, as AI training and inference operations still make most sense at centralized, hyperscale locations.

Nvidia CEO Jensen Huang is quite positive about what routine use of large language models will mean for sales of advanced processors, citing an acceleration in demand for Nvidia processors. Some analysts believe generative AI could add as much as $6 billion in revenue for Nvidia within three years.

Demand driver	Nvidia products that will benefit	Degree of impact
Increased demand for compute power	Datacenter GPUs, AI accelerators	High
Increased demand for data storage	DGX systems, storage solutions	Medium
Increased demand for bandwidth	Networking solutions	Medium
New security risks	Security solutions	Low

Others should benefit as well. Some argue infrastructure to support generative AI could reach $50 billion by 2028, for example. Some note that a single LLM training operation can cost millions of dollars. Those costs are going to be borne by app providers of all sorts.

Data center CxOs might also tend to agree about changes generative AI processing will create.

We already know that generative AI requires lots of computing cycles, which we might quantify as floating point operations per second. The actual impact from a single request depends on the size of the dataset that is interrogated, the complexity of the question or task or the type of query.

Text responses, perhaps ironically, might require an order of magnitude more processing than generating an image. And generation of an image might require an order of magnitude more processing operations than generating a text-to-speech use case.

Model Type	FLOPS Requirements
Large Language Model (LLM)	100+ petaFLOPS
Image Generator	10+ petaFLOPS
Speech Generator	1+ petaFLOPS

So capital investment budgets will likely be altered to add more-more processing power than we presently tend to see. As a background issue, there will be some additional demand for higher connectivity between data centers, data centers and peering points and domains to domains, though it remains unclear how much incremental demand might be generated. It is a non-zero number, but it is hard to quantify at the moment.

Change	Investment response	Needs driving investment
Increased demand for compute power	Investment in more powerful hardware	Generative AI workloads are computationally intensive
Increased demand for data storage	Investment in more storage capacity	Large datasets that are required for AI training and inference operations
Increased demand for bandwidth	Investment in more network capacity	To support the high-speed data transfer that is required for AI applications
New security risks	Investment in security measures	To protect data centers from the growing threat of AI-powered cyberattacks

It also seems likely there could be architectural impact as well. Edge computing makes sense to support lower-latency use cases and applications. It is unclear so far whether the training of large data sets or end user operations will actually drive edge computing very much.

Inference operations might still need to be conducted at large hyperscale centers, not at the edge. For that reason, use of large language models might not actually cause data center architecture to shift to greater reliance on edge computing, for example.

Higher energy requirements will likely lead to newer approaches to cooling, though.

Architecture or design change	Possible actions by data center operators	Magnitude of capex impact
Increased demand for compute and storage resources	Deploy more powerful servers and storage systems	High
Increased need for cooling and power	Upgrade cooling and power infrastructure	High
New security and compliance requirements	Implement new security and compliance measures	Medium
Changes in data center layout and design	Reconfigure data center layout and design	Low

Living on the Edge

Thursday, April 27, 2023

Generative AI and Large Language Models Will Not Affect Edge Computing Demand All That Much

No comments:

Post a Comment

Popular Posts