What Is Next for LLMs? Next-Generation AI Computing Hardware Using Photonic Chips

Published: May 9, 2025 | arXiv ID: 2505.05794v1

By: Renjie Li, Wenjie Wei, Qi Xin and more

Potential Business Impact:

Makes AI run much faster and use less power.

Business Areas:

Quantum Computing Science and Engineering

Large language models (LLMs) are rapidly pushing the limits of contemporary computing hardware. For example, training GPT-3 has been estimated to consume around 1300 MWh of electricity, and projections suggest future models may require city-scale (gigawatt) power budgets. These demands motivate exploration of computing paradigms beyond conventional von Neumann architectures. This review surveys emerging photonic hardware optimized for next-generation generative AI computing. We discuss integrated photonic neural network architectures (e.g., Mach-Zehnder interferometer meshes, lasers, wavelength-multiplexed microring resonators) that perform ultrafast matrix operations. We also examine promising alternative neuromorphic devices, including spiking neural network circuits and hybrid spintronic-photonic synapses, which combine memory and processing. The integration of two-dimensional materials (graphene, TMDCs) into silicon photonic platforms is reviewed for tunable modulators and on-chip synaptic elements. Transformer-based LLM architectures (self-attention and feed-forward layers) are analyzed in this context, identifying strategies and challenges for mapping dynamic matrix multiplications onto these novel hardware substrates. We then dissect the mechanisms of mainstream LLMs, such as ChatGPT, DeepSeek, and LLaMA, highlighting their architectural similarities and differences. We synthesize state-of-the-art components, algorithms, and integration methods, highlighting key advances and open issues in scaling such systems to mega-sized LLM models. We find that photonic computing systems could potentially surpass electronic processors by orders of magnitude in throughput and energy efficiency, but require breakthroughs in memory, especially for long-context windows and long token sequences, and in storage of ultra-large datasets.
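As a rough illustration of how a Mach-Zehnder interferometer (MZI) mesh can perform matrix operations, the sketch below models a single 2x2 MZI as two ideal 50:50 beam splitters and two phase shifters. This is a common textbook idealization, not a model taken from the paper: the function names (`mzi`, `phase_shifter`, `apply`, `power`), the beam-splitter convention, and the lossless assumption are all illustrative choices. Because each stage is unitary, the device redistributes optical power between its two ports without losing any; larger meshes of such cells are what implement bigger matrices.

```python
import cmath
import math

# Ideal, lossless 50:50 beam splitter (one common sign convention; an assumption).
S = 1 / math.sqrt(2)
BEAM_SPLITTER = [[S, 1j * S], [1j * S, S]]


def matmul(a, b):
    """Multiply two small matrices given as nested lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def phase_shifter(angle):
    """Phase shift applied to the upper arm only."""
    return [[cmath.exp(1j * angle), 0], [0, 1]]


def mzi(theta, phi):
    """Transfer matrix B * P(theta) * B * P(phi) of one hypothetical MZI cell."""
    m = phase_shifter(phi)                # external phase at the input
    m = matmul(BEAM_SPLITTER, m)          # first beam splitter
    m = matmul(phase_shifter(theta), m)   # internal phase sets the splitting ratio
    m = matmul(BEAM_SPLITTER, m)          # second beam splitter
    return m


def apply(matrix, vec):
    """Matrix-vector product: optical amplitudes in, amplitudes out."""
    return [sum(row[k] * vec[k] for k in range(len(vec))) for row in matrix]


def power(vec):
    """Total optical power carried by a vector of complex amplitudes."""
    return sum(abs(x) ** 2 for x in vec)


if __name__ == "__main__":
    out = apply(mzi(0.7, 1.1), [0.6, 0.8j])
    print("output amplitudes:", out)
    print("total output power:", power(out))
```

In this idealized model, `theta = 0` routes all light from the upper input to the lower output, and `theta = math.pi` keeps it in the upper port; intermediate values split the power between the two.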

Energy-Aware LLMs: A step towards sustainable AI for downstream applications

Performance

Saves energy while making AI smarter.

22 Mar 2025

89%

Implementation of transformer-based LLMs with large-scale optoelectronic neurons on a CMOS image sensor platform

Emerging Technologies

Makes AI run much faster and use less power.

6 Nov 2025

88%

A 262 TOPS Hyperdimensional Photonic AI Accelerator powered by a Si3N4 microcomb laser

Optics

Makes AI faster and use less power.

5 Mar 2025

Country of Origin

🇭🇰 Hong Kong

Page Count

36 pages
