GSI Technology Reports 3-Second Time-to-First-Token for Edge Multimodal LLM Inference on Gemini-II
Using the Gemma-3 12B vision-language model on GSI’s production Gemini-II processor, GSI achieved the 3-second TTFT while consuming approximately 30 watts at the AI sub-system, including the chip. To GSI’s knowledge, this 3-second TTFT at approximately 30 watts at the AI sub-system is the lowest publicly reported result for a multimodal 12B model running on an embedded edge processor.
Independent third-party testing of the same workload on competitive embedded platforms reported TTFT measurements of roughly 12 seconds on Qualcomm Snapdragon X Elite with 30W power, and 3 seconds on NVIDIA Jetson Thor with over 100W power. With performance on par with or superior to competitive platforms at lower power usage levels, GSI concludes that Gemini-II offers a favorable responsiveness and power-efficiency profile for power- and thermally-constrained edge environments.
“These benchmark results highlight what compute-in-memory can enable for physical AI,” said
GSI believes this performance profile is well-suited to “physical AI” markets, including drones, smart city, and other edge systems where workloads are episodic and constrained by battery life, thermal design, and form factor. Faster TTFT at lower chip power can enable more responsive systems, longer duty cycles, and lower total system cost.
Edge physical AI represents a growing segment of AI compute as workloads shift from cloud-assisted models to local inference to improve latency, reliability and operational efficiency. GSI’s proprietary compute-in-memory architecture is designed to reduce data movement, which is a primary contributor to latency and power consumption in conventional architectures.
GSI’s engineering team continues to work on further optimizing Gemini-II’s responsiveness while collaborating with customers and partners, including G2 Tech, on system integration and proof-of-concept activity. Benchmark results are intended to support ongoing evaluation and do not guarantee future commercial outcomes.
ABOUT
Forward-Looking Statements
The statements contained in this press release that are not purely historical are forward-looking statements within the meaning of Section 21E of the Securities Exchange Act of 1934, as amended, including statements regarding GSI Technology’s expectations, beliefs, intentions, strategies, products, market opportunities and prospective customer engagements. All forward-looking statements included in this press release are based upon information available to
GSI Technology’s participation in a proof-of-concept is exploratory in nature and may not result in any commercial contract, extended engagement, or recurring revenue. There can be no assurance that the scope, performance, or findings of any proof-of-concept will meet customer expectations or commercial requirements, or that such activities will lead to further business opportunities, order volume, or deploy-at-scale implementations. Additional risks and uncertainties that could cause actual results to differ materially from those expected or implied include, among others: the preliminary and limited nature of benchmark results; differences in workloads, configurations, measurement boundaries, and methodologies that can materially affect TTFT and power measurements; variability in model architectures, versions and toolchains that may impact performance; the pace and extent of adoption of “physical AI” at the edge and the impact of safety, privacy, and security requirements; supply-chain constraints affecting semiconductors, components, or manufacturing partners; GSI Technology’s historical dependence on sales to a limited number of customers and fluctuations in the mix of customers and products in any period; global public health crises that reduce economic activity; the rapidly evolving markets for its products and uncertainty regarding the development of these markets; the need to develop and introduce new products to offset the historical decline in the average unit selling price of its products; intensive competition; the continued availability of government funding opportunities; delays or unanticipated costs that may be encountered in the development of new products based on its in-place associative computing technology and the establishment of new markets and customer and partner relationships for the sale of such products; and delays or unexpected challenges related to the establishment of customer relationships and orders for its radiation-hardened and tolerant SRAM products. Many of these risks are currently amplified by and will continue to be amplified by, or in the future may be amplified by, economic and geopolitical conditions, such as changing interest rates, worldwide inflationary pressures, policy unpredictability, the imposition of tariffs, export controls and other trade barriers, military conflicts, particularly in relation to
Source:
Contacts:
Investor Relations
Hayden IR
541-904-5075
Kim@HaydenIR.com
Media Relations
415-348-2724
gsi@finnpartners.com
Company
Chief Financial Officer
408-331-9802
Source: GSI Technology, Inc.
