Enhancing LLM Inference Performance on ARM CPUs Through Software and Hardware Co-Optimization Strategies

CHENG ZHANG, XINGYU ZHU, LONGHAO CHEN, TINGJIE YANG, EVENS PAN, GUOSHENG YU, YANG ZHAO, XIGUANG WU, BO LI, WEI MAO, GENQUAN HAN

Integrated Circuits and Systems ›› 2025, Vol. 2 ›› Issue (2) : 49-57.

PDF(4179 KB)
PDF(4179 KB)
Integrated Circuits and Systems ›› 2025, Vol. 2 ›› Issue (2) : 49-57. DOI: 10.23919/ICS.2025.3568404
Co-Optimization for Large Language Models: Advances in Algorithm and Hardware

Enhancing LLM Inference Performance on ARM CPUs Through Software and Hardware Co-Optimization Strategies

    {{javascript:window.custom_author_en_index=0;}}
  • {{article.zuoZhe_EN}}
Author information +
History +

HeighLight

{{article.keyPoints_en}}

Abstract

{{article.zhaiyao_en}}

Key words

QR code of this article

Cite this article

Download Citations
{{article.zuoZheEn_L}}. {{article.title_en}}[J]. {{journal.qiKanMingCheng_EN}}, 2025, 2(2): 49-57 https://doi.org/10.23919/ICS.2025.3568404

References

References

{{article.reference}}

Funding

RIGHTS & PERMISSIONS

{{article.copyrightStatement_en}}
{{article.copyrightLicense_en}}
PDF(4179 KB)

Accesses

Citation

Detail

Sections
Recommended

/