Skip to content

2024-10-06

cs.AR - Architecture

标题作者发布日期PDF摘要
Large Language Model Inference Acceleration: A Comprehensive Hardware PerspectiveJinhao Li, Jiaming Xu, Shan Huang, Yonghua Chen, Wen Li, Jun Liu, Yaoxiu Lian, Jiayi Pan, Li Ding, Hao Zhou, Yu Wang, Guohao Dai2024-10-06下载Large Language Models (LLMs) have demonstrated remarkable capabilities across various fields, from natural language understanding to text generation.
IMAGine: An In-Memory Accelerated GEMV Engine OverlayMD Arafat Kabir, Tendayi Kamucheka, Nathaniel Fredricks, Joel Mandebi, Jason Bakos, Miaoqing Huang, David Andrews2024-10-06下载Processor-in-Memory (PIM) overlays and new redesigned reconfigurable tile fabrics have been proposed to eliminate the von Neumann bottleneck and enable processing performance to scale with BRAM capaci...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering ApproachDivya Jyoti Bajpai, Manjesh Kumar Hanawal2024-10-06下载Recent advances in Deep Neural Networks (DNNs) have demonstrated outstanding performance across various domains. However, their large size is a challenge for deployment on resource-constrained devices...
CONFINE: Preserving Data Secrecy in Decentralized Process MiningValerio Goretti, Davide Basile, Luca Barbaro, Claudio Di Ciccio2024-10-06下载In the contemporary business landscape, collaboration across multiple organizations offers a multitude of opportunities, including reduced operational costs, enhanced performance, and accelerated tech...
Multi Armed Bandit Algorithms Based Virtual Machine Allocation Policy for Security in Multi-Tenant Distributed SystemsPravin Patil, Geetanjali Kale, Tanmay Karmarkar, Ruturaj Ghatage2024-10-06下载This work proposes a secure and dynamic VM allocation strategy for multi-tenant distributed systems using the Thompson sampling approach. The method proves more effective and secure compared to epsilo...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Consistent and Repeatable Testing of mMIMO O-RU across labs: A Japan-Singapore ExperienceThanh-Tam Nguyen, Mao V. Ngo, Binbin Chen, Mitsuhiro Kuchitsu, Serena Wai, Seitaro Kawai, Kenya Suzuki, Eng Wei Koo, Tony Quek2024-10-06下载Open Radio Access Networks (RAN) aim to bring a paradigm shift to telecommunications industry, by enabling an open, intelligent, virtualized, and multi-vendor interoperable RAN ecosystem.
Consistent and Repeatable Testing of O-RAN Distributed Unit (O-DU) across ContinentsTuan V. Ngo, Mao V. Ngo, Binbin Chen, Gabriele Gemmi, Eduardo Baena, Michele Polese, Tommaso Melodia, William Chien, Tony Quek2024-10-06下载Open Radio Access Networks (O-RAN) are expected to revolutionize the telecommunications industry with benefits like cost reduction, vendor diversity, and improved network performance through AI optimi...
Large Language Models for Knowledge-Free Network Management: Feasibility Study and OpportunitiesHoon Lee, Mintae Kim, Seunghwan Baek, Namyoon Lee, Merouane Debbah, Inkyu Lee2024-10-06下载Traditional network management algorithms have relied on prior knowledge of system models and networking scenarios. In practice, a universal optimization framework is desirable where a sole optimizati...

基于 VitePress 构建