Appearance
2024-10-06
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective | Jinhao Li, Jiaming Xu, Shan Huang, Yonghua Chen, Wen Li, Jun Liu, Yaoxiu Lian, Jiayi Pan, Li Ding, Hao Zhou, Yu Wang, Guohao Dai | 2024-10-06 | 下载 | Large Language Models (LLMs) have demonstrated remarkable capabilities across various fields, from natural language understanding to text generation. |
| IMAGine: An In-Memory Accelerated GEMV Engine Overlay | MD Arafat Kabir, Tendayi Kamucheka, Nathaniel Fredricks, Joel Mandebi, Jason Bakos, Miaoqing Huang, David Andrews | 2024-10-06 | 下载 | Processor-in-Memory (PIM) overlays and new redesigned reconfigurable tile fabrics have been proposed to eliminate the von Neumann bottleneck and enable processing performance to scale with BRAM capaci... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach | Divya Jyoti Bajpai, Manjesh Kumar Hanawal | 2024-10-06 | 下载 | Recent advances in Deep Neural Networks (DNNs) have demonstrated outstanding performance across various domains. However, their large size is a challenge for deployment on resource-constrained devices... |
| CONFINE: Preserving Data Secrecy in Decentralized Process Mining | Valerio Goretti, Davide Basile, Luca Barbaro, Claudio Di Ciccio | 2024-10-06 | 下载 | In the contemporary business landscape, collaboration across multiple organizations offers a multitude of opportunities, including reduced operational costs, enhanced performance, and accelerated tech... |
| Multi Armed Bandit Algorithms Based Virtual Machine Allocation Policy for Security in Multi-Tenant Distributed Systems | Pravin Patil, Geetanjali Kale, Tanmay Karmarkar, Ruturaj Ghatage | 2024-10-06 | 下载 | This work proposes a secure and dynamic VM allocation strategy for multi-tenant distributed systems using the Thompson sampling approach. The method proves more effective and secure compared to epsilo... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Consistent and Repeatable Testing of mMIMO O-RU across labs: A Japan-Singapore Experience | Thanh-Tam Nguyen, Mao V. Ngo, Binbin Chen, Mitsuhiro Kuchitsu, Serena Wai, Seitaro Kawai, Kenya Suzuki, Eng Wei Koo, Tony Quek | 2024-10-06 | 下载 | Open Radio Access Networks (RAN) aim to bring a paradigm shift to telecommunications industry, by enabling an open, intelligent, virtualized, and multi-vendor interoperable RAN ecosystem. |
| Consistent and Repeatable Testing of O-RAN Distributed Unit (O-DU) across Continents | Tuan V. Ngo, Mao V. Ngo, Binbin Chen, Gabriele Gemmi, Eduardo Baena, Michele Polese, Tommaso Melodia, William Chien, Tony Quek | 2024-10-06 | 下载 | Open Radio Access Networks (O-RAN) are expected to revolutionize the telecommunications industry with benefits like cost reduction, vendor diversity, and improved network performance through AI optimi... |
| Large Language Models for Knowledge-Free Network Management: Feasibility Study and Opportunities | Hoon Lee, Mintae Kim, Seunghwan Baek, Namyoon Lee, Merouane Debbah, Inkyu Lee | 2024-10-06 | 下载 | Traditional network management algorithms have relied on prior knowledge of system models and networking scenarios. In practice, a universal optimization framework is desirable where a sole optimizati... |