2024-12-27
cs.AR - Architecture
| Title | Authors | Published | Abstract |
|---|---|---|---|
| HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models | Ze Yang, Yihong Jin, Xinhe Xu | 2024-12-27 | Large Language Models (LLMs) have revolutionized natural language processing by understanding and generating human-like text. However, the increasing demand for more sophisticated LLMs presents signif... |
| Non-interfering On-line and In-field SoC Testing | Tobias Strauch | 2024-12-27 | With increasing aging problems of advanced technologies, in-field testing becomes an inevitable challenge, on top of the already demanding requirements, such as the ISO26262 for automotive safety. |
| IMAGINE: An 8-to-1b 22nm FD-SOI Compute-In-Memory CNN Accelerator With an End-to-End Analog Charge-Based 0.15-8POPS/W Macro Featuring Distribution-Aware Data Reshaping | Adrian Kneip, Martin Lefebvre, Pol Maistriaux, David Bol | 2024-12-27 | Charge-domain compute-in-memory (CIM) SRAMs have recently become an enticing compromise between computing efficiency and accuracy to process sub-8b convolutional neural networks (CNNs) at the edge. |
| ATiM: Autotuning Tensor Programs for Processing-in-DRAM | Yongwon Shin, Dookyung Kang, Hyojin Sung | 2024-12-27 | Processing-in-DRAM (DRAM-PIM) has emerged as a promising technology for accelerating memory-intensive operations in modern applications, such as Large Language Models (LLMs). |
| A Fully Hardware Implemented Accelerator Design in ReRAM Analog Computing without ADCs | Peng Dang, Huawei Li, Wei Wang | 2024-12-27 | Emerging ReRAM-based accelerators process neural networks via analog Computing-in-Memory (CiM) for ultra-high energy efficiency. However, significant overhead in peripheral circuits and complex nonlin... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Abstract |
|---|---|---|---|
| Geometric Freeze-Tag Problem | Sharareh Alipour, Kajal Baghestani, Mahdis Mirzaei, Soroush Sahraei | 2024-12-27 | We study the Freeze-Tag Problem (FTP), introduced by Arkin et al. (SODA'02), where the goal is to wake up a group of robots, starting from a single active robot. |
| Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang, Ziyi Yin, Quanzeng You, Lingjuan Lyu, Fenglong Ma | 2024-12-27 | Geographic health disparities pose a pressing global challenge, particularly in underserved regions of low- and middle-income nations. Addressing this issue requires a collaborative approach to enhanc... |
| Distributed Download from an External Data Source in Faulty Majority Settings | John Augustine, Soumyottam Chatterjee, Valerie King, Manish Kumar, Shachar Meir, David Peleg | 2024-12-27 | We extend the study of retrieval problems in distributed networks, focusing on improving the efficiency and resilience of protocols in the *Data Retrieval (DR) Model*. |
| Adrenaline: Adaptive Rendering Optimization System for Scalable Cloud Gaming | Jin Heo, Ketan Bhardwaj, Ada Gavrilovska | 2024-12-27 | Cloud gaming requires a low-latency network connection, making it a prime candidate for being hosted at the network edge. However, an edge server is provisioned with a fixed compute capacity, causing ... |
| A Survey on Large Language Model Acceleration based on KV Cache Management | Haoyang Li, Yiming Li, Anxin Tian, Tianhao Tang, Zhanchao Xu, Xuejia Chen, Nicole Hu, Wei Dong, Qing Li, Lei Chen | 2024-12-27 | Large Language Models (LLMs) have revolutionized a wide range of domains such as natural language processing, computer vision, and multi-modal tasks due to their ability to comprehend context and perf... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Abstract |
|---|---|---|---|
| Adaptive Context-Aware Multi-Path Transmission Control for VR/AR Content: A Deep Reinforcement Learning Approach | Shakil Ahmed, Saifur Rahman Sabuj, Ashfaq Khokhar | 2024-12-27 | This paper introduces the Adaptive Context-Aware Multi-Path Transmission Control Protocol (ACMPTCP), an efficient approach designed to optimize the performance of Multi-Path Transmission Control Proto... |
| Performance Evaluation of IoT LoRa Networks on Mars Through ns-3 Simulations | Manuele Favero, Alessandro Canova, Marco Giordani, Michele Zorzi | 2024-12-27 | In recent years, there has been a significant surge of interest in Mars exploration, driven by the planet's potential for human settlement and its proximity to Earth. |
| Retrieval-augmented Generation for GenAI-enabled Semantic Communications | Shunpu Tang, Ruichen Zhang, Yuxuan Yan, Qianqian Yang, Dusit Niyato, Xianbin Wang, Shiwen Mao | 2024-12-27 | Semantic communication (SemCom) is an emerging paradigm aiming at transmitting only task-relevant semantic information to the receiver, which can significantly improve communication efficiency. |
| An Overview of Machine Learning-Driven Resource Allocation in IoT Networks | Zhengdong Li | 2024-12-27 | In the wake of disruptive IoT technologies generating massive amounts of diverse data, Machine Learning (ML) will play a crucial role in bringing intelligence to Internet of Things (IoT) networks. |