Skip to content

2024-12-27

cs.AR - Architecture

标题作者发布日期PDF摘要
HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language ModelsZe Yang, Yihong Jin, Xinhe Xu2024-12-27下载Large Language Models (LLMs) have revolutionized natural language processing by understanding and generating human-like text. However, the increasing demand for more sophisticated LLMs presents signif...
Non-interfering On-line and In-field SoC TestingTobias Strauch2024-12-27下载With increasing aging problems of advanced technologies, in-field testing becomes an inevitable challenge, on top of the already demanding requirements, such as the ISO26262 for automotive safety.
IMAGINE: An 8-to-1b 22nm FD-SOI Compute-In-Memory CNN Accelerator With an End-to-End Analog Charge-Based 0.15-8POPS/W Macro Featuring Distribution-Aware Data ReshapingAdrian Kneip, Martin Lefebvre, Pol Maistriaux, David Bol2024-12-27下载Charge-domain compute-in-memory (CIM) SRAMs have recently become an enticing compromise between computing efficiency and accuracy to process sub-8b convolutional neural networks (CNNs) at the edge.
ATiM: Autotuning Tensor Programs for Processing-in-DRAMYongwon Shin, Dookyung Kang, Hyojin Sung2024-12-27下载Processing-in-DRAM (DRAM-PIM) has emerged as a promising technology for accelerating memory-intensive operations in modern applications, such as Large Language Models (LLMs).
A Fully Hardware Implemented Accelerator Design in ReRAM Analog Computing without ADCsPeng Dang, Huawei Li, Wei Wang2024-12-27下载Emerging ReRAM-based accelerators process neural networks via analog Computing-in-Memory (CiM) for ultra-high energy efficiency. However, significant overhead in peripheral circuits and complex nonlin...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Geometric Freeze-Tag ProblemSharareh Alipour, Kajal Baghestani, Mahdis Mirzaei, Soroush Sahraei2024-12-27下载We study the Freeze-Tag Problem (FTP), introduced by Arkin et al. (SODA'02), where the goal is to wake up a group of nn robots, starting from a single active robot.
Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical DiagnosisJiaqi Wang, Ziyi Yin, Quanzeng You, Lingjuan Lyu, Fenglong Ma2024-12-27下载Geographic health disparities pose a pressing global challenge, particularly in underserved regions of low- and middle-income nations. Addressing this issue requires a collaborative approach to enhanc...
Distributed Download from an External Data Source in Faulty Majority SettingsJohn Augustine, Soumyottam Chatterjee, Valerie King, Manish Kumar, Shachar Meir, David Peleg2024-12-27下载We extend the study of retrieval problems in distributed networks, focusing on improving the efficiency and resilience of protocols in the \emph{Data Retrieval (DR) Model}.
Adrenaline: Adaptive Rendering Optimization System for Scalable Cloud GamingJin Heo, Ketan Bhardwaj, Ada Gavrilovska2024-12-27下载Cloud gaming requires a low-latency network connection, making it a prime candidate for being hosted at the network edge. However, an edge server is provisioned with a fixed compute capacity, causing ...
A Survey on Large Language Model Acceleration based on KV Cache ManagementHaoyang Li, Yiming Li, Anxin Tian, Tianhao Tang, Zhanchao Xu, Xuejia Chen, Nicole Hu, Wei Dong, Qing Li, Lei Chen2024-12-27下载Large Language Models (LLMs) have revolutionized a wide range of domains such as natural language processing, computer vision, and multi-modal tasks due to their ability to comprehend context and perf...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Adaptive Context-Aware Multi-Path Transmission Control for VR/AR Content: A Deep Reinforcement Learning ApproachShakil Ahmed, Saifur Rahman Sabuj, Ashfaq Khokhar2024-12-27下载This paper introduces the Adaptive Context-Aware Multi-Path Transmission Control Protocol (ACMPTCP), an efficient approach designed to optimize the performance of Multi-Path Transmission Control Proto...
Performance Evaluation of IoT LoRa Networks on Mars Through ns-3 SimulationsManuele Favero, Alessandro Canova, Marco Giordani, Michele Zorzi2024-12-27下载In recent years, there has been a significant surge of interest in Mars exploration, driven by the planet's potential for human settlement and its proximity to Earth.
Retrieval-augmented Generation for GenAI-enabled Semantic CommunicationsShunpu Tang, Ruichen Zhang, Yuxuan Yan, Qianqian Yang, Dusit Niyato, Xianbin Wang, Shiwen Mao2024-12-27下载Semantic communication (SemCom) is an emerging paradigm aiming at transmitting only task-relevant semantic information to the receiver, which can significantly improve communication efficiency.
An Overview of Machine Learning-Driven Resource Allocation in IoT NetworksZhengdong Li2024-12-27下载In the wake of disruptive IoT technologies generating massive amounts of diverse data, Machine Learning (ML) will play a crucial role in bringing intelligence to Internet of Things (IoT) networks.

基于 VitePress 构建