2025-12-17

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
AIE4ML: An End-to-End Framework for Compiling Neural Networks for the Next Generation of AMD AI Engines	Dimitrios Danopoulos, Enrico Lupi, Chang Sun, Sebastian Dittmeier, Michael Kagan, Vladimir Loncar, Maurizio Pierini	2025-12-17	下载	Efficient AI inference on AMD's Versal AI Engine (AIE) is challenging due to tightly coupled VLIW execution, explicit datapaths, and local memory management.
Low-Latency FPGA Control System for Real-Time Neural Network Processing in CCD-Based Trapped-Ion Qubit Measurement	Binglei Lou, Gautham Duddi Krishnaswaroop, Filip Wojcicki, Ruilin Wu, Richard Rademacher, Zhiqiang Que, Wayne Luk, Philip H. W. Leong	2025-12-17	下载	Accurate and low-latency qubit state measurement is critical for trapped-ion quantum computing. While deep neural networks (DNNs) have been integrated to enhance detection fidelity, their latency perf...
A High-level Synthesis Toolchain for the Julia Language	Benedict Short, Ian McInerney, John Wickerson	2025-12-17	下载	With the push towards Exascale computing and data-driven methods, problem sizes have increased dramatically, increasing the computational requirements of the underlying algorithms.
Workload Characterization for Branch Predictability	FNU Vikas, Paul Gratz, Daniel Jiménez	2025-12-17	下载	Conditional branch prediction predicts the likely direction of a conditional branch instruction to support ILP extraction. Branch prediction is a pattern recognition problem that learns mappings betwe...
FAME: FPGA Acceleration of Secure Matrix Multiplication with Homomorphic Encryption	Zhihan Xu, Rajgopal Kannan, Viktor K. Prasanna	2025-12-17	下载	Homomorphic Encryption (HE) enables secure computation on encrypted data, addressing privacy concerns in cloud computing. However, the high computational cost of HE operations, particularly matrix mul...
Implementation and Analysis of Thermometer Encoding in DWN FPGA Accelerators	Michael Mecik, Martin Kumm	2025-12-17	下载	Fully parallel neural network accelerators on field-programmable gate arrays (FPGAs) offer high throughput for latency-critical applications but face hardware resource constraints.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
LOG.io: Unified Rollback Recovery and Data Lineage Capture for Distributed Data Pipelines	Eric Simon, Renato B. Hoffmann, Lucas Alf, Dalvan Griebler	2025-12-17	下载	This paper introduces LOG.io, a comprehensive solution designed for correct rollback recovery and fine-grain data lineage capture in distributed data pipelines.
Private Virtual Tree Networks for Secure Multi-Tenant Environments Based on the VIRGO Overlay Network	Lican Huang	2025-12-17	下载	Hierarchical organization is a fundamental structure in real-world society, where authority and responsibility are delegated from managers to subordinates.
Dynamic Rebatching for Efficient Early-Exit Inference with DREX	Xuting Liu, Daniel Alexander, Siva Kesava Reddy Kakarla, Behnaz Arzani, Vincent Liu	2025-12-17	下载	Early-Exit (EE) is a Large Language Model (LLM) architecture that accelerates inference by allowing easier tokens to be generated using only a subset of the model's layers.
Optimizing Agentic Language Model Inference via Speculative Tool Calls	Daniel Nichols, Prajwal Singhania, Charles Jekel, Abhinav Bhatele, Harshitha Menon	2025-12-17	下载	Language models (LMs) are becoming increasingly dependent on external tools. LM-based agentic frameworks frequently interact with their environment via such tools to search files, run code, call APIs,...
LeaseGuard: Raft Leases Done Right	A. Jesse Jiryu Davis, Murat Demirbas, Lingzhi Deng	2025-12-17	下载	Raft is a leading consensus algorithm for replicating writes in distributed databases. However, distributed databases also require consistent reads.
Optimizing Bloom Filters for Modern GPU Architectures	Daniel Jünger, Kevin Kristensen, Yunsong Wang, Xiangyao Yu, Bertil Schmidt	2025-12-17	下载	Bloom filters are a fundamental data structure for approximate membership queries, with applications ranging from data analytics to databases and genomics.
TL: Automatic End-to-End Compiler of Tile-Based Languages for Spatial Dataflow Architectures	Wei Li, Zhenyu Bai, Heru Wang, Pranav Dangi, Zhiqiang Zhang, Cheng Tan, Huiying Lan, Weng-Fai Wong, Tulika Mitra	2025-12-17	下载	Spatial dataflow accelerators are a promising direction for next-generation computer systems because they can reduce the memory bottlenecks of traditional von Neumann machines such as CPUs and GPUs.
LLMQ: Efficient Lower-Precision Pretraining for Consumer GPUs	Erik Schultheis, Dan Alistarh	2025-12-17	下载	We present LLMQ, an end-to-end CUDA/C++ implementation for medium-sized language-model training, e.g. 3B to 32B parameters, on affordable, commodity GPUs.
Reexamining Paradigms of End-to-End Data Movement	Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya	2025-12-17	下载	The pursuit of high-performance data transfer often focuses on raw network bandwidth, where international links of 100 Gbps or higher are frequently considered the primary enabler.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Deep Reinforcement Learning for EH-Enabled Cognitive-IoT Under Jamming Attacks	Nadia Abdolkhani, Nada Abdel Khalek, Walaa Hamouda	2025-12-17	下载	In the evolving landscape of the Internet of Things (IoT), integrating cognitive radio (CR) has become a practical solution to address the challenge of spectrum scarcity, leading to the development of...
Attention in Motion: Secure Platooning via Transformer-based Misbehavior Detection	Konstantinos Kalogiannis, Ahmed Mohamed Hussain, Hexu Li, Panos Papadimitratos	2025-12-17	下载	Vehicular platooning promises transformative improvements in transportation efficiency and safety through the coordination of multi-vehicle formations enabled by Vehicle-to-Everything (V2X) communicat...
GenAI-enabled Residual Motion Estimation for Energy-Efficient Semantic Video Communication	Shavbo Salehi, Pedro Enrique Iturria-Rivera, Medhat Elsayed, Majid Bavand, Yigit Ozcan, Melike Erol-Kantarci	2025-12-17	下载	Semantic communication addresses the limitations of the Shannon paradigm by focusing on transmitting meaning rather than exact representations, thereby reducing unnecessary resource consumption.
Packet-Level Traffic Modeling with Heavy-Tailed Payload and Inter-Arrival Distributions for Digital Twins	Enes Koktas, Peter Rost	2025-12-17	下载	Digital twins of radio access networks require packet-level traffic generators that reproduce the size and timing of packets while remaining compact and easy to recalibrate as traffic changes.
DNS-based dynamic context resolution for SCHC	Antoine Bernard, Sandoche Balakrichenan, Michel Marot, Benoit Ampeau	2025-12-17	下载	LPWANs are networks characterised by the scarcity of their radio resources and their limited payload size. LoRaWAN offers an open, easy-to-deploy and efficient solution to operate a long-range network...
More Capacity from Less Spectrum: Tapping into Optical-layer Intelligence in Optical Computing-Communication Integrated Network	Dao Thanh Hai, Shuo Li, Isaac Woungang	2025-12-17	下载	Driven by massive investments and consequently significant progresses in optical computing and all-optical signal processing technologies lately, this paper presents a new architectural paradigm for n...
UAV-enabled Computing Power Networks: Task Completion Probability Analysis	Yiqin Deng, Zhengru Fang, Senkang Hu, Yanan Ma, Haixia Zhang, Yuguang Fang	2025-12-17	下载	This paper presents an innovative framework that synergistically enhances computing performance through ubiquitous computing power distribution and dynamic computing node accessibility control via ada...
Deep Reinforcement Learning for Joint Time and Power Management in SWIPT-EH CIoT	Nadia Abdolkhani, Nada Abdel Khalek, Walaa Hamouda, Iyad Dayoub	2025-12-17	下载	This letter presents a novel deep reinforcement learning (DRL) approach for joint time allocation and power control in a cognitive Internet of Things (CIoT) system with simultaneous wireless informati...
Agentic AI for Integrated Sensing and Communication: Analysis, Framework, and Case Study	Wenwen Xie, Geng Sun, Ruichen Zhang, Xuejie Liu, Yinqiu Liu, Jiacheng Wang, Dusit Niyato, Ping Zhang	2025-12-17	下载	Integrated sensing and communication (ISAC) has emerged as a key development direction in the sixth-generation (6G) era, which provides essential support for the collaborative sensing and communicatio...
Reexamining Paradigms of End-to-End Data Movement	Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya	2025-12-17	下载	The pursuit of high-performance data transfer often focuses on raw network bandwidth, where international links of 100 Gbps or higher are frequently considered the primary enabler.

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Reexamining Paradigms of End-to-End Data Movement	Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya	2025-12-17	下载	The pursuit of high-performance data transfer often focuses on raw network bandwidth, where international links of 100 Gbps or higher are frequently considered the primary enabler.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Optimizing Agentic Language Model Inference via Speculative Tool Calls	Daniel Nichols, Prajwal Singhania, Charles Jekel, Abhinav Bhatele, Harshitha Menon	2025-12-17	下载	Language models (LMs) are becoming increasingly dependent on external tools. LM-based agentic frameworks frequently interact with their environment via such tools to search files, run code, call APIs,...
Reexamining Paradigms of End-to-End Data Movement	Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya	2025-12-17	下载	The pursuit of high-performance data transfer often focuses on raw network bandwidth, where international links of 100 Gbps or higher are frequently considered the primary enabler.