Appearance
2025-12-17
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| AIE4ML: An End-to-End Framework for Compiling Neural Networks for the Next Generation of AMD AI Engines | Dimitrios Danopoulos, Enrico Lupi, Chang Sun, Sebastian Dittmeier, Michael Kagan, Vladimir Loncar, Maurizio Pierini | 2025-12-17 | 下载 | Efficient AI inference on AMD's Versal AI Engine (AIE) is challenging due to tightly coupled VLIW execution, explicit datapaths, and local memory management. |
| Low-Latency FPGA Control System for Real-Time Neural Network Processing in CCD-Based Trapped-Ion Qubit Measurement | Binglei Lou, Gautham Duddi Krishnaswaroop, Filip Wojcicki, Ruilin Wu, Richard Rademacher, Zhiqiang Que, Wayne Luk, Philip H. W. Leong | 2025-12-17 | 下载 | Accurate and low-latency qubit state measurement is critical for trapped-ion quantum computing. While deep neural networks (DNNs) have been integrated to enhance detection fidelity, their latency perf... |
| A High-level Synthesis Toolchain for the Julia Language | Benedict Short, Ian McInerney, John Wickerson | 2025-12-17 | 下载 | With the push towards Exascale computing and data-driven methods, problem sizes have increased dramatically, increasing the computational requirements of the underlying algorithms. |
| Workload Characterization for Branch Predictability | FNU Vikas, Paul Gratz, Daniel Jiménez | 2025-12-17 | 下载 | Conditional branch prediction predicts the likely direction of a conditional branch instruction to support ILP extraction. Branch prediction is a pattern recognition problem that learns mappings betwe... |
| FAME: FPGA Acceleration of Secure Matrix Multiplication with Homomorphic Encryption | Zhihan Xu, Rajgopal Kannan, Viktor K. Prasanna | 2025-12-17 | 下载 | Homomorphic Encryption (HE) enables secure computation on encrypted data, addressing privacy concerns in cloud computing. However, the high computational cost of HE operations, particularly matrix mul... |
| Implementation and Analysis of Thermometer Encoding in DWN FPGA Accelerators | Michael Mecik, Martin Kumm | 2025-12-17 | 下载 | Fully parallel neural network accelerators on field-programmable gate arrays (FPGAs) offer high throughput for latency-critical applications but face hardware resource constraints. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| LOG.io: Unified Rollback Recovery and Data Lineage Capture for Distributed Data Pipelines | Eric Simon, Renato B. Hoffmann, Lucas Alf, Dalvan Griebler | 2025-12-17 | 下载 | This paper introduces LOG.io, a comprehensive solution designed for correct rollback recovery and fine-grain data lineage capture in distributed data pipelines. |
| Private Virtual Tree Networks for Secure Multi-Tenant Environments Based on the VIRGO Overlay Network | Lican Huang | 2025-12-17 | 下载 | Hierarchical organization is a fundamental structure in real-world society, where authority and responsibility are delegated from managers to subordinates. |
| Dynamic Rebatching for Efficient Early-Exit Inference with DREX | Xuting Liu, Daniel Alexander, Siva Kesava Reddy Kakarla, Behnaz Arzani, Vincent Liu | 2025-12-17 | 下载 | Early-Exit (EE) is a Large Language Model (LLM) architecture that accelerates inference by allowing easier tokens to be generated using only a subset of the model's layers. |
| Optimizing Agentic Language Model Inference via Speculative Tool Calls | Daniel Nichols, Prajwal Singhania, Charles Jekel, Abhinav Bhatele, Harshitha Menon | 2025-12-17 | 下载 | Language models (LMs) are becoming increasingly dependent on external tools. LM-based agentic frameworks frequently interact with their environment via such tools to search files, run code, call APIs,... |
| LeaseGuard: Raft Leases Done Right | A. Jesse Jiryu Davis, Murat Demirbas, Lingzhi Deng | 2025-12-17 | 下载 | Raft is a leading consensus algorithm for replicating writes in distributed databases. However, distributed databases also require consistent reads. |
| Optimizing Bloom Filters for Modern GPU Architectures | Daniel Jünger, Kevin Kristensen, Yunsong Wang, Xiangyao Yu, Bertil Schmidt | 2025-12-17 | 下载 | Bloom filters are a fundamental data structure for approximate membership queries, with applications ranging from data analytics to databases and genomics. |
| TL: Automatic End-to-End Compiler of Tile-Based Languages for Spatial Dataflow Architectures | Wei Li, Zhenyu Bai, Heru Wang, Pranav Dangi, Zhiqiang Zhang, Cheng Tan, Huiying Lan, Weng-Fai Wong, Tulika Mitra | 2025-12-17 | 下载 | Spatial dataflow accelerators are a promising direction for next-generation computer systems because they can reduce the memory bottlenecks of traditional von Neumann machines such as CPUs and GPUs. |
| LLMQ: Efficient Lower-Precision Pretraining for Consumer GPUs | Erik Schultheis, Dan Alistarh | 2025-12-17 | 下载 | We present LLMQ, an end-to-end CUDA/C++ implementation for medium-sized language-model training, e.g. 3B to 32B parameters, on affordable, commodity GPUs. |
| Reexamining Paradigms of End-to-End Data Movement | Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya | 2025-12-17 | 下载 | The pursuit of high-performance data transfer often focuses on raw network bandwidth, where international links of 100 Gbps or higher are frequently considered the primary enabler. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Deep Reinforcement Learning for EH-Enabled Cognitive-IoT Under Jamming Attacks | Nadia Abdolkhani, Nada Abdel Khalek, Walaa Hamouda | 2025-12-17 | 下载 | In the evolving landscape of the Internet of Things (IoT), integrating cognitive radio (CR) has become a practical solution to address the challenge of spectrum scarcity, leading to the development of... |
| Attention in Motion: Secure Platooning via Transformer-based Misbehavior Detection | Konstantinos Kalogiannis, Ahmed Mohamed Hussain, Hexu Li, Panos Papadimitratos | 2025-12-17 | 下载 | Vehicular platooning promises transformative improvements in transportation efficiency and safety through the coordination of multi-vehicle formations enabled by Vehicle-to-Everything (V2X) communicat... |
| GenAI-enabled Residual Motion Estimation for Energy-Efficient Semantic Video Communication | Shavbo Salehi, Pedro Enrique Iturria-Rivera, Medhat Elsayed, Majid Bavand, Yigit Ozcan, Melike Erol-Kantarci | 2025-12-17 | 下载 | Semantic communication addresses the limitations of the Shannon paradigm by focusing on transmitting meaning rather than exact representations, thereby reducing unnecessary resource consumption. |
| Packet-Level Traffic Modeling with Heavy-Tailed Payload and Inter-Arrival Distributions for Digital Twins | Enes Koktas, Peter Rost | 2025-12-17 | 下载 | Digital twins of radio access networks require packet-level traffic generators that reproduce the size and timing of packets while remaining compact and easy to recalibrate as traffic changes. |
| DNS-based dynamic context resolution for SCHC | Antoine Bernard, Sandoche Balakrichenan, Michel Marot, Benoit Ampeau | 2025-12-17 | 下载 | LPWANs are networks characterised by the scarcity of their radio resources and their limited payload size. LoRaWAN offers an open, easy-to-deploy and efficient solution to operate a long-range network... |
| More Capacity from Less Spectrum: Tapping into Optical-layer Intelligence in Optical Computing-Communication Integrated Network | Dao Thanh Hai, Shuo Li, Isaac Woungang | 2025-12-17 | 下载 | Driven by massive investments and consequently significant progresses in optical computing and all-optical signal processing technologies lately, this paper presents a new architectural paradigm for n... |
| UAV-enabled Computing Power Networks: Task Completion Probability Analysis | Yiqin Deng, Zhengru Fang, Senkang Hu, Yanan Ma, Haixia Zhang, Yuguang Fang | 2025-12-17 | 下载 | This paper presents an innovative framework that synergistically enhances computing performance through ubiquitous computing power distribution and dynamic computing node accessibility control via ada... |
| Deep Reinforcement Learning for Joint Time and Power Management in SWIPT-EH CIoT | Nadia Abdolkhani, Nada Abdel Khalek, Walaa Hamouda, Iyad Dayoub | 2025-12-17 | 下载 | This letter presents a novel deep reinforcement learning (DRL) approach for joint time allocation and power control in a cognitive Internet of Things (CIoT) system with simultaneous wireless informati... |
| Agentic AI for Integrated Sensing and Communication: Analysis, Framework, and Case Study | Wenwen Xie, Geng Sun, Ruichen Zhang, Xuejie Liu, Yinqiu Liu, Jiacheng Wang, Dusit Niyato, Ping Zhang | 2025-12-17 | 下载 | Integrated sensing and communication (ISAC) has emerged as a key development direction in the sixth-generation (6G) era, which provides essential support for the collaborative sensing and communicatio... |
| Reexamining Paradigms of End-to-End Data Movement | Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya | 2025-12-17 | 下载 | The pursuit of high-performance data transfer often focuses on raw network bandwidth, where international links of 100 Gbps or higher are frequently considered the primary enabler. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Reexamining Paradigms of End-to-End Data Movement | Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya | 2025-12-17 | 下载 | The pursuit of high-performance data transfer often focuses on raw network bandwidth, where international links of 100 Gbps or higher are frequently considered the primary enabler. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Optimizing Agentic Language Model Inference via Speculative Tool Calls | Daniel Nichols, Prajwal Singhania, Charles Jekel, Abhinav Bhatele, Harshitha Menon | 2025-12-17 | 下载 | Language models (LMs) are becoming increasingly dependent on external tools. LM-based agentic frameworks frequently interact with their environment via such tools to search files, run code, call APIs,... |
| Reexamining Paradigms of End-to-End Data Movement | Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya | 2025-12-17 | 下载 | The pursuit of high-performance data transfer often focuses on raw network bandwidth, where international links of 100 Gbps or higher are frequently considered the primary enabler. |