Appearance
2025-05-19
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Genesis: A Compiler Framework for Hamiltonian Simulation on Hybrid CV-DV Quantum Computers | Zihan Chen, Jiakang Li, Minghao Guo, Henry Chen, Zirui Li, Joel Bierman, Yipeng Huang, Huiyang Zhou, Yuan Liu, Eddy Z. Zhang | 2025-05-19 | 下载 | This paper introduces Genesis, the first compiler designed to support Hamiltonian Simulation on hybrid continuous-variable (CV) and discrete-variable (DV) quantum computing systems. |
| Introducing Instruction-Accurate Simulators for Performance Estimation of Autotuning Workloads | Rebecca Pelke, Nils Bosbach, Lennart M. Reimann, Rainer Leupers | 2025-05-19 | 下载 | Accelerating Machine Learning (ML) workloads requires efficient methods due to their large optimization space. Autotuning has emerged as an effective approach for systematically evaluating variations ... |
| MXDOTP: A RISC-V ISA Extension for Enabling Microscaling (MX) Floating-Point Dot Products | Gamze İslamoğlu, Luca Bertaccini, Arpan Suravi Prasad, Francesco Conti, Angelo Garofalo, Luca Benini | 2025-05-19 | 下载 | Fast and energy-efficient low-bitwidth floating-point (FP) arithmetic is essential for Artificial Intelligence (AI) systems. Microscaling (MX) standardized formats have recently emerged as a promising... |
| PIM-malloc: A Fast and Scalable Dynamic Memory Allocator for Processing-In-Memory (PIM) Architectures | Dongjae Lee, Bongjoon Hyun, Youngjin Kwon, Minsoo Rhu | 2025-05-19 | 下载 | The ability to dynamically allocate memory is fundamental in modern programming languages. However, this feature is not adequately supported in current general-purpose PIM devices. |
| Addressing memory bandwidth scalability in vector processors for streaming applications | Jordi Altayo, Paul Delestrac, David Novo, Simey Yang, Debjyoti Bhattacharjee, Francky Catthoor | 2025-05-19 | 下载 | As the size of artificial intelligence and machine learning (AI/ML) models and datasets grows, the memory bandwidth becomes a critical bottleneck. |
| 2T1R Regulated Memristor Conductance Control Array Architecture for Neuromorphic Computing using 28nm CMOS Technology | Neethu Kuriakose, Arun Ashok, Christian Grewing, André Zambanini, Stefan van Waasen | 2025-05-19 | 下载 | Memristors are promising devices for scalable and low power, in-memory computing to improve the energy efficiency of a rising computational demand. |
| FireFly-T: High-Throughput Sparsity Exploitation for Spiking Transformer Acceleration with Dual-Engine Overlay Architecture | Tenglong Li, Jindong Li, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng | 2025-05-19 | 下载 | Spiking transformers are emerging as a promising architecture that combines the energy efficiency of Spiking Neural Networks (SNNs) with the powerful attention mechanisms of transformers. |
| Sandwich: Separating Prefill-Decode Compilation for Efficient CPU LLM Serving | Juntao Zhao, Jiuru Li, Chuan Wu | 2025-05-19 | 下载 | Utilizing CPUs to serve large language models (LLMs) is a resource-friendly alternative to GPU serving. Existing CPU-based solutions ignore workload differences between the prefill and the decode phas... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Eudoxia: a FaaS scheduling simulator for the composable lakehouse | Tapan Srivastava, Jacopo Tagliabue, Ciro Greco | 2025-05-19 | 下载 | Due to the variety of its target use cases and the large API surface area to cover, a data lakehouse (DLH) is a natural candidate for a composable data system. |
| Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference | Shuqing Luo, Pingzhi Li, Jie Peng, Hanrui Wang, Yang, Zhao, Yu, Cao, Yu Cheng, Tianlong Chen | 2025-05-19 | 下载 | Mixture-of-experts (MoE) architectures could achieve impressive computational efficiency with expert parallelism, which relies heavily on all-to-all communication across devices. |
| SVAFD: A Secure and Verifiable Co-Aggregation Protocol for Federated Distillation | Tian Wen, Sheng Sun, Yuwei Wang, Peiyan Chen, Zhiyuan Wu, Min Liu, Bo Gao | 2025-05-19 | 下载 | Secure Aggregation (SA) is an indispensable component of Federated Learning (FL) that concentrates on privacy preservation while allowing for robust aggregation. |
| eBPF-Based Instrumentation for Generalisable Diagnosis of Performance Degradation | Diogo Landau, Jorge Barbosa, Nishant Saurabh | 2025-05-19 | 下载 | Online Data Intensive applications (e.g. message brokers, ML inference and databases) are core components of the modern internet, providing critical functionalities to connecting services. |
| Prink: -Anonymization for Streaming Data in Apache Flink | Philip Groneberg, Saskia Nuñez von Voigt, Thomas Janke, Louis Loechel, Karl Wolf, Elias Grünewald, Frank Pallas | 2025-05-19 | 下载 | In this paper, we present Prink, a novel and practically applicable concept and fully implemented prototype for ks-anonymizing data streams in real-world application architectures. |
| Computing the Schulze Method for Large-Scale Preference Data Sets | Theresa Csar, Martin Lackner, Reinhard Pichler | 2025-05-19 | 下载 | The Schulze method is a voting rule widely used in practice and enjoys many positive axiomatic properties. While it is computable in polynomial time, its straight-forward implementation does not scale... |
| Minos: Exploiting Cloud Performance Variation with Function-as-a-Service Instance Selection | Trever Schirmer, Natalie Carl, Nils Höller, Tobias Pfandzelter, David Bermbach | 2025-05-19 | 下载 | Serverless Function-as-a-Service (FaaS) is a popular cloud paradigm to quickly and cheaply implement complex applications. Because the function instances cloud providers start to execute user code run... |
| Optimization of Hybrid Quantum-Classical Algorithms | Lian Remme, Alexander Weinert, Andre Waschk | 2025-05-19 | 下载 | Quantum computers do not run in isolation; rather, they are embedded in quantum-classical hybrid architectures. In these setups, a quantum processing unit communicates with a classical device in near-... |
| Performance Characterization of Distributed Deep Learning Strategies: A Quantitative Evaluation of DDP, FSDP, and Parameter Server Architectures on GPU Clusters | Md Sultanul Islam Ovi | 2025-05-19 | 下载 | Efficiently scaling deep neural networks across GPU clusters requires navigating complex trade-offs between computational throughput, memory utilization, and synchronization overhead. |
| Learning In Chaos: Efficient Autoscaling and Self-Healing for Multi-Party Distributed Training | Wenjiao Feng, Rongxing Xiao, Zonghang Li, Hongfang Yu, Gang Sun, Long Luo, Mohsen Guizani, Qirong Ho, Steve Liu | 2025-05-19 | 下载 | Node and link churn in multi-party, cross-region clusters over wide-area networks (WANs) often disrupts distributed training. However, checkpoint-based recovery and cloud-centric autoscaling react slo... |
| Sandwich: Separating Prefill-Decode Compilation for Efficient CPU LLM Serving | Juntao Zhao, Jiuru Li, Chuan Wu | 2025-05-19 | 下载 | Utilizing CPUs to serve large language models (LLMs) is a resource-friendly alternative to GPU serving. Existing CPU-based solutions ignore workload differences between the prefill and the decode phas... |
| MTGenRec: An Efficient Distributed Training System for Generative Recommendation Models in Meituan | Yuxiang Wang, Xiao Yan, Chi Ma, Mincong Huang, Xiaoguang Li, Lei Yu, Chuan Liu, Ruidong Han, He Jiang, Bin Yin, Shangyu Chen, Fei Jiang, Xiang Li, Wei Lin, Haowei Han, Bo Du, Jiawei Jiang | 2025-05-19 | 下载 | Recommendation is crucial for both user experience and company revenue in Meituan as a leading lifestyle company, and generative recommendation models (GRMs) are shown to produce quality recommendatio... |
| Digital Twins in the Cloud: A Modular, Scalable and Interoperable Framework for Accelerating Verification and Validation of Autonomous Driving Solutions | Tanmay Vilas Samak, Chinmay Vilas Samak, Giovanni Martino, Pranav Nair, Venkat Krovi | 2025-05-19 | 下载 | Verification and validation (V&V) of autonomous vehicles (AVs) typically requires exhaustive testing across a variety of operating environments and driving scenarios including rare, extreme, or hazard... |
| HydraInfer: Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving | Xianzhe Dong, Tongxuan Liu, Yuting Zeng, Liangyu Liu, Yang Liu, Siyu Wu, Yu Wu, Hailong Yang, Ke Zhang, Jing Li | 2025-05-19 | 下载 | Multimodal Large Language Models (MLLMs) have been rapidly advancing, enabling cross-modal understanding and generation, and propelling artificial intelligence towards artificial general intelligence. |
| Quantum Modeling of Spatial Contiguity Constraints | Yunhan Chang, Amr Magdy, Federico M. Spedalieri | 2025-05-19 | 下载 | Quantum computing has demonstrated potential for solving complex optimization problems; however, its application to spatial regionalization remains underexplored. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Incremental Firmware Update Over-the-Air for Low-Power IoT Devices over LoRaWAN | Andrea De Simone, Giovanna Turvani, Fabrizio Riente | 2025-05-19 | 下载 | Efficiently supporting remote firmware updates in Internet of Things (IoT) devices remains a significant challenge due to the limitations of many IoT communication protocols, which often make it impra... |
| Sionna Research Kit: A GPU-Accelerated Research Platform for AI-RAN | Sebastian Cammerer, Guillermo Marcus, Tobias Zirr, Fayçal Aït Aoudia, Lorenzo Maggi, Jakob Hoydis, Alexander Keller | 2025-05-19 | 下载 | We introduce the NVIDIA Sionna Research Kit, a GPU-accelerated research platform for developing and testing AI/ML algorithms in 5G NR cellular networks. |
| Learning Driven Elastic Task Multi-Connectivity Immersive Computing Systems | Babak Badnava, Jacob Chakareski, Morteza Hashemi | 2025-05-19 | 下载 | In virtual reality (VR) environments, computational tasks exhibit an elastic nature, meaning they can dynamically adjust based on various user and system constraints. |
| Graph Neural Networks Based Anomalous RSSI Detection | Blaž Bertalanič, Matej Vnučec, Carolina Fortuna | 2025-05-19 | 下载 | In today's world, modern infrastructures are being equipped with information and communication technologies to create large IoT networks. It is essential to monitor these networks to ensure smooth o... |
| Forewarned is Forearmed: A Survey on Large Language Model-based Agents in Autonomous Cyberattacks | Minrui Xu, Jiani Fan, Xinyu Huang, Conghao Zhou, Jiawen Kang, Dusit Niyato, Shiwen Mao, Zhu Han, Xuemin, Shen, Kwok-Yan Lam | 2025-05-19 | 下载 | With the continuous evolution of Large Language Models (LLMs), LLM-based agents have advanced beyond passive chatbots to become autonomous cyber entities capable of performing complex tasks, including... |
| Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses | Yingkai Kang, Jiawen Kang, Jinbo Wen, Tao Zhang, Zhaohui Yang, Dusit Niyato, Yan Zhang | 2025-05-19 | 下载 | Vehicular metaverses are an emerging paradigm that merges intelligent transportation systems with virtual spaces, leveraging advanced digital twin and Artificial Intelligence (AI) technologies to seam... |
| An Automated Blackbox Noncompliance Checker for QUIC Server Implementations | Kian Kai Ang, Guy Farrelly, Cheryl Pope, Damith C. Ranasinghe | 2025-05-19 | 下载 | We develop QUICtester, an automated approach for uncovering non-compliant behaviors in the ratified QUIC protocol implementations (RFC 9000/9001). |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Testing Access-Control Configuration Changes for Web Applications | Chengcheng Xiang, Li Zhong, Eric Mugnier, Nathaniel Nguyen, Yuanyuan Zhou, Tianyin Xu | 2025-05-19 | 下载 | Access-control misconfigurations are among the main causes of today's data breaches in web applications. However, few techniques are available to support automatic and systematic testing for access-co... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Bayesian Hierarchical Models for Quantitative Estimates for Performance metrics applied to Saddle Search Algorithms | Rohit Goswami | 2025-05-19 | 下载 | Rigorous performance evaluation is essential for developing robust algorithms for high-throughput computational chemistry. Traditional benchmarking, however, often struggles to account for system-spec... |
| SzCORE as a benchmark: report from the seizure detection challenge at the 2025 AI in Epilepsy and Neurological Disorders Conference | Jonathan Dan, Amirhossein Shahbazinia, Christodoulos Kechris, David Atienza | 2025-05-19 | 下载 | Reliable automatic seizure detection from long-term EEG remains a challenge, as current machine learning models often fail to generalize across patients or clinical settings. |
| Net-Zero: A Comparative Study on Neural Network Design for Climate-Economic PDEs Under Uncertainty | Carlos Rodriguez-Pardo, Louis Daumas, Leonardo Chiani, Massimo Tavoni | 2025-05-19 | 下载 | Climate-economic modeling under uncertainty presents significant computational challenges that may limit policymakers' ability to address climate change effectively. |
| eBPF-Based Instrumentation for Generalisable Diagnosis of Performance Degradation | Diogo Landau, Jorge Barbosa, Nishant Saurabh | 2025-05-19 | 下载 | Online Data Intensive applications (e.g. message brokers, ML inference and databases) are core components of the modern internet, providing critical functionalities to connecting services. |
| Effects of the Auto-Correlation of Delays on the Age of Information: A Gaussian Process Framework | Atsushi Inoie, Yoshiaki Inoue | 2025-05-19 | 下载 | The age of information (AoI) has been studied actively in recent years as a performance measure for systems that require real-time performance, such as remote monitoring systems via communication netw... |