Skip to content

2026-03-09

cs.AR - Architecture

标题作者发布日期PDF摘要
The qsqs Inequality: Quantifying the Double Penalty of Mixture-of-Experts at InferenceVignesh Adhinarayanan, Nuwan Jayasena2026-03-09下载Mixture-of-Experts (MoE) models deliver high quality at low training FLOPs, but this efficiency often vanishes at inference. We identify a double penalty that structurally disadvantages MoE architectu...
Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges AheadZhongming Yu, Naicheng Yu, Hejia Zhang, Wentao Ni, Mingrui Yin, Jiaying Yang, Yujie Zhao, Jishen Zhao2026-03-09下载As LLM agents evolve into collaborative multi-agent systems, their memory requirements grow rapidly in complexity. This position paper frames multi-agent memory as a computer architecture problem.
bsort: A theoretically efficient non-comparison-based sorting algorithm for integer and floating-point numbersBenjamín Guzmán2026-03-09下载This paper presents bsort, a non-comparison-based sorting algorithm for signed and unsigned integers, and floating-point values. The algorithm unifies these cases through an approach derived from bina...
Trust Nothing: RTOS Security without Run-Time Software TCB (Extended Version)Eric Ackermann, Sven Bugiel2026-03-09下载Embedded devices face an ever-expanding threat landscape: vulnerabilities in application software, operating system kernels, and peripherals threaten the embedded device integrity.
Why Learn What Physics Already Knows? Realizing Agile mmWave-based Human Pose Estimation via Physics-Guided PreprocessingShuntian Zheng, Jiaqi Li, Minzhe Ni, Xiaoman Lu, Yu Guan2026-03-09下载We revisit millimeter-wave (mmWave) human pose estimation (HPE) from a signal preprocessing perspective. A single mmWave frame provides structured dimensions that map directly to human geometry and mo...
GOMA: Geometrically Optimal Mapping via Analytical Modeling for Spatial AcceleratorsWulve Yang, Hailong Zou, Rui Zhou, Jionghao Zhang, Qiang Li, Gang Li, Yi Zhan, Shushan Qiao2026-03-09下载General matrix multiplication (GEMM) on spatial accelerators is highly sensitive to mapping choices in both execution efficiency and energy consumption.
ConnChecker: Automated Root-Cause Analysis for Formal Connectivity Check via GraphDo Ngoc Tiep, Nguyen Linh Anh, Luu Danh Minh2026-03-09下载Formal connectivity checking offers scalable verification of signal paths in complex SoC designs, but debugging counterexamples remains a manual and time-consuming process.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Lockbox -- A Zero Trust Architecture for Secure Processing of Sensitive Cloud WorkloadsVamshi Krishna Thotempudi, Mahima Agarwal, Raghav Batta, Anjali Mangal2026-03-09下载Enterprises increasingly rely on cloud-based applications to process highly sensitive data artifacts. Although cloud adoption improves agility and scalability, it also introduces new security challeng...
The qsqs Inequality: Quantifying the Double Penalty of Mixture-of-Experts at InferenceVignesh Adhinarayanan, Nuwan Jayasena2026-03-09下载Mixture-of-Experts (MoE) models deliver high quality at low training FLOPs, but this efficiency often vanishes at inference. We identify a double penalty that structurally disadvantages MoE architectu...
A Consensus-Driven Multi-LLM Pipeline for Missing-Person InvestigationsJoshua Castillo, Ravi Mukkamala2026-03-09下载The first 72 hours of a missing-person investigation are critical for successful recovery. Guardian is an end-to-end system designed to support missing-child investigation and early search planning.
FedLECC: Cluster- and Loss-Guided Client Selection for Federated Learning under Non-IID DataDaniel M. Jimenez-Gutierrez, Giovanni Giunta, Mehrdad Hassanzadeh, Aris Anagnostopoulos, Ioannis Chatzigiannakis, Andrea Vitaletti2026-03-09下载Federated Learning (FL) enables distributed Artificial Intelligence (AI) across cloud-edge environments by allowing collaborative model training without centralizing data.
DeZent: Decentralized z-Anonymity with Privacy-Preserving CoordinationCarolin Brunn, Florian Tschorsch2026-03-09下载Analyzing large volumes of sensor network data, such as electricity consumption measurements from smart meters, is essential for modern applications but raises significant privacy concerns.
Serving Compound Inference Systems on Datacenter GPUsSriram Devata, Rahul Singh, Sarita Adve2026-03-09下载Applications in emerging domains such as XR are being built as compound inference systems, where multiple ML models are composed in the form of a task graph to service each request.
A Blockchain-based Traceability System for AI-Driven Engine Blade InspectionMahmoud Hafez, Eman Ouda, Mohammed A. Mohammed Eltoum, Khaled Salah, Yusra Abdulrahman2026-03-09下载Aircraft engine blade maintenance relies on inspection records shared across manufacturers, airlines, maintenance organizations, and regulators.
TA-RNN-Medical-Hybrid: A Time-Aware and Interpretable Framework for Mortality Risk PredictionZahra Jafari, Azadeh Zamanifar, Amirfarhad Farhadi2026-03-09下载Accurate and interpretable mortality risk prediction in intensive care units (ICUs) remains a critical challenge due to the irregular temporal structure of electronic health records (EHRs), the comple...
A Hodge-Based Framework for Service Operational Analysis in Serverless PlatformsGianluca Reali, Mauro Femminella2026-03-09下载In this paper we propose a method for analyzing services deployed in serverless platforms. These services typically consists of orchestrated functions that can exhibit complex and non-conservative inf...
Covenant-72B: Pre-Training a 72B LLM with Trustless Peers Over-the-InternetJoel Lidin, Amir Sarfi, Erfan Miahi, Quentin Anthony, Shivam Chauhan, Evangelos Pappas, Benjamin Thérien, Eugene Belilovsky, Samuel Dare2026-03-09下载Recently, there has been increased interest in globally distributed training, which has the promise to both reduce training costs and democratize participation in building large-scale foundation model...
SafarDB: FPGA-Accelerated Distributed Transactions via Replicated Data TypesJavad Saberlatibari, Prithviraj Yuvaraj, Mohsen Lesani, Philip Brisk, Mohammad Sadoghi2026-03-09下载Data replication is a critical aspect of data center design, as it ensures high availability, scalability, and fault tolerance. However, replicas need to be coordinated to maintain convergence and dat...
SI-ChainFL: Shapley-Incentivized Secure Federated Learning for High-Speed Rail Data SharingMingjie Zhao, Cheng Dai, Fei Chen, Xin Chen, Kaoru Ota, Mianxiong Dong, Bing Guo2026-03-09下载In high-speed rail (HSR) systems, federated learning (FL) enables cross-departmental flow prediction without sharing raw data. However, existing schemes suffer from two key limitations: (1) insufficie...
ACE-GF-based Attestation Relay for PQC - Lightweight Mempool Propagation Without On-Path ProofsJian Sheng Wang2026-03-09下载In post-quantum blockchain settings, objects that require validity proofs (e.g., blob roots, execution-layer or consensus-layer signature aggregates) must be broadcast through mempool and relay networ...
ZK-ACE: Identity-Centric Zero-Knowledge Authorization for Post-Quantum Blockchain SystemsJian Sheng Wang2026-03-09下载Post-quantum signature schemes introduce kilobyte-scale authorization artifacts when applied directly to blockchain transaction validation. A widely considered mitigation is to verify post-quantum sig...
RAPID: Redundancy-Aware and Compatibility-Optimal Edge-Cloud Partitioned Inference for Diverse VLA ModelsZihao Zheng, Sicheng Tian, Hangyu Cao, Chenyue Li, Jiayu Chen, Maoliang Li, Xinhao Sun, Hailong Zou, Guojie Luo, Xiang Chen2026-03-09下载Vision Language Action (VLA) models are mainstream in embodied intelligence but face high inference costs. Edge-Cloud Collaborative (ECC) inference offers an effective fix by easing edge-device comput...
SageSched: Efficient LLM Scheduling Confronting Demand Uncertainty and HybridityZhenghao Gan, Yichen Bao, Yifei Liu, Chen Chen, Quan Chen, Minyi Guo2026-03-09下载Efficient LLM inference scheduling is crucial for user experience. However, LLM inferences exhibit remarkable demand uncertainty (with unknown output length beforehand) and hybridity (being both compu...
The Consistency Correctness in CoPPar TreeXincheng Yang, Kyle Hale2026-03-09下载This article is a supplementary document for the CoPPar Tree paper, providing a detailed correctness proof for the CoPPar architecture.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Optimizing Reinforcement Learning Training over Digital Twin Enabled Multi-fidelity NetworksHanzhi Yu, Hasan Farooq, Julien Forgeat, Shruti Bothe, Kristijonas Cyras, Md Moin Uddin Chowdhury, Mingzhe Chen2026-03-09下载In this paper, we investigate a novel digital network twin (DNT) assisted deep learning (DL) model training framework. In particular, we consider a physical network where a base station (BS) uses seve...
Why Channel-Centric Models are not Enough to Predict End-to-End Performance in Private 5G: A Measurement Campaign and Case StudyNils Jörgensen2026-03-09下载Communication-aware robot planning requires accurate predictions of wireless network performance. Current approaches rely on channel-level metrics such as received signal strength and signal-to-noise ...
OAuthHub: Mitigating OAuth Data Overaccess through a Local Data HubQiyu Li, Yuhe Tian, Haojian Jin2026-03-09下载Most OAuth service providers, such as Google and Microsoft, offer only a limited range of coarse-grained data access. As a result, third-party OAuth applications often end up accessing more user data ...
Mobile Base Station Optimal Tour in Wide Area IoT Sensor NetworksSachin Kadam2026-03-09下载Wide-area IoT sensor networks require efficient data collection mechanisms when sensors are dispersed over large regions with limited communication infrastructure.
Predicting Conflict Impact on Performance in O-RANPietro Brach del Prever, Niloofar Mohamadi, Salvatore D'Oro, Leonardo Bonati, Michele Polese, Łukasz Kułacz, Piotr Jaworski, Adrian Kliks, Heiko Lehmann, Tommaso Melodia2026-03-09下载The O-RAN Alliance promotes the integration of intelligent autonomous agents to control the Radio Access Network (RAN). This improves flexibility, performance, and observability in the RAN, but introd...
Time-based Fairness Improves Performance in Multi-rate WLANsGodfrey Tan, John Guttag2026-03-09下载The performance seen by individual clients on a wireless local area network (WLAN) is heavily influenced by the manner in which wireless channel capacity is allocated.
Where Do Flow Semantics Reside? A Protocol-Native Tabular Pretraining Paradigm for Encrypted Traffic ClassificationSizhe Huang, Shujie Yang2026-03-09下载Self-supervised masked modeling shows promise for encrypted traffic classification by masking and reconstructing raw bytes. Yet recent work reveals these methods fail to reduce reliance on labeled dat...
Silicone Ethernet (SEth): a Nervous System for Robotic TouchMengyao Liu, Dag Malstaf, Jonathan Oostvogels, Sam Michiels, Alexander Badri-Spröwitz, Danny Hughes2026-03-09下载Fine-grained robotic touch sensing is essential for tasks such as robot-human interaction and the handling of hazardous materials. Yet, the sense of touch of robots is limited by the cost and complexi...
A Comparative Study of Recent Advances in Internet of Intrusion Detection ThingsMarianna Rezk, Hassan Harb, Ismail Bennis, Sebastien Bindel, Hafid Abouaissa2026-03-09下载The Internet of Things (IoT) has revolutionized the way devices communicate and interact with each other, but it has also created new challenges in terms of security.
A Hodge-Based Framework for Service Operational Analysis in Serverless PlatformsGianluca Reali, Mauro Femminella2026-03-09下载In this paper we propose a method for analyzing services deployed in serverless platforms. These services typically consists of orchestrated functions that can exhibit complex and non-conservative inf...
Not All Prefills Are Equal: PPD Disaggregation for Multi-turn LLM ServingZongze Li, Jingyu Liu, Zach Xu, Yineng Zhang, Tahseen Rabbani, Ce Zhang2026-03-09下载Prefill-Decode (PD) disaggregation has become the standard architecture for modern LLM inference engines, which alleviates the interference of two distinctive workloads.
PreHO: Predictive Handover for LEO Satellite NetworksXingqiu He, Zijie Ying, Chaoqun You, Yue Gao2026-03-09下载Low-Earth Orbit (LEO) Satellite Networks (LSNs) offer a promising solution for extending connectivity to areas not covered by Terrestrial Networks (TNs).
Energy-Efficient Online Scheduling for Wireless Powered Mobile Edge Computing NetworksXingqiu He, Chaoqun You, Yuzhi Yang, Zihan Chen, Yuhang Shen, Tony Q. S. Quek, Yue Gao2026-03-09下载Wireless Powered Mobile Edge Computing (WP-MEC) integrates mobile edge computing (MEC) with wireless power transfer (WPT) to simultaneously extend the operational lifetime and enhance the computationa...
Hard/Soft NLoS Detection via Combinatorial Data Augmentation for 6G PositioningSang-Hyeok Kim, Seung Min Yu, Jihong Park, Seung-Woo Ko2026-03-09下载A key enabler for meeting the stringent requirements of 6G positioning is the ability to exploit site-dependent information governing line-of-sight (LoS) and non-line-of-sight (NLoS) propagation.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
The Missing Memory Hierarchy: Demand Paging for LLM Context WindowsTony Mason2026-03-09下载The context window of a large language model is not memory. It is L1 cache: a small, fast, expensive resource that the field treats as the entire memory system.
Trust Nothing: RTOS Security without Run-Time Software TCB (Extended Version)Eric Ackermann, Sven Bugiel2026-03-09下载Embedded devices face an ever-expanding threat landscape: vulnerabilities in application software, operating system kernels, and peripherals threaten the embedded device integrity.

cs.PF - Performance

标题作者发布日期PDF摘要
The qsqs Inequality: Quantifying the Double Penalty of Mixture-of-Experts at InferenceVignesh Adhinarayanan, Nuwan Jayasena2026-03-09下载Mixture-of-Experts (MoE) models deliver high quality at low training FLOPs, but this efficiency often vanishes at inference. We identify a double penalty that structurally disadvantages MoE architectu...
bsort: A theoretically efficient non-comparison-based sorting algorithm for integer and floating-point numbersBenjamín Guzmán2026-03-09下载This paper presents bsort, a non-comparison-based sorting algorithm for signed and unsigned integers, and floating-point values. The algorithm unifies these cases through an approach derived from bina...
DyLLM: Efficient Diffusion LLM Inference via Saliency-based Token Selection and Partial AttentionYounjoo Lee, Junghoo Lee, Seungkyun Dan, Jaiyoung Park, Jung Ho Ahn2026-03-09下载Masked Diffusion Language Models (MDLMs) enable parallel token decoding, providing a promising alternative to the sequential nature of autoregressive generation.

基于 VitePress 构建