2026-04-10

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Sustainable Transformer Neural Network Acceleration with Stochastic Photonic Computing	S. Afifi, O. Alo, I. Thakkar, S. Pasricha	2026-04-10	下载	Transformers achieve state-of-the-art performance in natural language processing, vision, and scientific computing, but demand high computation and memory.
A 0.5-V Linear Neuromorphic Voltage-to-Spike Encoder Using a Bulk-Driven Transconductor	Meysam Akbari, Erika Covi, Kea-Tiong Tang	2026-04-10	下载	This work introduces an ultralow-power voltage-to-spike encoder that achieves near-linear voltage-to-firing-rate conversion by pairing a linearized bulk-driven transconductor with a DPI-based LIF neur...
MATCHA: Efficient Deployment of Deep Neural Networks on Multi-Accelerator Heterogeneous Edge SoCs	Enrico Russo, Mohamed Amine Hamdi, Alessandro Ottaviano, Francesco Conti, Angelo Garofalo, Daniele Jahier Pagliari, Maurizio Palesi, Luca Benini, Alessio Burrello	2026-04-10	下载	Deploying DNNs on System-on-Chips (SoC) with multiple heterogeneous acceleration engines is challenging, and the majority of deployment frameworks cannot fully exploit heterogeneity.
DRIFT: Harnessing Inherent Fault Tolerance for Efficient and Reliable Diffusion Model Inference	Jinqi Wen, Tong Xie, Runsheng Wang, Meng Li	2026-04-10	下载	Diffusion model deployment has been suffering from high energy consumption and inference latency despite its superior performance in visual generation tasks.
From Indiscriminate to Targeted: Efficient RTL Verification via Functionally Key Signal-Driven LLM Assertion Generation	Yonghao Wang, Hongqin Lyu, Boling Chen, MinYang Bao, Wenchao Ding, Feng Gu, Zhiteng Chao, Jianan Mu, Kan Shi, Tiancheng Wang, Huawei Li	2026-04-10	下载	Functional verification has become the most time-consuming phase in IC development, and Assertion-Based Verification (ABV) is key to reducing debugging time.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Sustaining Exascale Performance: Lessons from HPL and HPL-MxP on Aurora	Kazushige Goto, Huda Ibeid, Kalyan Kumaran, Servesh Muralidharan, Anthony-Trung Nguyen, Aditya Nishtala	2026-04-10	下载	Sustaining exascale performance in production requires engineering choices and operational practices that emerge only under real deployment constraints and demand coordination across system layers.
XFED: Non-Collusive Model Poisoning Attack Against Byzantine-Robust Federated Classifiers	Israt Jahan Mouri, Muhammad Ridowan, Muhammad Abdullah Adnan	2026-04-10	下载	Model poisoning attacks pose a significant security threat to Federated Learning (FL). Most existing model poisoning attacks rely on collusion, requiring adversarial clients to coordinate by exchangin...
NOMAD: Generating Embeddings for Massive Distributed Graphs	Aishwarya Sarkar, Sayan Ghosh, Nathan R. Tallent, Ali Jannesari	2026-04-10	下载	Successful machine learning on graphs or networks requires embeddings that not only represent nodes and edges as low-dimensional vectors but also preserve the graph structure.
A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs	Chen Zhang, Yan Ding, Haotian Wang, Chubo Liu, Keqin Li, Kenli Li	2026-04-10	下载	During the deployment of Large Language Models (LLMs), the autoregressive decoding phase on heterogeneous NPU platforms (e.g., Ascend 910B) faces severe memory-bound challenges.
MATCHA: Efficient Deployment of Deep Neural Networks on Multi-Accelerator Heterogeneous Edge SoCs	Enrico Russo, Mohamed Amine Hamdi, Alessandro Ottaviano, Francesco Conti, Angelo Garofalo, Daniele Jahier Pagliari, Maurizio Palesi, Luca Benini, Alessio Burrello	2026-04-10	下载	Deploying DNNs on System-on-Chips (SoC) with multiple heterogeneous acceleration engines is challenging, and the majority of deployment frameworks cannot fully exploit heterogeneity.
TensorHub: Scalable and Elastic Weight Transfer for LLM RL Training	Chenhao Ye, Huaizheng Zhang, Mingcong Han, Baoquan Zhong, Xiang Li, Qixiang Chen, Xinyi Zhang, Weidong Zhang, Kaihua Jiang, Wang Zhang, He Sun, Wencong Xiao, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau	2026-04-10	下载	Modern LLM reinforcement learning (RL) workloads require a highly efficient weight transfer system to scale training across heterogeneous computational resources.
EdgeFlow: Fast Cold Starts for LLMs on Mobile Devices	Yongsheng Yan, Jiacheng Shen, Xuchuan Luo, Yangfan Zhou	2026-04-10	下载	Deploying large language models (LLMs) on mobile devices is an emerging trend to enable data privacy and offline accessibility of LLM applications.
Watt Counts: Energy-Aware Benchmark for Sustainable LLM Inference on Heterogeneous GPU Architectures	Mauricio Fadel Argerich, Jonathan Fürst, Marta Patiño-Martínez	2026-04-10	下载	While the large energy consumption of Large Language Models (LLMs) is recognized by the community, system operators lack guidance for energy-efficient LLM inference deployments that leverage energy tr...
Finding Nemo-Nemo: CFT DAG-based Consensus in the WAN	Rithwik Kerur, Pasindu Tennage, Philipp Jovanovic, Dahlia Malkhi, Alberto Sonnino, Igor Zablotchi	2026-04-10	下载	This paper introduces Nemo-Nemo, a practical crash-fault tolerant (CFT) consensus protocol designed to outperform existing protocols in wide-area networks by bridging design principles from the CFT an...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Policy-Aware Edge LLM-RAG Framework for Internet of Battlefield Things Mission Orchestration	Om Solanki, Lopamudra Praharaj, Deepti Gupta, Maanak Gupta	2026-04-10	下载	Large Language Models (LLMs) offer a promising interface for intent-driven control of autonomous cyber-physical systems, but their direct use in mission-critical Internet of Battlefield Things (IoBT) ...
EYWA: Elastic Load-Balancing and High-Availability Wired Virtual Network Architecture	Wookjae Jeong, Jungin Jung	2026-04-10	下载	Infrastructure as a Service (IaaS) in cloud environments provides compute, storage, networking, and other fundamental resources that allow consumers to deploy and run arbitrary software, including ope...
SatQNet: Satellite-assisted Quantum Network Entanglement Routing Using Directed Line Graph Neural Networks	Tobias Meuser, Jannis Weil, Aninda Lahiri, Marius Paraschiv	2026-04-10	下载	Quantum networks are expected to become a key enabler for interconnecting quantum devices. In contrast to classical communication networks, however, information transfer in quantum networks is usually...
QuIKS: Near-Zero Latency Key Supply with Adaptive Buffering for Resource-Efficient Quantum Key Distribution Networks	Yuxin Chen, Zite Xia, Jian Li, Kaiping Xue, Zhonghui Li, Lutong Chen, Ruidong Li	2026-04-10	下载	Quantum key distribution (QKD) networks provide information-theoretically secure keys for distant parties, emerging as a vital alternative to classical cryptography infrastructures threatened by quant...
"Take Me Home, Wi-Fi Drone": A Drone-based Wireless System for Wilderness Search and Rescue	Weiying Hou, Luca Jiang-Tao Yu, Chenshu Wu	2026-04-10	下载	Wilderness Search and Rescue (WiSAR) represents a longstanding and critical societal challenge, demanding innovative and automatic technological solutions.
Scrutinizing Real-life Configurations of Random Access Procedures in Cellular Networks	Joris Belder, Anup Bhattacharjee, Fernando Kuipers	2026-04-10	下载	In cellular networks, base stations broadcast configurations that devices use for the random access procedure, which is a vital part of the connection setup.
Plasticity-Enhanced Multi-Agent Mixture of Experts for Dynamic Objective Adaptation in UAVs-Assisted Emergency Communication Networks	Wen Qiu, Zhiqiang He, Wei Zhao, Hiroshi Masui	2026-04-10	下载	Unmanned aerial vehicles serving as aerial base stations can rapidly restore connectivity after disasters, yet abrupt changes in user mobility and traffic demands shift the quality of service trade-of...
Multimodal Large Language Model Enabled Robust Beamforming for HAP Downlink Communications	Xiaoyu Xing, Peng Yang, Guoquan Tao, Dingyi Lu, Zehui Xiong, Xianbin Cao	2026-04-10	下载	Small changes in high altitude platform (HAP) attitude can cause significant deviations in HAP downlink beam directions, thereby severely degrading HAP downlink communication performance.
Generative AI Agent Empowered Power Allocation for HAP Propulsion and Communication Systems	Xiaoyu Xing, Dingyi Lu, Peng Yang, Zehui Xiong, Xianbin Cao, Tony Q. S. Quek	2026-04-10	下载	High altitude platforms (HAPs) are emerging as a key enabler for 6G coverage, yet limited energy must be split between propulsion and communications.

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Decoupling Vector Data and Index Storage for Space Efficiency	Yuanming Ren, Juncheng Zhang, Yanjing Ren, Rui Yang, Di Wu, Patrick P. C. Lee	2026-04-10	下载	Managing large-scale vector datasets with disk-based approximate nearest neighbor search (ANNS) systems faces critical efficiency challenges stemming from the co-location of vector data and auxiliary ...
EdgeFlow: Fast Cold Starts for LLMs on Mobile Devices	Yongsheng Yan, Jiacheng Shen, Xuchuan Luo, Yangfan Zhou	2026-04-10	下载	Deploying large language models (LLMs) on mobile devices is an emerging trend to enable data privacy and offline accessibility of LLM applications.