Skip to content

2026-04-10

cs.AR - Architecture

标题作者发布日期PDF摘要
Sustainable Transformer Neural Network Acceleration with Stochastic Photonic ComputingS. Afifi, O. Alo, I. Thakkar, S. Pasricha2026-04-10下载Transformers achieve state-of-the-art performance in natural language processing, vision, and scientific computing, but demand high computation and memory.
A 0.5-V Linear Neuromorphic Voltage-to-Spike Encoder Using a Bulk-Driven TransconductorMeysam Akbari, Erika Covi, Kea-Tiong Tang2026-04-10下载This work introduces an ultralow-power voltage-to-spike encoder that achieves near-linear voltage-to-firing-rate conversion by pairing a linearized bulk-driven transconductor with a DPI-based LIF neur...
MATCHA: Efficient Deployment of Deep Neural Networks on Multi-Accelerator Heterogeneous Edge SoCsEnrico Russo, Mohamed Amine Hamdi, Alessandro Ottaviano, Francesco Conti, Angelo Garofalo, Daniele Jahier Pagliari, Maurizio Palesi, Luca Benini, Alessio Burrello2026-04-10下载Deploying DNNs on System-on-Chips (SoC) with multiple heterogeneous acceleration engines is challenging, and the majority of deployment frameworks cannot fully exploit heterogeneity.
DRIFT: Harnessing Inherent Fault Tolerance for Efficient and Reliable Diffusion Model InferenceJinqi Wen, Tong Xie, Runsheng Wang, Meng Li2026-04-10下载Diffusion model deployment has been suffering from high energy consumption and inference latency despite its superior performance in visual generation tasks.
From Indiscriminate to Targeted: Efficient RTL Verification via Functionally Key Signal-Driven LLM Assertion GenerationYonghao Wang, Hongqin Lyu, Boling Chen, MinYang Bao, Wenchao Ding, Feng Gu, Zhiteng Chao, Jianan Mu, Kan Shi, Tiancheng Wang, Huawei Li2026-04-10下载Functional verification has become the most time-consuming phase in IC development, and Assertion-Based Verification (ABV) is key to reducing debugging time.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Sustaining Exascale Performance: Lessons from HPL and HPL-MxP on AuroraKazushige Goto, Huda Ibeid, Kalyan Kumaran, Servesh Muralidharan, Anthony-Trung Nguyen, Aditya Nishtala2026-04-10下载Sustaining exascale performance in production requires engineering choices and operational practices that emerge only under real deployment constraints and demand coordination across system layers.
XFED: Non-Collusive Model Poisoning Attack Against Byzantine-Robust Federated ClassifiersIsrat Jahan Mouri, Muhammad Ridowan, Muhammad Abdullah Adnan2026-04-10下载Model poisoning attacks pose a significant security threat to Federated Learning (FL). Most existing model poisoning attacks rely on collusion, requiring adversarial clients to coordinate by exchangin...
NOMAD: Generating Embeddings for Massive Distributed GraphsAishwarya Sarkar, Sayan Ghosh, Nathan R. Tallent, Ali Jannesari2026-04-10下载Successful machine learning on graphs or networks requires embeddings that not only represent nodes and edges as low-dimensional vectors but also preserve the graph structure.
A-IO: Adaptive Inference Orchestration for Memory-Bound NPUsChen Zhang, Yan Ding, Haotian Wang, Chubo Liu, Keqin Li, Kenli Li2026-04-10下载During the deployment of Large Language Models (LLMs), the autoregressive decoding phase on heterogeneous NPU platforms (e.g., Ascend 910B) faces severe memory-bound challenges.
MATCHA: Efficient Deployment of Deep Neural Networks on Multi-Accelerator Heterogeneous Edge SoCsEnrico Russo, Mohamed Amine Hamdi, Alessandro Ottaviano, Francesco Conti, Angelo Garofalo, Daniele Jahier Pagliari, Maurizio Palesi, Luca Benini, Alessio Burrello2026-04-10下载Deploying DNNs on System-on-Chips (SoC) with multiple heterogeneous acceleration engines is challenging, and the majority of deployment frameworks cannot fully exploit heterogeneity.
TensorHub: Scalable and Elastic Weight Transfer for LLM RL TrainingChenhao Ye, Huaizheng Zhang, Mingcong Han, Baoquan Zhong, Xiang Li, Qixiang Chen, Xinyi Zhang, Weidong Zhang, Kaihua Jiang, Wang Zhang, He Sun, Wencong Xiao, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau2026-04-10下载Modern LLM reinforcement learning (RL) workloads require a highly efficient weight transfer system to scale training across heterogeneous computational resources.
EdgeFlow: Fast Cold Starts for LLMs on Mobile DevicesYongsheng Yan, Jiacheng Shen, Xuchuan Luo, Yangfan Zhou2026-04-10下载Deploying large language models (LLMs) on mobile devices is an emerging trend to enable data privacy and offline accessibility of LLM applications.
Watt Counts: Energy-Aware Benchmark for Sustainable LLM Inference on Heterogeneous GPU ArchitecturesMauricio Fadel Argerich, Jonathan Fürst, Marta Patiño-Martínez2026-04-10下载While the large energy consumption of Large Language Models (LLMs) is recognized by the community, system operators lack guidance for energy-efficient LLM inference deployments that leverage energy tr...
Finding Nemo-Nemo: CFT DAG-based Consensus in the WANRithwik Kerur, Pasindu Tennage, Philipp Jovanovic, Dahlia Malkhi, Alberto Sonnino, Igor Zablotchi2026-04-10下载This paper introduces Nemo-Nemo, a practical crash-fault tolerant (CFT) consensus protocol designed to outperform existing protocols in wide-area networks by bridging design principles from the CFT an...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Policy-Aware Edge LLM-RAG Framework for Internet of Battlefield Things Mission OrchestrationOm Solanki, Lopamudra Praharaj, Deepti Gupta, Maanak Gupta2026-04-10下载Large Language Models (LLMs) offer a promising interface for intent-driven control of autonomous cyber-physical systems, but their direct use in mission-critical Internet of Battlefield Things (IoBT) ...
EYWA: Elastic Load-Balancing and High-Availability Wired Virtual Network ArchitectureWookjae Jeong, Jungin Jung2026-04-10下载Infrastructure as a Service (IaaS) in cloud environments provides compute, storage, networking, and other fundamental resources that allow consumers to deploy and run arbitrary software, including ope...
SatQNet: Satellite-assisted Quantum Network Entanglement Routing Using Directed Line Graph Neural NetworksTobias Meuser, Jannis Weil, Aninda Lahiri, Marius Paraschiv2026-04-10下载Quantum networks are expected to become a key enabler for interconnecting quantum devices. In contrast to classical communication networks, however, information transfer in quantum networks is usually...
QuIKS: Near-Zero Latency Key Supply with Adaptive Buffering for Resource-Efficient Quantum Key Distribution NetworksYuxin Chen, Zite Xia, Jian Li, Kaiping Xue, Zhonghui Li, Lutong Chen, Ruidong Li2026-04-10下载Quantum key distribution (QKD) networks provide information-theoretically secure keys for distant parties, emerging as a vital alternative to classical cryptography infrastructures threatened by quant...
"Take Me Home, Wi-Fi Drone": A Drone-based Wireless System for Wilderness Search and RescueWeiying Hou, Luca Jiang-Tao Yu, Chenshu Wu2026-04-10下载Wilderness Search and Rescue (WiSAR) represents a longstanding and critical societal challenge, demanding innovative and automatic technological solutions.
Scrutinizing Real-life Configurations of Random Access Procedures in Cellular NetworksJoris Belder, Anup Bhattacharjee, Fernando Kuipers2026-04-10下载In cellular networks, base stations broadcast configurations that devices use for the random access procedure, which is a vital part of the connection setup.
Plasticity-Enhanced Multi-Agent Mixture of Experts for Dynamic Objective Adaptation in UAVs-Assisted Emergency Communication NetworksWen Qiu, Zhiqiang He, Wei Zhao, Hiroshi Masui2026-04-10下载Unmanned aerial vehicles serving as aerial base stations can rapidly restore connectivity after disasters, yet abrupt changes in user mobility and traffic demands shift the quality of service trade-of...
Multimodal Large Language Model Enabled Robust Beamforming for HAP Downlink CommunicationsXiaoyu Xing, Peng Yang, Guoquan Tao, Dingyi Lu, Zehui Xiong, Xianbin Cao2026-04-10下载Small changes in high altitude platform (HAP) attitude can cause significant deviations in HAP downlink beam directions, thereby severely degrading HAP downlink communication performance.
Generative AI Agent Empowered Power Allocation for HAP Propulsion and Communication SystemsXiaoyu Xing, Dingyi Lu, Peng Yang, Zehui Xiong, Xianbin Cao, Tony Q. S. Quek2026-04-10下载High altitude platforms (HAPs) are emerging as a key enabler for 6G coverage, yet limited energy must be split between propulsion and communications.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Decoupling Vector Data and Index Storage for Space EfficiencyYuanming Ren, Juncheng Zhang, Yanjing Ren, Rui Yang, Di Wu, Patrick P. C. Lee2026-04-10下载Managing large-scale vector datasets with disk-based approximate nearest neighbor search (ANNS) systems faces critical efficiency challenges stemming from the co-location of vector data and auxiliary ...
EdgeFlow: Fast Cold Starts for LLMs on Mobile DevicesYongsheng Yan, Jiacheng Shen, Xuchuan Luo, Yangfan Zhou2026-04-10下载Deploying large language models (LLMs) on mobile devices is an emerging trend to enable data privacy and offline accessibility of LLM applications.

基于 VitePress 构建