2026-03-08

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Mitigating the Memory Bottleneck with Machine Learning-Driven and Data-Aware Microarchitectural Techniques	Rahul Bera	2026-03-08	下载	Modern applications process massive data volumes that overwhelm the storage and retrieval capabilities of memory systems, making memory the primary performance and energy-efficiency bottleneck of comp...
Accelerating Diffusion Models for Generative AI Applications with Silicon Photonics	Tharini Suresh, Salma Afifi, Sudeep Pasricha	2026-03-08	下载	Diffusion models have revolutionized generative AI, with their inherent capacity to generate highly realistic state-of-the-art synthetic data.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
A Lock-Free, Fully GPU-Resident Architecture for the Verification of Goldbach's Conjecture	Isaac Llorente-Saguer	2026-03-08	下载	We present a fully device-resident, multi-GPU architecture for the large-scale computational verification of Goldbach's conjecture. In prior work, a segmented double-sieve eliminated monolithic VRAM b...
ArcLight: A Lightweight LLM Inference Architecture for Many-Core CPUs	Yuzhuang Xu, Xu Han, Yuxuan Li, Wanxiang Che	2026-03-08	下载	Although existing frameworks for large language model (LLM) inference on CPUs are mature, they fail to fully exploit the computation potential of many-core CPU platforms.
Structured Gossip: A Partition-Resilient DNS for Internet-Scale Dynamic Networks	Priyanka Sinha, Dilys Thomas	2026-03-08	下载	Network partitions pose fundamental challenges to distributed name resolution in mobile ad-hoc networks (MANETs) and edge computing. Existing solutions either require active coordination that fails to...
Scalable Training of Mixture-of-Experts Models with Megatron Core	Zijie Yan, Hongxiao Bai, Xin Yao, Dennis Liu, Tong Liu, Hongbin Liu, Pingtian Li, Evan Wu, Shiqing Fan, Li Tao, Robin Zhang, Yuzhong Wang, Shifang Xu, Jack Chang, Xuwen Chen, Kunlun Li, Yan Bai, Gao Deng, Nan Zheng, Vijay Anand Korthikanti, Abhinav Khattar, Ethan He, Soham Govande, Sangkug Lym, Zhongbo Zhu, Qi Zhang, Haochen Yuan, Xiaowei Ren, Deyu Fu, Tailai Ma, Shunkang Zhang, Jiang Shao, Ray Wang, Vasudevan Rengasamy, Rachit Garg, Santosh Bhavani, Xipeng Li, Chandler Zhou, David Wu, Yingcan Wei, Ashwath Aithal, Michael Andersch, Mohammad Shoeybi, Jiajie Yao, June Yang	2026-03-08	下载	Scaling Mixture-of-Experts (MoE) training introduces systems challenges absent in dense models. Because each token activates only a subset of experts, this sparsity allows total parameters to grow muc...
Mitigating the Memory Bottleneck with Machine Learning-Driven and Data-Aware Microarchitectural Techniques	Rahul Bera	2026-03-08	下载	Modern applications process massive data volumes that overwhelm the storage and retrieval capabilities of memory systems, making memory the primary performance and energy-efficiency bottleneck of comp...
Performance Evaluation of Automated Multi-Service Deployment in Edge-Cloud Environments with the CODECO Toolkit	Georgios Koukis, Ioannis Dermentzis, Vassilis Tsaoussidis, Jan Lenke, Fabian Wolk, Daniel Uceda, Guillermo Sanchez, Miguel A. Puentes, Javier Serrano, Panagiotis Karamolegkos, Rute C. Sofia	2026-03-08	下载	Containerized microservices are widely adopted for latency-sensitive and compute-intensive applications, with Kubernetes (K8s) as the dominant orchestration platform.
MAS-H2: A Hierarchical Multi-Agent System for Holistic Cloud-Native Autoscaling	Hamed Hamzeh, Parisa Vahdatian	2026-03-08	下载	Autoscaling in cloud-native platforms like Kubernetes is reactive and metric-driven, leading to a strategic void problem. This comes from the decoupling of higher-level business policies from lower-le...
Agentic AI-Driven UAV Network Deployment: A LLM-Enhanced Exact Potential Game Approach	Xin Tang, Qian Chen, Binhan Liao, Yaqi Zhang, Jianxin Chen, Changyuan Zhao, Junchuan Fan, Junxi Tian, Xiaohuan Li	2026-03-08	下载	Unmanned Aerial Vehicular Networks (UAVNs) are envisioned to provide flexible connectivity, wide-area coverage, and low-latency services in dynamic environments.
Link Wars: The Semantic Crisis. Is the debate over or is it just beginning?	Paul Borrill	2026-03-08	下载	For fifty years, networking has fragmented whenever new workloads exposed hidden assumptions about time, ordering, failure, and trust. This paper argues that the current interconnect landscape -- NVLi...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Structured Gossip: A Partition-Resilient DNS for Internet-Scale Dynamic Networks	Priyanka Sinha, Dilys Thomas	2026-03-08	下载	Network partitions pose fundamental challenges to distributed name resolution in mobile ad-hoc networks (MANETs) and edge computing. Existing solutions either require active coordination that fails to...
Learning the APT Kill Chain: Temporal Reasoning over Provenance Data for Attack Stage Estimation	Trung V. Phan, Thomas Bauschert	2026-03-08	下载	Advanced Persistent Threats (APTs) evolve through multiple stages, each exhibiting distinct temporal and structural behaviors. Accurate stage estimation is critical for enabling adaptive cyber defense...
Toward Real-Time Mirrors Intelligence: System-Level Latency and Computation Evaluation in Internet of Mirrors (IoM)	Haneen Fatima, Muhammad Ali Imran, Ahmad Taha, Lina Mohjazi	2026-03-08	下载	The Internet of Mirrors (IoM) is an emerging IoT ecosystem of interconnected smart mirrors designed to deliver personalised services across a three-tier node hierarchy spanning consumer, professional,...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Structured Gossip: A Partition-Resilient DNS for Internet-Scale Dynamic Networks	Priyanka Sinha, Dilys Thomas	2026-03-08	下载	Network partitions pose fundamental challenges to distributed name resolution in mobile ad-hoc networks (MANETs) and edge computing. Existing solutions either require active coordination that fails to...
Mitigating the Memory Bottleneck with Machine Learning-Driven and Data-Aware Microarchitectural Techniques	Rahul Bera	2026-03-08	下载	Modern applications process massive data volumes that overwhelm the storage and retrieval capabilities of memory systems, making memory the primary performance and energy-efficiency bottleneck of comp...
Quine: Realizing LLM Agents as Native POSIX Processes	Hao Ke	2026-03-08	下载	Current LLM agent frameworks often implement isolation, scheduling, and communication at the application layer, even though these mechanisms are already provided by mature operating systems.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
A Lock-Free, Fully GPU-Resident Architecture for the Verification of Goldbach's Conjecture	Isaac Llorente-Saguer	2026-03-08	下载	We present a fully device-resident, multi-GPU architecture for the large-scale computational verification of Goldbach's conjecture. In prior work, a segmented double-sieve eliminated monolithic VRAM b...