Skip to content

2026-03-08

cs.AR - Architecture

标题作者发布日期PDF摘要
Mitigating the Memory Bottleneck with Machine Learning-Driven and Data-Aware Microarchitectural TechniquesRahul Bera2026-03-08下载Modern applications process massive data volumes that overwhelm the storage and retrieval capabilities of memory systems, making memory the primary performance and energy-efficiency bottleneck of comp...
Accelerating Diffusion Models for Generative AI Applications with Silicon PhotonicsTharini Suresh, Salma Afifi, Sudeep Pasricha2026-03-08下载Diffusion models have revolutionized generative AI, with their inherent capacity to generate highly realistic state-of-the-art synthetic data.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
A Lock-Free, Fully GPU-Resident Architecture for the Verification of Goldbach's ConjectureIsaac Llorente-Saguer2026-03-08下载We present a fully device-resident, multi-GPU architecture for the large-scale computational verification of Goldbach's conjecture. In prior work, a segmented double-sieve eliminated monolithic VRAM b...
ArcLight: A Lightweight LLM Inference Architecture for Many-Core CPUsYuzhuang Xu, Xu Han, Yuxuan Li, Wanxiang Che2026-03-08下载Although existing frameworks for large language model (LLM) inference on CPUs are mature, they fail to fully exploit the computation potential of many-core CPU platforms.
Structured Gossip: A Partition-Resilient DNS for Internet-Scale Dynamic NetworksPriyanka Sinha, Dilys Thomas2026-03-08下载Network partitions pose fundamental challenges to distributed name resolution in mobile ad-hoc networks (MANETs) and edge computing. Existing solutions either require active coordination that fails to...
Scalable Training of Mixture-of-Experts Models with Megatron CoreZijie Yan, Hongxiao Bai, Xin Yao, Dennis Liu, Tong Liu, Hongbin Liu, Pingtian Li, Evan Wu, Shiqing Fan, Li Tao, Robin Zhang, Yuzhong Wang, Shifang Xu, Jack Chang, Xuwen Chen, Kunlun Li, Yan Bai, Gao Deng, Nan Zheng, Vijay Anand Korthikanti, Abhinav Khattar, Ethan He, Soham Govande, Sangkug Lym, Zhongbo Zhu, Qi Zhang, Haochen Yuan, Xiaowei Ren, Deyu Fu, Tailai Ma, Shunkang Zhang, Jiang Shao, Ray Wang, Vasudevan Rengasamy, Rachit Garg, Santosh Bhavani, Xipeng Li, Chandler Zhou, David Wu, Yingcan Wei, Ashwath Aithal, Michael Andersch, Mohammad Shoeybi, Jiajie Yao, June Yang2026-03-08下载Scaling Mixture-of-Experts (MoE) training introduces systems challenges absent in dense models. Because each token activates only a subset of experts, this sparsity allows total parameters to grow muc...
Mitigating the Memory Bottleneck with Machine Learning-Driven and Data-Aware Microarchitectural TechniquesRahul Bera2026-03-08下载Modern applications process massive data volumes that overwhelm the storage and retrieval capabilities of memory systems, making memory the primary performance and energy-efficiency bottleneck of comp...
Performance Evaluation of Automated Multi-Service Deployment in Edge-Cloud Environments with the CODECO ToolkitGeorgios Koukis, Ioannis Dermentzis, Vassilis Tsaoussidis, Jan Lenke, Fabian Wolk, Daniel Uceda, Guillermo Sanchez, Miguel A. Puentes, Javier Serrano, Panagiotis Karamolegkos, Rute C. Sofia2026-03-08下载Containerized microservices are widely adopted for latency-sensitive and compute-intensive applications, with Kubernetes (K8s) as the dominant orchestration platform.
MAS-H2: A Hierarchical Multi-Agent System for Holistic Cloud-Native AutoscalingHamed Hamzeh, Parisa Vahdatian2026-03-08下载Autoscaling in cloud-native platforms like Kubernetes is reactive and metric-driven, leading to a strategic void problem. This comes from the decoupling of higher-level business policies from lower-le...
Agentic AI-Driven UAV Network Deployment: A LLM-Enhanced Exact Potential Game ApproachXin Tang, Qian Chen, Binhan Liao, Yaqi Zhang, Jianxin Chen, Changyuan Zhao, Junchuan Fan, Junxi Tian, Xiaohuan Li2026-03-08下载Unmanned Aerial Vehicular Networks (UAVNs) are envisioned to provide flexible connectivity, wide-area coverage, and low-latency services in dynamic environments.
Link Wars: The Semantic Crisis. Is the debate over or is it just beginning?Paul Borrill2026-03-08下载For fifty years, networking has fragmented whenever new workloads exposed hidden assumptions about time, ordering, failure, and trust. This paper argues that the current interconnect landscape -- NVLi...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Structured Gossip: A Partition-Resilient DNS for Internet-Scale Dynamic NetworksPriyanka Sinha, Dilys Thomas2026-03-08下载Network partitions pose fundamental challenges to distributed name resolution in mobile ad-hoc networks (MANETs) and edge computing. Existing solutions either require active coordination that fails to...
Learning the APT Kill Chain: Temporal Reasoning over Provenance Data for Attack Stage EstimationTrung V. Phan, Thomas Bauschert2026-03-08下载Advanced Persistent Threats (APTs) evolve through multiple stages, each exhibiting distinct temporal and structural behaviors. Accurate stage estimation is critical for enabling adaptive cyber defense...
Toward Real-Time Mirrors Intelligence: System-Level Latency and Computation Evaluation in Internet of Mirrors (IoM)Haneen Fatima, Muhammad Ali Imran, Ahmad Taha, Lina Mohjazi2026-03-08下载The Internet of Mirrors (IoM) is an emerging IoT ecosystem of interconnected smart mirrors designed to deliver personalised services across a three-tier node hierarchy spanning consumer, professional,...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Structured Gossip: A Partition-Resilient DNS for Internet-Scale Dynamic NetworksPriyanka Sinha, Dilys Thomas2026-03-08下载Network partitions pose fundamental challenges to distributed name resolution in mobile ad-hoc networks (MANETs) and edge computing. Existing solutions either require active coordination that fails to...
Mitigating the Memory Bottleneck with Machine Learning-Driven and Data-Aware Microarchitectural TechniquesRahul Bera2026-03-08下载Modern applications process massive data volumes that overwhelm the storage and retrieval capabilities of memory systems, making memory the primary performance and energy-efficiency bottleneck of comp...
Quine: Realizing LLM Agents as Native POSIX ProcessesHao Ke2026-03-08下载Current LLM agent frameworks often implement isolation, scheduling, and communication at the application layer, even though these mechanisms are already provided by mature operating systems.

cs.PF - Performance

标题作者发布日期PDF摘要
A Lock-Free, Fully GPU-Resident Architecture for the Verification of Goldbach's ConjectureIsaac Llorente-Saguer2026-03-08下载We present a fully device-resident, multi-GPU architecture for the large-scale computational verification of Goldbach's conjecture. In prior work, a segmented double-sieve eliminated monolithic VRAM b...

基于 VitePress 构建