2025-12-31

cs.AR - Hardware Architecture

Title	Authors	Published	PDF	Abstract
Democratizing Electronic-Photonic AI Systems: An Open-Source AI-Infused Cross-Layer Co-Design and Design Automation Toolflow	Hongjian Zhou, Ziang Yin, Jiaqi Gu	2025-12-31	Download	Photonics is becoming a cornerstone technology for high-performance AI systems and scientific computing, offering unparalleled speed, parallelism, and energy efficiency.
Toward Large-Scale Photonics-Empowered AI Systems: From Physical Design Automation to System-Algorithm Co-Exploration	Ziang Yin, Hongjian Zhou, Nicholas Gangi, Meng Zhang, Jeff Zhang, Zhaoran Rena Huang, Jiaqi Gu	2025-12-31	Download	In this work, we identify three considerations that are essential for realizing practical photonic AI systems at scale: (1) dynamic tensor operation support for modern models rather than only weight-s...
Advances in Agentic AI: Back to the Future	Sergio Alvarez-Telena, Marta Diez-Fernandez	2025-12-31	Download	In light of the recent convergence between Agentic AI and our field of Algorithmization, this paper seeks to restore conceptual clarity and provide a structured analytical framework for an increasingl...
FPGA Co-Design for Efficient N:M Sparse and Quantized Model Inference	Fen-Yu Hsieh, Yun-Chang Teng, Ding-Yong Hong, Jan-Jan Wu	2025-12-31	Download	Large language models (LLMs) have demonstrated remarkable performance across a wide range of language processing tasks. However, this success comes at the cost of substantial computation and memory re...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Vulcan: Instance-Optimal Systems Heuristics Through LLM-Driven Search	Rohit Dwivedula, Divyanshu Saxena, Sujay Yadalam, Daehyeok Kim, Aditya Akella	2025-12-31	Download	Resource-management tasks in modern operating and distributed systems continue to rely primarily on hand-designed heuristics for tasks such as scheduling, caching, or active queue management.
Reliable and Resilient Collective Communication Library for LLM Training and Serving	Wei Wang, Nengneng Yu, Sixian Xiong, Zaoxing Liu	2025-12-31	Download	Modern ML training and inference now span tens to tens of thousands of GPUs, where network faults can waste 10-15% of GPU hours due to slow recovery.
AI-Driven Cloud Resource Optimization for Multi-Cluster Environments	Vinoth Punniyamoorthy, Akash Kumar Agarwal, Bikesh Kumar, Abhirup Mazumder, Kabilan Kannan, Sumit Saha	2025-12-31	Download	Modern cloud-native systems increasingly rely on multi-cluster deployments to support scalability, resilience, and geographic distribution. However, existing resource management approaches remain larg...
Adaptive Resource Orchestration for Distributed Quantum Computing Systems	Kuan-Cheng Chen, Felix Burt, Nitish K. Panigrahy, Kin K. Leung	2025-12-31	Download	Scaling quantum computing beyond a single device requires networking many quantum processing units (QPUs) into a coherent quantum-HPC system. We propose the Modular Entanglement Hub (ModEn-Hub) archit...
Distributed Bilevel Optimization with Dual Pruning for Resource-limited Clients	Mingyi Li, Xiao Zhang, Ruisheng Zheng, Hongjian Shi, Yuan Yuan, Xiuzhen Cheng, Dongxiao Yu	2025-12-31	Download	With the development of large-scale models, traditional distributed bilevel optimization algorithms cannot be applied directly in low-resource clients.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
CTMap: LLM-Enabled Connectivity-Aware Path Planning in Millimeter-Wave Digital Twin Networks	Md Salik Parwez, Sai Teja Srivillibhutturu, Sai Venkat Reddy Kopparthi, Asfiya Misba, Debashri Roy, Habeeb Olufowobi, Charles Kim	2025-12-31	Download	In this paper, we present CTMAP, a large language model (LLM) empowered digital twin framework for connectivity-aware route navigation in millimeter-wave (mmWave) wireless networks.
Delay-Tolerant Networking for Tsunami Evacuation on the Small Island of Hachijojima: A Study of Epidemic and Prophet Routing	Keiya Kawano, Milena Radenkovic	2025-12-31	Download	Tsunami disasters pose a serious and recurring threat to coastal and island communities. When a large earthquake occurs, people are forced to make evacuation decisions under extreme time pressure, oft...
Reliable and Resilient Collective Communication Library for LLM Training and Serving	Wei Wang, Nengneng Yu, Sixian Xiong, Zaoxing Liu	2025-12-31	Download	Modern ML training and inference now span tens to tens of thousands of GPUs, where network faults can waste 10-15% of GPU hours due to slow recovery.
Sidelink Positioning: Standardization Advancements, Challenges and Opportunities	Yuan Gao, Guangjin Pan, Zhiyong Zhong, Zhengyu Jin, Yichen Hu, Yifei Jin, Shugong Xu	2025-12-31	Download	With the integration of cellular networks in vertical industries that demand precise location information, such as vehicle-to-everything (V2X), public safety, and Industrial Internet of Things (IIoT),...
Auction-Driven Spectrum Allocation With AutoEncoder-Based Compression in Rural Wireless Networks: A Novel Framework for Reliable Telemedicine	Nadjemat El Houda Issaad, Ismail Lotfi, Mohamed Senouci, Zekri Lougmiri	2025-12-31	Download	Rural healthcare faces numerous challenges, including limited access to specialized medical services and diagnostic equipment, which delays patient care.
Analyzing Communication Predictability in LLM Training	Wenxue Li, Xiangzhou Liu, Yuxuan Li, Yilun Jin, Zhenghang Ren, Xudong Liao, Han Tian, Bo Ren, Zhizhen Zhong, Guyue Liu, Ying Zhang, Kai Chen	2025-12-31	Download	Effective communication is essential in distributed training, with predictability being one of its most significant characteristics. However, existing studies primarily focus on exploiting predictabil...
Hierarchical Online Optimization Approach for IRS-enabled Low-altitude MEC in Vehicular Networks	Yixian Wang, Geng Sun, Zemin Sun, Jiacheng Wang, Changyuan Zhao, Daxin Tian, Dusit Niyato, Shiwen Mao	2025-12-31	Download	In this paper, we propose an intelligent reflecting surface (IRS)-enabled low-altitude multi-access edge computing (MEC) architecture, where an aerial MEC server cooperates with a terrestrial MEC serv...
Chat-Driven Optimal Management for Virtual Network Services	Yuya Miyaoka, Masaki Inoue, Kengo Urata, Shigeaki Harada	2025-12-31	Download	This paper proposes a chat-driven network management framework that integrates natural language processing (NLP) with optimization-based virtual network allocation, enabling intuitive and reliable rec...

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Vulcan: Instance-Optimal Systems Heuristics Through LLM-Driven Search	Rohit Dwivedula, Divyanshu Saxena, Sujay Yadalam, Daehyeok Kim, Aditya Akella	2025-12-31	Download	Resource-management tasks in modern operating and distributed systems continue to rely primarily on hand-designed heuristics for tasks such as scheduling, caching, or active queue management.
Towards Fully-fledged GPU Multitasking via Proactive Memory Scheduling	Weihang Shen, Yinqiu Chen, Rong Chen, Haibo Chen	2025-12-31	Download	The limited HBM capacity has become the primary bottleneck for hosting an increasing number of larger-scale GPU tasks. While demand paging extends capacity via host DRAM, it incurs up to 78x slowdown ...

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
A Magnified View into Heterogeneous-ISA Thread Migration Performance without State Transformation	Nikolaos Mavrogeorgis, Christos Vasiladiotis, Pei Mu, Amir Khordadi, Björn Franke, Antonio Barbalace	2025-12-31	Download	Heterogeneous-ISA processor designs have attracted considerable research interest. However, unlike their homogeneous-ISA counterparts, explicit software support for bridging ISA heterogeneity is requi...