
2025-12-31

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Democratizing Electronic-Photonic AI Systems: An Open-Source AI-Infused Cross-Layer Co-Design and Design Automation Toolflow | Hongjian Zhou, Ziang Yin, Jiaqi Gu | 2025-12-31 | Download | Photonics is becoming a cornerstone technology for high-performance AI systems and scientific computing, offering unparalleled speed, parallelism, and energy efficiency. |
| Toward Large-Scale Photonics-Empowered AI Systems: From Physical Design Automation to System-Algorithm Co-Exploration | Ziang Yin, Hongjian Zhou, Nicholas Gangi, Meng Zhang, Jeff Zhang, Zhaoran Rena Huang, Jiaqi Gu | 2025-12-31 | Download | In this work, we identify three considerations that are essential for realizing practical photonic AI systems at scale: (1) dynamic tensor operation support for modern models rather than only weight-s... |
| Advances in Agentic AI: Back to the Future | Sergio Alvarez-Telena, Marta Diez-Fernandez | 2025-12-31 | Download | In light of the recent convergence between Agentic AI and our field of Algorithmization, this paper seeks to restore conceptual clarity and provide a structured analytical framework for an increasingl... |
| FPGA Co-Design for Efficient N:M Sparse and Quantized Model Inference | Fen-Yu Hsieh, Yun-Chang Teng, Ding-Yong Hong, Jan-Jan Wu | 2025-12-31 | Download | Large language models (LLMs) have demonstrated remarkable performance across a wide range of language processing tasks. However, this success comes at the cost of substantial computation and memory re... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Vulcan: Instance-Optimal Systems Heuristics Through LLM-Driven Search | Rohit Dwivedula, Divyanshu Saxena, Sujay Yadalam, Daehyeok Kim, Aditya Akella | 2025-12-31 | Download | Resource-management tasks in modern operating and distributed systems continue to rely primarily on hand-designed heuristics for tasks such as scheduling, caching, or active queue management. |
| Reliable and Resilient Collective Communication Library for LLM Training and Serving | Wei Wang, Nengneng Yu, Sixian Xiong, Zaoxing Liu | 2025-12-31 | Download | Modern ML training and inference now span tens to tens of thousands of GPUs, where network faults can waste 10-15% of GPU hours due to slow recovery. |
| AI-Driven Cloud Resource Optimization for Multi-Cluster Environments | Vinoth Punniyamoorthy, Akash Kumar Agarwal, Bikesh Kumar, Abhirup Mazumder, Kabilan Kannan, Sumit Saha | 2025-12-31 | Download | Modern cloud-native systems increasingly rely on multi-cluster deployments to support scalability, resilience, and geographic distribution. However, existing resource management approaches remain larg... |
| Adaptive Resource Orchestration for Distributed Quantum Computing Systems | Kuan-Cheng Chen, Felix Burt, Nitish K. Panigrahy, Kin K. Leung | 2025-12-31 | Download | Scaling quantum computing beyond a single device requires networking many quantum processing units (QPUs) into a coherent quantum-HPC system. We propose the Modular Entanglement Hub (ModEn-Hub) archit... |
| Distributed Bilevel Optimization with Dual Pruning for Resource-limited Clients | Mingyi Li, Xiao Zhang, Ruisheng Zheng, Hongjian Shi, Yuan Yuan, Xiuzhen Cheng, Dongxiao Yu | 2025-12-31 | Download | With the development of large-scale models, traditional distributed bilevel optimization algorithms cannot be applied directly in low-resource clients. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| CTMap: LLM-Enabled Connectivity-Aware Path Planning in Millimeter-Wave Digital Twin Networks | Md Salik Parwez, Sai Teja Srivillibhutturu, Sai Venkat Reddy Kopparthi, Asfiya Misba, Debashri Roy, Habeeb Olufowobi, Charles Kim | 2025-12-31 | Download | In this paper, we present CTMAP, a large language model (LLM) empowered digital twin framework for connectivity-aware route navigation in millimeter-wave (mmWave) wireless networks. |
| Delay-Tolerant Networking for Tsunami Evacuation on the Small Island of Hachijojima: A Study of Epidemic and Prophet Routing | Keiya Kawano, Milena Radenkovic | 2025-12-31 | Download | Tsunami disasters pose a serious and recurring threat to coastal and island communities. When a large earthquake occurs, people are forced to make evacuation decisions under extreme time pressure, oft... |
| Reliable and Resilient Collective Communication Library for LLM Training and Serving | Wei Wang, Nengneng Yu, Sixian Xiong, Zaoxing Liu | 2025-12-31 | Download | Modern ML training and inference now span tens to tens of thousands of GPUs, where network faults can waste 10-15% of GPU hours due to slow recovery. |
| Sidelink Positioning: Standardization Advancements, Challenges and Opportunities | Yuan Gao, Guangjin Pan, Zhiyong Zhong, Zhengyu Jin, Yichen Hu, Yifei Jin, Shugong Xu | 2025-12-31 | Download | With the integration of cellular networks in vertical industries that demand precise location information, such as vehicle-to-everything (V2X), public safety, and Industrial Internet of Things (IIoT),... |
| Auction-Driven Spectrum Allocation With AutoEncoder-Based Compression in Rural Wireless Networks: A Novel Framework for Reliable Telemedicine | Nadjemat El Houda Issaad, Ismail Lotfi, Mohamed Senouci, Zekri Lougmiri | 2025-12-31 | Download | Rural healthcare faces numerous challenges, including limited access to specialized medical services and diagnostic equipment, which delays patient care. |
| Analyzing Communication Predictability in LLM Training | Wenxue Li, Xiangzhou Liu, Yuxuan Li, Yilun Jin, Zhenghang Ren, Xudong Liao, Han Tian, Bo Ren, Zhizhen Zhong, Guyue Liu, Ying Zhang, Kai Chen | 2025-12-31 | Download | Effective communication is essential in distributed training, with predictability being one of its most significant characteristics. However, existing studies primarily focus on exploiting predictabil... |
| Hierarchical Online Optimization Approach for IRS-enabled Low-altitude MEC in Vehicular Networks | Yixian Wang, Geng Sun, Zemin Sun, Jiacheng Wang, Changyuan Zhao, Daxin Tian, Dusit Niyato, Shiwen Mao | 2025-12-31 | Download | In this paper, we propose an intelligent reflecting surface (IRS)-enabled low-altitude multi-access edge computing (MEC) architecture, where an aerial MEC server cooperates with a terrestrial MEC serv... |
| Chat-Driven Optimal Management for Virtual Network Services | Yuya Miyaoka, Masaki Inoue, Kengo Urata, Shigeaki Harada | 2025-12-31 | Download | This paper proposes a chat-driven network management framework that integrates natural language processing (NLP) with optimization-based virtual network allocation, enabling intuitive and reliable rec... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Vulcan: Instance-Optimal Systems Heuristics Through LLM-Driven Search | Rohit Dwivedula, Divyanshu Saxena, Sujay Yadalam, Daehyeok Kim, Aditya Akella | 2025-12-31 | Download | Resource-management tasks in modern operating and distributed systems continue to rely primarily on hand-designed heuristics for tasks such as scheduling, caching, or active queue management. |
| Towards Fully-fledged GPU Multitasking via Proactive Memory Scheduling | Weihang Shen, Yinqiu Chen, Rong Chen, Haibo Chen | 2025-12-31 | Download | The limited HBM capacity has become the primary bottleneck for hosting an increasing number of larger-scale GPU tasks. While demand paging extends capacity via host DRAM, it incurs up to 78x slowdown ... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Magnified View into Heterogeneous-ISA Thread Migration Performance without State Transformation | Nikolaos Mavrogeorgis, Christos Vasiladiotis, Pei Mu, Amir Khordadi, Björn Franke, Antonio Barbalace | 2025-12-31 | Download | Heterogeneous-ISA processor designs have attracted considerable research interest. However, unlike their homogeneous-ISA counterparts, explicit software support for bridging ISA heterogeneity is requi... |
