2025-07-20

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
Morphlux: Transforming Torus Fabrics for Efficient Multi-tenant ML	Abhishek Vijaya Kumar, Eric Ding, Arjun Devraj, Darius Bunandar, Rachee Singh	2025-07-20	Download	We develop Morphlux, a server-scale programmable photonic fabric to interconnect accelerators within servers. We show that augmenting state-of-the-art torus-based ML data-centers with Morphlux can imp...
Piano: A Multi-Constraint Pin Assignment-Aware Floorplanner	Zhexuan Xu, Kexin Zhou, Jie Wang, Zijie Geng, Siyuan Xu, Shixiong Kai, Mingxuan Yuan, Feng Wu	2025-07-20	Download	Floorplanning is a critical step in VLSI physical design, increasingly complicated by modern constraints such as fixed-outline requirements, whitespace removal, and the presence of pre-placed modules.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Dynatune: Dynamic Tuning of Raft Election Parameters Using Network Measurement	Kohya Shiozaki, Junya Nakamura	2025-07-20	Download	Raft is a leader-based consensus algorithm that implements State Machine Replication (SMR), which replicates the service state across multiple servers to enhance fault tolerance.
AMPED: Accelerating MTTKRP for Billion-Scale Sparse Tensor Decomposition on Multiple GPUs	Sasindu Wijeratne, Rajgopal Kannan, Viktor Prasanna	2025-07-20	Download	Matricized Tensor Times Khatri-Rao Product (MTTKRP) is the computational bottleneck in sparse tensor decomposition. As real-world sparse tensors grow to billions of nonzeros, they increasingly demand ...
Byzantine-Robust Decentralized Coordination of LLM Agents	Yongrae Jo, Chanik Park	2025-07-20	Download	Collaboration among multiple large language model (LLM) agents is a promising approach to overcome inherent limitations of single-agent systems, such as hallucinations and single points of failure.
Mayura: Exploiting Similarities in Motifs for Temporal Co-Mining	Sanjay Sri Vallabh Singapuram, Ronald Dreslinski, Nishil Talati	2025-07-20	Download	Temporal graphs serve as a critical foundation for modeling evolving interactions in domains ranging from financial networks to social media. Mining temporal motifs is essential for applications such ...
ACME: Adaptive Customization of Large Models via Distributed Systems	Ziming Dai, Chao Qiu, Fei Gao, Yunfeng Zhao, Xiaofei Wang	2025-07-20	Download	Pre-trained Transformer-based large models have revolutionized personal virtual assistants, but their deployment in cloud environments faces challenges related to data privacy and response latency.
MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation	Zhongzhen Wen, Yinghui Zhang, Zhong Li, Zhongxin Liu, Linna Xie, Tian Zhang	2025-07-20	Download	The automatic generation of deep learning (DL) kernels using large language models (LLMs) has emerged as a promising approach to reduce the manual effort and hardware-specific expertise required for w...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Quantum Machine Learning for Secure Cooperative Multi-Layer Edge AI with Proportional Fairness	Thai T. Vu, John Le	2025-07-20	Download	This paper proposes a communication-efficient, event-triggered inference framework for cooperative edge AI systems comprising multiple user devices and edge servers.
Morphlux: Transforming Torus Fabrics for Efficient Multi-tenant ML	Abhishek Vijaya Kumar, Eric Ding, Arjun Devraj, Darius Bunandar, Rachee Singh	2025-07-20	Download	We develop Morphlux, a server-scale programmable photonic fabric to interconnect accelerators within servers. We show that augmenting state-of-the-art torus-based ML data-centers with Morphlux can imp...
FENIX: Enabling In-Network DNN Inference with FPGA-Enhanced Programmable Switches	Xiangyu Gao, Tong Li, Yinchao Zhang, Ziqiang Wang, Xiangsheng Zeng, Su Yao, Ke Xu	2025-07-20	Download	Machine learning (ML) is increasingly used in network data planes for advanced traffic analysis, but existing solutions (such as FlowLens, N3IC, BoS) still struggle to simultaneously achieve low laten...
Tidal-Like Concept Drift in RIS-Covered Buildings: When Programmable Wireless Environments Meet Human Behaviors	Zi-Yang Wu, Muhammad Ismail, Jiliang Zhang, Jie Zhang	2025-07-20	Download	Indoor mobile networks handle the majority of data traffic, with their performance limited by building materials and structures. However, building designs have historically not prioritized wireless pe...
Data-Plane Telemetry to Mitigate Long-Distance BGP Hijacks	Satadal Sengupta, Hyojoon Kim, Daniel Jubas, Maria Apostolaki, Jennifer Rexford	2025-07-20	Download	Poor security of Internet routing enables adversaries to divert user data through unintended infrastructures (hijack). Of particular concern -- and the focus of this paper -- are cases where attackers...

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices	Saeid Ghafouri, Mohsen Fayyaz, Xiangchen Li, Deepu John, Bo Ji, Dimitrios Nikolopoulos, Hans Vandierendonck	2025-07-20	Download	Real-time multi-label video classification on embedded devices is constrained by limited compute and energy budgets. Yet, video streams exhibit structural properties such as label sparsity, temporal c...
Mayura: Exploiting Similarities in Motifs for Temporal Co-Mining	Sanjay Sri Vallabh Singapuram, Ronald Dreslinski, Nishil Talati	2025-07-20	Download	Temporal graphs serve as a critical foundation for modeling evolving interactions in domains ranging from financial networks to social media. Mining temporal motifs is essential for applications such ...
MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation	Zhongzhen Wen, Yinghui Zhang, Zhong Li, Zhongxin Liu, Linna Xie, Tian Zhang	2025-07-20	Download	The automatic generation of deep learning (DL) kernels using large language models (LLMs) has emerged as a promising approach to reduce the manual effort and hardware-specific expertise required for w...