2025-07-20

cs.AR - Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Morphlux: Transforming Torus Fabrics for Efficient Multi-tenant ML | Abhishek Vijaya Kumar, Eric Ding, Arjun Devraj, Darius Bunandar, Rachee Singh | 2025-07-20 | We develop Morphlux, a server-scale programmable photonic fabric to interconnect accelerators within servers. We show that augmenting state-of-the-art torus-based ML data-centers with Morphlux can imp... |
| Piano: A Multi-Constraint Pin Assignment-Aware Floorplanner | Zhexuan Xu, Kexin Zhou, Jie Wang, Zijie Geng, Siyuan Xu, Shixiong Kai, Mingxuan Yuan, Feng Wu | 2025-07-20 | Floorplanning is a critical step in VLSI physical design, increasingly complicated by modern constraints such as fixed-outline requirements, whitespace removal, and the presence of pre-placed modules. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Dynatune: Dynamic Tuning of Raft Election Parameters Using Network Measurement | Kohya Shiozaki, Junya Nakamura | 2025-07-20 | Raft is a leader-based consensus algorithm that implements State Machine Replication (SMR), which replicates the service state across multiple servers to enhance fault tolerance. |
| AMPED: Accelerating MTTKRP for Billion-Scale Sparse Tensor Decomposition on Multiple GPUs | Sasindu Wijeratne, Rajgopal Kannan, Viktor Prasanna | 2025-07-20 | Matricized Tensor Times Khatri-Rao Product (MTTKRP) is the computational bottleneck in sparse tensor decomposition. As real-world sparse tensors grow to billions of nonzeros, they increasingly demand ... |
| Byzantine-Robust Decentralized Coordination of LLM Agents | Yongrae Jo, Chanik Park | 2025-07-20 | Collaboration among multiple large language model (LLM) agents is a promising approach to overcome inherent limitations of single-agent systems, such as hallucinations and single points of failure. |
| Mayura: Exploiting Similarities in Motifs for Temporal Co-Mining | Sanjay Sri Vallabh Singapuram, Ronald Dreslinski, Nishil Talati | 2025-07-20 | Temporal graphs serve as a critical foundation for modeling evolving interactions in domains ranging from financial networks to social media. Mining temporal motifs is essential for applications such ... |
| ACME: Adaptive Customization of Large Models via Distributed Systems | Ziming Dai, Chao Qiu, Fei Gao, Yunfeng Zhao, Xiaofei Wang | 2025-07-20 | Pre-trained Transformer-based large models have revolutionized personal virtual assistants, but their deployment in cloud environments faces challenges related to data privacy and response latency. |
| MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation | Zhongzhen Wen, Yinghui Zhang, Zhong Li, Zhongxin Liu, Linna Xie, Tian Zhang | 2025-07-20 | The automatic generation of deep learning (DL) kernels using large language models (LLMs) has emerged as a promising approach to reduce the manual effort and hardware-specific expertise required for w... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Quantum Machine Learning for Secure Cooperative Multi-Layer Edge AI with Proportional Fairness | Thai T. Vu, John Le | 2025-07-20 | This paper proposes a communication-efficient, event-triggered inference framework for cooperative edge AI systems comprising multiple user devices and edge servers. |
| Morphlux: Transforming Torus Fabrics for Efficient Multi-tenant ML | Abhishek Vijaya Kumar, Eric Ding, Arjun Devraj, Darius Bunandar, Rachee Singh | 2025-07-20 | We develop Morphlux, a server-scale programmable photonic fabric to interconnect accelerators within servers. We show that augmenting state-of-the-art torus-based ML data-centers with Morphlux can imp... |
| FENIX: Enabling In-Network DNN Inference with FPGA-Enhanced Programmable Switches | Xiangyu Gao, Tong Li, Yinchao Zhang, Ziqiang Wang, Xiangsheng Zeng, Su Yao, Ke Xu | 2025-07-20 | Machine learning (ML) is increasingly used in network data planes for advanced traffic analysis, but existing solutions (such as FlowLens, N3IC, BoS) still struggle to simultaneously achieve low laten... |
| Tidal-Like Concept Drift in RIS-Covered Buildings: When Programmable Wireless Environments Meet Human Behaviors | Zi-Yang Wu, Muhammad Ismail, Jiliang Zhang, Jie Zhang | 2025-07-20 | Indoor mobile networks handle the majority of data traffic, with their performance limited by building materials and structures. However, building designs have historically not prioritized wireless pe... |
| Data-Plane Telemetry to Mitigate Long-Distance BGP Hijacks | Satadal Sengupta, Hyojoon Kim, Daniel Jubas, Maria Apostolaki, Jennifer Rexford | 2025-07-20 | Poor security of Internet routing enables adversaries to divert user data through unintended infrastructures (hijack). Of particular concern -- and the focus of this paper -- are cases where attackers... |

cs.PF - Performance

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices | Saeid Ghafouri, Mohsen Fayyaz, Xiangchen Li, Deepu John, Bo Ji, Dimitrios Nikolopoulos, Hans Vandierendonck | 2025-07-20 | Real-time multi-label video classification on embedded devices is constrained by limited compute and energy budgets. Yet, video streams exhibit structural properties such as label sparsity, temporal c... |
| Mayura: Exploiting Similarities in Motifs for Temporal Co-Mining | Sanjay Sri Vallabh Singapuram, Ronald Dreslinski, Nishil Talati | 2025-07-20 | Temporal graphs serve as a critical foundation for modeling evolving interactions in domains ranging from financial networks to social media. Mining temporal motifs is essential for applications such ... |
| MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation | Zhongzhen Wen, Yinghui Zhang, Zhong Li, Zhongxin Liu, Linna Xie, Tian Zhang | 2025-07-20 | The automatic generation of deep learning (DL) kernels using large language models (LLMs) has emerged as a promising approach to reduce the manual effort and hardware-specific expertise required for w... |
