
2025-05-18

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Energy-Aware Deep Learning on Resource-Constrained Hardware | Josh Millar, Hamed Haddadi, Anil Madhavapeddy | 2025-05-18 | Download | The use of deep learning (DL) on Internet of Things (IoT) and mobile devices offers numerous advantages over cloud-based processing. However, such devices face substantial energy constraints to prolon... |
| SpikeX: Exploring Accelerator Architecture and Network-Hardware Co-Optimization for Sparse Spiking Neural Networks | Boxun Xu, Richard Boone, Peng Li | 2025-05-18 | Download | Spiking Neural Networks (SNNs) are promising biologically plausible models of computation which utilize a spiking binary activation function similar to that of biological neurons. |
| LLM-DSE: Searching Accelerator Parameters with LLM Agents | Hanyu Wang, Xinrui Wu, Zijian Ding, Su Zheng, Chengyue Wang, Neha Prakriya, Tony Nowatzki, Yizhou Sun, Jason Cong | 2025-05-18 | Download | Even though high-level synthesis (HLS) tools mitigate the challenges of programming domain-specific accelerators (DSAs) by raising the abstraction level, optimizing hardware directive parameters remai... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment | Wenqiao Zhu, Ji Liu, Lulu Wang, Jun Wu, Yulun Zhang | 2025-05-18 | Download | Direct Preference Optimization (DPO) is broadly utilized for aligning Large Language Models (LLMs) with human values because of its flexibility. |
| ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates | Tingfeng Lan, Yusen Wu, Bin Ma, Zhaoyuan Su, Rui Yang, Tekin Bicer, Masahiro Tanaka, Olatunji Ruwase, Dong Li, Yue Cheng | 2025-05-18 | Download | Fine-tuning large language models (LLMs) often exceeds GPU memory limits, prompting systems to offload model states to CPU memory. However, existing offloaded training frameworks like ZeRO-Offload tre... |
| Investigating Timing-Based Information Leakage in Data Flow-Driven Real-Time Systems | Mohammad Fakhruddin Babar, Zain A. H. Hammadeh, Mohammad Hamad, Monowar Hasan | 2025-05-18 | Download | Leaking information about the execution behavior of critical real-time tasks may lead to serious consequences, including violations of temporal constraints and even severe failures. |
| Workflow-Driven Modeling for the Compute Continuum: An Optimization Approach to Automated System and Workload Scheduling | Aasish Kumar Sharma, Christian Boehme, Patrick Gelß, Ramin Yahyapour, Julian Kunkel | 2025-05-18 | Download | The convergence of IoT, Edge, Cloud, and HPC technologies creates a compute continuum that merges cloud scalability and flexibility with HPC's computational power and specialized optimizations. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Unleashing Automated Congestion Control Customization in the Wild | Amit Cohen, Lev Gloukhenki, Ravid Hadar, Eden Itah, Yehuda Shvut, Michael Schapira | 2025-05-18 | Download | Congestion control (CC) crucially impacts user experience across Internet services like streaming, gaming, AR/VR, and connected cars. Traditionally, CC algorithm design seeks universal control rules t... |
| Modeling and Performance Analysis of IoT-over-LEO Satellite Systems under Realistic Operational Constraints: A Stochastic Geometry Approach | Wen-Yu Dong, Shaoshi Yang, Ping Zhang, Sheng Chen | 2025-05-18 | Download | Current theoretical studies on IoT-over-LEO satellite systems often rely on unrealistic assumptions, such as infinite terrestrial areas and omnidirectional satellite coverage, leaving significant gaps... |
| LAMeTA: Intent-Aware Agentic Network Optimization via a Large AI Model-Empowered Two-Stage Approach | Yinqiu Liu, Guangyuan Liu, Jiacheng Wang, Ruichen Zhang, Dusit Niyato, Geng Sun, Zehui Xiong, Zhu Han | 2025-05-18 | Download | Nowadays, Generative AI (GenAI) reshapes numerous domains by enabling machines to create content across modalities. As GenAI evolves into autonomous agents capable of reasoning, collaboration, and int... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Unleashing Automated Congestion Control Customization in the Wild | Amit Cohen, Lev Gloukhenki, Ravid Hadar, Eden Itah, Yehuda Shvut, Michael Schapira | 2025-05-18 | Download | Congestion control (CC) crucially impacts user experience across Internet services like streaming, gaming, AR/VR, and connected cars. Traditionally, CC algorithm design seeks universal control rules t... |
