Skip to content

2025-10-30

cs.AR - Architecture

标题作者发布日期PDF摘要
Practical Timing Closure in FPGA and ASIC Designs: Methods, Challenges, and Case StudiesMostafa Darvishi2025-10-30下载This paper presents an in-depth analysis of timing closure challenges and constraints in Field Programmable Gate Arrays (FPGAs) and Application Specific Integrated Circuits (ASICs).
Choreographer: A Full-System Framework for Fine-Grained Tasks in Cache HierarchiesHoa Nguyen, Pongstorn Maidee, Jason Lowe-Power, Alireza Kaviani2025-10-30下载In this paper, we introduce Choreographer, a simulation framework that enables a holistic system-level evaluation of fine-grained accelerators designed for latency-sensitive tasks.
Wireless Sensor Networks as Parallel and Distributed Hardware Platform for Artificial Neural NetworksGursel Serpen2025-10-30下载We are proposing fully parallel and maximally distributed hardware realization of a generic neuro-computing system. More specifically, the proposal relates to the wireless sensor networks technology t...
MIREDO: MIP-Driven Resource-Efficient Dataflow Optimization for Computing-in-Memory AcceleratorXiaolin He, Cenlin Duan, Yingjie Qi, Xiao Ma, Jianlei Yang2025-10-30下载Computing-in-Memory (CIM) architectures have emerged as a promising solution for accelerating Deep Neural Networks (DNNs) by mitigating data movement bottlenecks.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
A Cloud-Based Spatio-Temporal GNN-Transformer Hybrid Model for Traffic Flow Forecasting with External Feature IntegrationZhuo Zheng, Lingran Meng, Ziyu Lin2025-10-30下载Accurate traffic flow forecasting is essential for the development of intelligent transportation systems (ITS), supporting tasks such as traffic signal optimization, congestion management, and route p...
Boosting performance: Gradient Clock Synchronisation with two-way measured linksSophie Wenning2025-10-30下载This master thesis extends the formal model of the GCS algorithm as presented by (Fan and Lynch 2004, 325), (Lenzen, Locher and Wattenhofer 2008, 510) and (Függer et al.
Mind the Gap: Revealing Inconsistencies Across Heterogeneous AI AcceleratorsElliott Wen, Sean Ma, Ewan Tempero, Jens Dietrich, Daniel Luo, Jiaxing Shen, Kaiqi Zhao, Bruce Sham, Yousong Song, Jiayi Hua, Jia Hong2025-10-30下载While NVIDIA remains the dominant provider of AI accelerators within cloud data center, emerging vendors such as AMD, Intel, Mac, and Huawei offer cost-effective alternatives with claims of compatibil...
FlowMesh: A Service Fabric for Composable LLM WorkflowsJunyi Shen, Noppanat Wadlom, Lingfeng Zhou, Dequan Wang, Xu Miao, Lei Fang, Yao Lu2025-10-30下载AI deployment increasingly resembles a pipeline of data transformation, fine-tuning, and agent interactions rather than a monolithic LLM job; recent examples include RLHF/RLAIF training and agentic wo...
ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE InferenceZixu Shen, Kexin Chu, Yifan Zhang, Dawei Xiang, Runxin Wu, Wei Zhang2025-10-30下载The expansion of large language models is increasingly limited by the constrained memory capacity of modern GPUs. To mitigate this, Mixture-of-Experts (MoE) architectures activate only a small portion...
Non-Convex Over-the-Air Heterogeneous Federated Learning: A Bias-Variance Trade-offMuhammad Faraz Ul Abrar, Nicolò Michelusi2025-10-30下载Over-the-air (OTA) federated learning (FL) has been well recognized as a scalable paradigm that exploits the waveform superposition of the wireless multiple-access channel to aggregate model updates i...
An All-Reduce Compatible Top-K Compressor for Communication-Efficient Distributed LearningChuyan Chen, Chenyang Ma, Zhangxin Li, Yutong He, Yanjie Dong, Kun Yuan2025-10-30下载Communication remains a central bottleneck in large-scale distributed machine learning, and gradient sparsification has emerged as a promising strategy to alleviate this challenge.
Wireless Sensor Networks as Parallel and Distributed Hardware Platform for Artificial Neural NetworksGursel Serpen2025-10-30下载We are proposing fully parallel and maximally distributed hardware realization of a generic neuro-computing system. More specifically, the proposal relates to the wireless sensor networks technology t...
ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning SystemsQiaoling Chen, Zijun Liu, Peng Sun, Shenggui Li, Guoteng Wang, Ziming Liu, Yonggang Wen, Siyuan Feng, Tianwei Zhang2025-10-30下载Adapting large language models (LLMs) via reinforcement learning (RL) is often bottlenecked by the generation stage, which can consume over 75% of the training time.
Environmental Impact of CI/CD PipelinesNuno Saavedra, Alexandra Mendes, João F. Ferreira2025-10-30下载CI/CD pipelines are widely used in software development, yet their environmental impact, particularly carbon and water footprints (CWF), remains largely unknown to developers, as CI service providers ...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
TheaterQ: A Qdisc for Dynamic Network EmulationMartin Ottens, Kai-Steffen Hielscher, Reinhard German2025-10-30下载TheaterQ is a Linux qdisc designed for dynamic network emulation, addressing the limitations of static parameters in traditional tools like NetEm.
Trace-driven Path Emulation of Satellite Networks using HypatiaMartin Ottens, Kai-Steffen Hielscher, Reinhard German2025-10-30下载The increasing prevalence LEO satellite mega-constellations for global Internet coverage requires new approaches to evaluate the behavior of existing Internet protocols and applications.
Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert CommunicationsChuang Zhang, Geng Sun, Jiahui Li, Jiacheng Wang, Qingqing Wu, Dusit Niyato, Shiwen Mao, Tony Q. S. Quek2025-10-30下载The proliferation of Internet of Things (IoT) networks has created an urgent need for sustainable energy solutions, particularly for the battery-constrained spatially distributed IoT nodes.
Quantum Gated Recurrent GAN with Gaussian Uncertainty for Network Anomaly DetectionWajdi Hammami, Soumaya Cherkaoui, Jean-Frederic Laprade, Ola Ahmad, Shengrui Wang2025-10-30下载Anomaly detection in time-series data is a critical challenge with significant implications for network security. Recent quantum machine learning approaches, such as quantum kernel methods and variati...
Wireless Memory Approximation for Energy-efficient Task-specific IoT Data RetrievalJunya Shiraishi, Shashi Raj Pandey, Israel Leyva-Mayorga, Petar Popovski2025-10-30下载The use of Dynamic Random Access Memory (DRAM) for storing Machine Learning (ML) models plays a critical role in accelerating ML inference tasks in the next generation of communication systems.
Joint Computing Resource Allocation and Task Offloading in Vehicular Fog Computing Systems Under Asymmetric InformationGeng Sun, Siyi Chen, Zemin Sun, Long He, Jiacheng Wang, Dusit Niyato, Zhu Han, Dong In Kim2025-10-30下载Vehicular fog computing (VFC) has emerged as a promising paradigm, which leverages the idle computational resources of nearby fog vehicles (FVs) to complement the computing capabilities of conventiona...
From req/res to pub/sub: Exploring Media over QUIC Transport for DNSMathis Engelbart, Mike Kosek, Lars Eggert, Jörg Ott2025-10-30下载The DNS is a key component of the Internet. Originally designed to facilitate the resolution of host names to IP addresses, its scope has continuously expanded over the years, today covering use cases...
Denoising Refinement Diffusion Models for Simultaneous Generation of Multi-scale Mobile Network TrafficXiaoqian Qi, Haoye Chai, Sichang Liu, Lei Yue, Raoyuan Pan, Yue Wang, Yong Li2025-10-30下载The planning, management, and resource scheduling of cellular mobile networks require joint estimation of mobile traffic across different layers and nodes.
FGGM: Formal Grey-box Gradient Method for Attacking DRL-based MU-MIMO SchedulerThanh Le, Hai Duong, Yusheng Ji, ThanhVu Nguyen, John C. S. Lui2025-10-30下载In 5G mobile communication systems, MU-MIMO has been applied to enhance spectral efficiency and support high data rates. To maximize spectral efficiency while providing fairness among users, the base ...
Symmetry-Driven Asynchronous Forwarding for Reliable Distributed Coordination in Toroidal NetworksShenshen Luan, Yumo Tian, Xinyu Zhang, Qingwen Zhang, Tianheng Wang, Yan Yang, Shuguo Xie2025-10-30下载The proliferation of large-scale distributed systems, such as satellite constellations and high-performance computing clusters, demands robust communication primitives that maintain coordination under...
Performance Analysis of Dynamic Equilibria in Joint Path Selection and Congestion Control in Path-Aware NetworksSina Keshvadi2025-10-30下载Path-aware networking (PAN) architectures, such as SCION and emerging LEO constellations, expose tens to hundreds of verifiable paths to endpoints.

cs.PF - Performance

标题作者发布日期PDF摘要
ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE InferenceZixu Shen, Kexin Chu, Yifan Zhang, Dawei Xiang, Runxin Wu, Wei Zhang2025-10-30下载The expansion of large language models is increasingly limited by the constrained memory capacity of modern GPUs. To mitigate this, Mixture-of-Experts (MoE) architectures activate only a small portion...
Approximating Heavy-Tailed Distributions with a Mixture of Bernstein Phase-Type and Hyperexponential ModelsAbdelhakim Ziani, András Horváth, Paolo Ballarini2025-10-30下载Heavy-tailed distributions, prevalent in a lot of real-world applications such as finance, telecommunications, queuing theory, and natural language processing, are challenging to model accurately owin...

基于 VitePress 构建