2025-05-26

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Efficient Optimization Accelerator Framework for Multistate Ising Problems	Chirag Garg, Sayeef Salahuddin	2025-05-26	下载	Ising Machines are emerging hardware architectures that efficiently solve NP-Hard combinatorial optimization problems. Generally, combinatorial problems are transformed into quadratic unconstrained bi...
ReChisel: Effective Automatic Chisel Code Generation by LLM with Reflection	Juxin Niu, Xiangfeng Liu, Dan Niu, Xi Wang, Zhe Jiang, Nan Guan	2025-05-26	下载	Coding with hardware description languages (HDLs) such as Verilog is a time-intensive and laborious task. With the rapid advancement of large language models (LLMs), there is increasing interest in ap...
Enhancing Test Efficiency through Automated ATPG-Aware Lightweight Scan Instrumentation	Sudipta Paria, Md Rezoan Ferdous, Aritra Dasgupta, Atri Chatterjee, Swarup Bhunia	2025-05-26	下载	Scan-based Design-for-Testability (DFT) measures are prevalent in modern digital integrated circuits to achieve high test quality at low hardware cost.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Avoid Forgetting by Preserving Global Knowledge Gradients in Federated Learning with Non-IID Data	Abhijit Chunduru, Majid Morafah, Mahdi Morafah, Vishnu Pandi Chellapandi, Ang Li	2025-05-26	下载	The inevitable presence of data heterogeneity has made federated learning very challenging. There are numerous methods to deal with this issue, such as local regularization, better model fusion techni...
Efficient Optimization Accelerator Framework for Multistate Ising Problems	Chirag Garg, Sayeef Salahuddin	2025-05-26	下载	Ising Machines are emerging hardware architectures that efficiently solve NP-Hard combinatorial optimization problems. Generally, combinatorial problems are transformed into quadratic unconstrained bi...
Optimizing edge AI models on HPC systems with the edge in the loop	Marcel Aach, Cyril Blanc, Andreas Lintermann, Kurt De Grave	2025-05-26	下载	Artificial intelligence and machine learning models deployed on edge devices, e.g., for quality control in Additive Manufacturing (AM), are frequently small in size.
From Few to Many Faults: Optimal Adaptive Byzantine Agreement	Andrei Constantinescu, Marc Dufay, Anton Paramonov, Roger Wattenhofer	2025-05-26	下载	Achieving agreement among distributed parties is a fundamental task in modern systems, underpinning applications such as consensus in blockchains, coordination in cloud infrastructure, and fault toler...
Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat Models	Antti Koskela, Tejas Kulkarni	2025-05-26	下载	Achieving differential privacy (DP) guarantees in fully decentralized machine learning is challenging due to the absence of a central aggregator and varying trust assumptions among nodes.
Universal Workers: A Vision for Eliminating Cold Starts in Serverless Computing	Saman Akbari, Manfred Hauswirth	2025-05-26	下载	Serverless computing enables developers to deploy code without managing infrastructure, but suffers from cold start overhead when initializing new function instances.
DGRAG: Distributed Graph-based Retrieval-Augmented Generation in Edge-Cloud Systems	Wenqing Zhou, Yuxuan Yan, Qianqian Yang	2025-05-26	下载	Retrieval-Augmented Generation (RAG) improves factuality by grounding LLMs in external knowledge, yet conventional centralized RAG requires aggregating distributed data, raising privacy risks and incu...
Justin: Hybrid CPU/Memory Elastic Scaling for Distributed Stream Processing	Donatien Schmitz, Guillaume Rosinosky, Etienne Rivière	2025-05-26	下载	Distributed Stream Processing (DSP) engines analyze continuous data via queries expressed as a graph of operators. Auto-scalers adjust the number of parallel instances of these operators to support a ...
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments	Junming Liu, Yanting Gao, Siyuan Meng, Yifei Sun, Aoqi Wu, Yufei Jin, Yirong Chen, Ding Wang, Guosun Zeng	2025-05-26	下载	Federated Learning (FL) is a decentralized machine learning paradigm that enables clients to collaboratively train models while preserving data privacy.
Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs	Hao Kang, Qingru Zhang, Han Cai, Weiyuan Xu, Tushar Krishna, Yilun Du, Tsachy Weissman	2025-05-26	下载	Large language models (LLMs) have shown remarkable performance across diverse reasoning and generation tasks, and are increasingly deployed as agents in dynamic environments such as code generation an...
GPU acceleration of non-equilibrium Green's function calculation using OpenACC and CUDA FORTRAN	Jia Yin, Khaled Z. Ibrahim, Mauro Del Ben, Jack Deslippe, Yang-hao Chan, Chao Yang	2025-05-26	下载	The numerical solution of the Kadanoff-Baym nonlinear integro-differential equations, which yields the non-equilibrium Green's functions (NEGFs) of quantum many-body systems, poses significant computa...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Unleashing 5G Seamless Integration with TSN for Industry 5.0: Frame Forwarding and QoS Treatment	Oscar Adamuz-Hinojosa, Felix Delgado-Ferro, Jorge Navarro-Ortiz, Pablo Muñoz, Pablo Ameigeiras	2025-05-26	下载	Integrating Time-Sensitive Networking (TSN) and 5th Generation (5G) systems is key for providing wireless low-latency services in industry. Despite research efforts, challenges remain.
A Cost-efficient Credit-Based Shaper Deployment Framework for Time-Sensitive Networks	Santiago Torres-Borda, Ahlem Mifdaoui	2025-05-26	下载	Time-sensitive networks are designed to meet stringent Quality of Service (QoS) requirements for mixed-criticality traffic with diverse performance demands.
MetaSTNet: Multimodal Meta-learning for Cellular Traffic Conformal Prediction	Hui Ma, Kai Yang	2025-05-26	下载	Network traffic prediction techniques have attracted much attention since they are valuable for network congestion control and user experience improvement.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Avoid Forgetting by Preserving Global Knowledge Gradients in Federated Learning with Non-IID Data	Abhijit Chunduru, Majid Morafah, Mahdi Morafah, Vishnu Pandi Chellapandi, Ang Li	2025-05-26	下载	The inevitable presence of data heterogeneity has made federated learning very challenging. There are numerous methods to deal with this issue, such as local regularization, better model fusion techni...
Universal Workers: A Vision for Eliminating Cold Starts in Serverless Computing	Saman Akbari, Manfred Hauswirth	2025-05-26	下载	Serverless computing enables developers to deploy code without managing infrastructure, but suffers from cold start overhead when initializing new function instances.
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation	Dong Liu, Yanxuan Yu, Jiayi Zhang, Yifan Li, Ben Lengerich, Ying Nian Wu	2025-05-26	下载	Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due to their iterative structure and deep transformer stacks.