Skip to content

2025-05-26

cs.AR - Architecture

标题作者发布日期PDF摘要
Efficient Optimization Accelerator Framework for Multistate Ising ProblemsChirag Garg, Sayeef Salahuddin2025-05-26下载Ising Machines are emerging hardware architectures that efficiently solve NP-Hard combinatorial optimization problems. Generally, combinatorial problems are transformed into quadratic unconstrained bi...
ReChisel: Effective Automatic Chisel Code Generation by LLM with ReflectionJuxin Niu, Xiangfeng Liu, Dan Niu, Xi Wang, Zhe Jiang, Nan Guan2025-05-26下载Coding with hardware description languages (HDLs) such as Verilog is a time-intensive and laborious task. With the rapid advancement of large language models (LLMs), there is increasing interest in ap...
Enhancing Test Efficiency through Automated ATPG-Aware Lightweight Scan InstrumentationSudipta Paria, Md Rezoan Ferdous, Aritra Dasgupta, Atri Chatterjee, Swarup Bhunia2025-05-26下载Scan-based Design-for-Testability (DFT) measures are prevalent in modern digital integrated circuits to achieve high test quality at low hardware cost.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Avoid Forgetting by Preserving Global Knowledge Gradients in Federated Learning with Non-IID DataAbhijit Chunduru, Majid Morafah, Mahdi Morafah, Vishnu Pandi Chellapandi, Ang Li2025-05-26下载The inevitable presence of data heterogeneity has made federated learning very challenging. There are numerous methods to deal with this issue, such as local regularization, better model fusion techni...
Efficient Optimization Accelerator Framework for Multistate Ising ProblemsChirag Garg, Sayeef Salahuddin2025-05-26下载Ising Machines are emerging hardware architectures that efficiently solve NP-Hard combinatorial optimization problems. Generally, combinatorial problems are transformed into quadratic unconstrained bi...
Optimizing edge AI models on HPC systems with the edge in the loopMarcel Aach, Cyril Blanc, Andreas Lintermann, Kurt De Grave2025-05-26下载Artificial intelligence and machine learning models deployed on edge devices, e.g., for quality control in Additive Manufacturing (AM), are frequently small in size.
From Few to Many Faults: Optimal Adaptive Byzantine AgreementAndrei Constantinescu, Marc Dufay, Anton Paramonov, Roger Wattenhofer2025-05-26下载Achieving agreement among distributed parties is a fundamental task in modern systems, underpinning applications such as consensus in blockchains, coordination in cloud infrastructure, and fault toler...
Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat ModelsAntti Koskela, Tejas Kulkarni2025-05-26下载Achieving differential privacy (DP) guarantees in fully decentralized machine learning is challenging due to the absence of a central aggregator and varying trust assumptions among nodes.
Universal Workers: A Vision for Eliminating Cold Starts in Serverless ComputingSaman Akbari, Manfred Hauswirth2025-05-26下载Serverless computing enables developers to deploy code without managing infrastructure, but suffers from cold start overhead when initializing new function instances.
DGRAG: Distributed Graph-based Retrieval-Augmented Generation in Edge-Cloud SystemsWenqing Zhou, Yuxuan Yan, Qianqian Yang2025-05-26下载Retrieval-Augmented Generation (RAG) improves factuality by grounding LLMs in external knowledge, yet conventional centralized RAG requires aggregating distributed data, raising privacy risks and incu...
Justin: Hybrid CPU/Memory Elastic Scaling for Distributed Stream ProcessingDonatien Schmitz, Guillaume Rosinosky, Etienne Rivière2025-05-26下载Distributed Stream Processing (DSP) engines analyze continuous data via queries expressed as a graph of operators. Auto-scalers adjust the number of parallel instances of these operators to support a ...
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed EnvironmentsJunming Liu, Yanting Gao, Siyuan Meng, Yifei Sun, Aoqi Wu, Yufei Jin, Yirong Chen, Ding Wang, Guosun Zeng2025-05-26下载Federated Learning (FL) is a decentralized machine learning paradigm that enables clients to collaboratively train models while preserving data privacy.
Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMsHao Kang, Qingru Zhang, Han Cai, Weiyuan Xu, Tushar Krishna, Yilun Du, Tsachy Weissman2025-05-26下载Large language models (LLMs) have shown remarkable performance across diverse reasoning and generation tasks, and are increasingly deployed as agents in dynamic environments such as code generation an...
GPU acceleration of non-equilibrium Green's function calculation using OpenACC and CUDA FORTRANJia Yin, Khaled Z. Ibrahim, Mauro Del Ben, Jack Deslippe, Yang-hao Chan, Chao Yang2025-05-26下载The numerical solution of the Kadanoff-Baym nonlinear integro-differential equations, which yields the non-equilibrium Green's functions (NEGFs) of quantum many-body systems, poses significant computa...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Unleashing 5G Seamless Integration with TSN for Industry 5.0: Frame Forwarding and QoS TreatmentOscar Adamuz-Hinojosa, Felix Delgado-Ferro, Jorge Navarro-Ortiz, Pablo Muñoz, Pablo Ameigeiras2025-05-26下载Integrating Time-Sensitive Networking (TSN) and 5th Generation (5G) systems is key for providing wireless low-latency services in industry. Despite research efforts, challenges remain.
A Cost-efficient Credit-Based Shaper Deployment Framework for Time-Sensitive NetworksSantiago Torres-Borda, Ahlem Mifdaoui2025-05-26下载Time-sensitive networks are designed to meet stringent Quality of Service (QoS) requirements for mixed-criticality traffic with diverse performance demands.
MetaSTNet: Multimodal Meta-learning for Cellular Traffic Conformal PredictionHui Ma, Kai Yang2025-05-26下载Network traffic prediction techniques have attracted much attention since they are valuable for network congestion control and user experience improvement.

cs.PF - Performance

标题作者发布日期PDF摘要
Avoid Forgetting by Preserving Global Knowledge Gradients in Federated Learning with Non-IID DataAbhijit Chunduru, Majid Morafah, Mahdi Morafah, Vishnu Pandi Chellapandi, Ang Li2025-05-26下载The inevitable presence of data heterogeneity has made federated learning very challenging. There are numerous methods to deal with this issue, such as local regularization, better model fusion techni...
Universal Workers: A Vision for Eliminating Cold Starts in Serverless ComputingSaman Akbari, Manfred Hauswirth2025-05-26下载Serverless computing enables developers to deploy code without managing infrastructure, but suffers from cold start overhead when initializing new function instances.
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear ApproximationDong Liu, Yanxuan Yu, Jiayi Zhang, Yifan Li, Ben Lengerich, Ying Nian Wu2025-05-26下载Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due to their iterative structure and deep transformer stacks.

基于 VitePress 构建