Skip to content

2025-04-07

cs.AR - Architecture

标题作者发布日期PDF摘要
FERIVer: An FPGA-assisted Emulated Framework for RTL Verification of RISC-V ProcessorsKun Qin, Xiaorang Guo, Martin Schulz, Carsten Trinitis2025-04-07下载Processor design and verification require a synergistic approach that combines instruction-level functional simulations with precise hardware emulations.
Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function SelectionJon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe2025-04-07下载Machine learning-based embedded systems for safety-critical applications, such as aerospace and autonomous driving, must be robust to perturbations caused by soft errors.
AccLLM: Accelerating Long-Context LLM Inference Via Algorithm-Hardware Co-DesignYanbiao Liang, Huihong Shi, Haikuo Shao, Zhongfeng Wang2025-04-07下载Recently, large language models (LLMs) have achieved huge success in the natural language processing (NLP) field, driving a growing demand to extend their deployment from the cloud to edge devices.
N-TORC: Native Tensor Optimizer for Real-time ConstraintsSuyash Vardhan Singh, Iftakhar Ahmad, David Andrews, Miaoqing Huang, Austin R. J. Downey, Jason D. Bakos2025-04-07下载Compared to overlay-based tensor architectures like VTA or Gemmini, compilers that directly translate machine learning models into a dataflow architecture as HLS code, such as HLS4ML and FINN, general...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
dpBento: Benchmarking DPUs for Data ProcessingJiasheng Hu, Chihan Cui, Anna Li, Raahil Vora, Yuanfan Chen, Philip A. Bernstein, Jialin Li, Qizhen Zhang2025-04-07下载Data processing units (DPUs, SoC-based SmartNICs) are emerging data center hardware that provide opportunities to address cloud data processing challenges.
Constraint Programming Models For Serial Batch Scheduling With Minimum Batch SizeJorge A. Huertas, Pascal Van Hentenryck2025-04-07下载In serial batch (s-batch) scheduling, jobs are grouped in batches and processed sequentially within their batch. This paper considers multiple parallel machines, nonidentical job weights and release t...
Federated Learning for Medical Image Classification: A Comprehensive BenchmarkZhekai Zhou, Guibo Luo, Mingzhi Chen, Zhenyu Weng, Yuesheng Zhu2025-04-07下载The federated learning paradigm is wellsuited for the field of medical image analysis, as it can effectively cope with machine learning on isolated multicenter data while protecting the privacy of par...
Reducing the Communication of Distributed Model Predictive Control: Autoencoders and Formation ControlTorben Schiz, Henrik Ebel2025-04-07下载Communication remains a key factor limiting the applicability of distributed model predictive control (DMPC) in realistic settings, despite advances in wireless communication.
Distributed Quantum Advantage in Locally Checkable Labeling ProblemsAlkida Balliu, Filippo Casagrande, Francesco d'Amore, Massimo Equi, Barbara Keller, Henrik Lievonen, Dennis Olivetti, Gustav Schmid, Jukka Suomela2025-04-07下载In this paper, we present the first known example of a locally checkable labeling problem (LCL) that admits asymptotic distributed quantum advantage in the LOCAL model of distributed computing: our pr...
PRDTs: Composable Knowledge-Based Consensus Protocols with Replicated Data TypesJulian Haas, Ragnar Mogk, Annette Bieniusa, Mira Mezini2025-04-07下载Consensus protocols are fundamental in distributed systems as they enable software with strong consistency properties. However, designing optimized protocols for specific use-cases under certain syste...
Towards Optimal Heterogeneous Client Sampling in Multi-Model Federated LearningHaoran Zhang, Zejun Gong, Zekai Li, Marie Siew, Carlee Joe-Wong, Rachid El-Azouzi2025-04-07下载Federated learning (FL) allows edge devices to collaboratively train models without sharing local data. As FL gains popularity, clients may need to train multiple unrelated FL models, but communicatio...
Decentralized Semantic Federated Learning for Real-Time Public Safety Tasks: Challenges, Methods, and DirectionsBaosheng Li, Weifeng Gao, Zehui Xiong, Jin Xie, Binquan Guo, Miao Du2025-04-07下载Public safety tasks rely on the collaborative functioning of multiple edge devices (MEDs) and base stations (BSs) in different regions, consuming significant communication energy and computational res...
Prima.cpp: Fast 30-70B LLM Inference on Heterogeneous and Low-Resource Home ClustersZonghang Li, Tao Li, Wenjiao Feng, Rongxing Xiao, Jianshu She, Hong Huang, Mohsen Guizani, Hongfang Yu, Qirong Ho, Wei Xiang, Steve Liu2025-04-07下载On-device inference offers privacy, offline use, and instant response, but consumer hardware restricts large language models (LLMs) to low throughput and capability.
Serverless Approach to Running Resource-Intensive STAR AlignerPiotr Kica, Michał Orzechowski, Maciej Malawski2025-04-07下载The application of serverless computing for alignment of RNA-sequences can improve many existing bioinformatics workflows by reducing operational costs and execution times.
Transforming Future Data Center Operations and Management via Physical AIZhiwei Cao, Minghao Li, Feng Lin, Jimin Jia, Yonggang Wen, Jianxiong Yin, Simon See2025-04-07下载Data centers (DCs) as mission-critical infrastructures are pivotal in powering the growth of artificial intelligence (AI) and the digital economy.
Enhancing Trust in AI Marketplaces: Evaluating On-Chain Verification of Personalized AI models using zk-SNARKsNishant Jagannath, Christopher Wong, Braden Mcgrath, Md Farhad Hossain, Asuquo A. Okon, Abbas Jamalipour, Kumudu S. Munasinghe2025-04-07下载The rapid advancement of artificial intelligence (AI) has brought about sophisticated models capable of various tasks ranging from image recognition to natural language processing.
Scaling Graph Neural Networks for Particle Track ReconstructionAlok Tripathy, Alina Lazar, Xiangyang Ju, Paolo Calafiura, Katherine Yelick, Aydin Buluc2025-04-07下载Particle track reconstruction is an important problem in high-energy physics (HEP), necessary to study properties of subatomic particles. Traditional track reconstruction algorithms scale poorly with ...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Status Updating with Time Stamp ErrorsMd Nurul Absar Siddiky, Ahmed Arafa2025-04-07下载A status updating system is considered in which multiple processes are sampled and transmitted through a shared channel. Each process has its dedicated server that processes its samples before time st...
Security Risks in Vision-Based Beam Prediction: From Spatial Proxy Attacks to Feature RefinementAvi Deb Raha, Kitae Kim, Mrityunjoy Gain, Apurba Adhikary, Zhu Han, Eui-Nam Huh, Choong Seon Hong2025-04-07下载The rapid evolution towards the sixth-generation (6G) networks demands advanced beamforming techniques to address challenges in dynamic, high-mobility scenarios, such as vehicular communications.
Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation FrameworkYu Min Park, Yan Kyaw Tun, Eui-Nam Huh, Walid Saad, Choong Seon Hong2025-04-07下载Beamforming is a key technology in millimeter-wave (mmWave) communications that improves signal transmission by optimizing directionality and intensity.
Cellular Network Design for UAV Corridors via Data-driven High-dimensional Bayesian OptimizationMohamed Benzaghta, Giovanni Geraci, David López-Pérez, Alvaro Valcarce2025-04-07下载We address the challenge of designing cellular networks for uncrewed aerial vehicles (UAVs) corridors through a novel data-driven approach. We assess multiple state-of-the-art high-dimensional Bayesia...
Federated Learning over 5G, WiFi, and Ethernet: Measurements and EvaluationRobert J. Hayek, Joaquin Chung, Kayla Comer, Chandra R. Murthy, Rajkumar Kettimuthu, Igor Kadota2025-04-07下载Federated Learning (FL) deployments using IoT devices is an area that is poised to significantly benefit from advances in NextG wireless. In this paper, we deploy a FL application using a 5G-NR Standa...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Futureproof Static Memory PlanningChristos Lamprakos, Panagiotis Xanthopoulos, Manolis Katsaragakis, Sotirios Xydis, Dimitrios Soudris, Francky Catthoor2025-04-07下载The NP-complete combinatorial optimization task of assigning offsets to a set of buffers with known sizes and lifetimes so as to minimize total memory usage is called dynamic storage allocation (DSA).

基于 VitePress 构建