Skip to content

2025-07-23

cs.AR - Architecture

标题作者发布日期PDF摘要
Clo-HDnn: A 4.66 TFLOPS/W and 3.78 TOPS/W Continual On-Device Learning Accelerator with Energy-efficient Hyperdimensional Computing via Progressive SearchChang Eun Song, Weihong Xu, Keming Fan, Soumil Jain, Gopabandhu Hota, Haichao Yang, Leo Liu, Kerem Akarvardar, Meng-Fan Chang, Carlos H. Diaz, Gert Cauwenberghs, Tajana Rosing, Mingu Kang2025-07-23下载Clo-HDnn is an on-device learning (ODL) accelerator designed for emerging continual learning (CL) tasks. Clo-HDnn integrates hyperdimensional computing (HDC) along with low-cost Kronecker HD Encoder a...
Neuromorphic Computing: A Theoretical Framework for Time, Space, and Energy ScalingJames B Aimone2025-07-23下载Neuromorphic computing (NMC) is increasingly viewed as a low-power alternative to conventional von Neumann architectures such as central processing units (CPUs) and graphics processing units (GPUs), h...
FedChip: Federated LLM for Artificial Intelligence Accelerator Chip DesignMahmoud Nazzal, Khoa Nguyen, Deepak Vungarala, Ramtin Zand, Shaahin Angizi, Hai Phan, Abdallah Khreishah2025-07-23下载AI hardware design is advancing rapidly, driven by the promise of design automation to make chip development faster, more efficient, and more accessible to a wide range of users.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Enabling Scalability in Asynchronous and Bidirectional Communication in LPWANMahbubur Rahman2025-07-23下载LPWANs have become ubiquitous due to their ability to connect sensors over large geographic areas in a single hop. It is, however, very challenging to achieve massive scalability in LPWANs, where nume...
PowerTrip: Exploiting Federated Heterogeneous Datacenter Power for Distributed ML TrainingTalha Mehboob, Luanzheng Guo, Nathan Tallent, Michael Zink, David Irwin2025-07-23下载The exponential growth of large-scale AI models has led to computational and power demands that can exceed the capacity of a single data center.
Neuromorphic Computing: A Theoretical Framework for Time, Space, and Energy ScalingJames B Aimone2025-07-23下载Neuromorphic computing (NMC) is increasingly viewed as a low-power alternative to conventional von Neumann architectures such as central processing units (CPUs) and graphics processing units (GPUs), h...
Optimizing Edge Gaming Slices through an Enhanced User Plane Function and Analytics in Beyond-5G NetworksBruno Marques da Silva, Larissa Ferreira Rodrigues Moreira, Flávio de Oliveira Silva, Rodrigo Moreira2025-07-23下载The latest generation of games and pervasive communication technologies poses challenges in service management and Service-Level Agreement compliance for mobile users.
Comparing performance of variational quantum algorithm simulations on HPC systemsMarco De Pascale, Tobias Valentin Bauer, Yaknan John Gambo, Mario Hernández Vera, Stefan Huber, Burak Mete, Amit Jamadagni, Amine Bentellis, Marita Oliv, Luigi Iapichino, Jeanette Miriam Lorenz2025-07-23下载Variational quantum algorithms are of special importance in the research on quantum computing applications because of their applicability to current Noisy Intermediate-Scale Quantum (NISQ) devices.
Enhancing Quantum Federated Learning with Fisher Information-Based OptimizationAmandeep Singh Bhatia, Sabre Kais2025-07-23下载Federated Learning (FL) has become increasingly popular across different sectors, offering a way for clients to work together to train a global model without sharing sensitive data.
Distributed P2P quantile tracking with relative value errorMarco Pulimeno, Italo Epicoco, Massimo Cafaro2025-07-23下载In this paper we present \textsc{DUDDSketch}, a distributed version of the \textsc{UDDSketch} algorithm for accurate tracking of quantiles. The algorithm is a fully decentralized, gossip-based distrib...
Multiprocessor Scheduling with Memory Constraints: Fundamental Properties and Finding Optimal SolutionsPál András Papp, Toni Böhnlein, A. N. Yzelman2025-07-23下载We study the problem of scheduling a general computational DAG on multiple processors in a 2-level memory hierarchy. This setting is a natural generalization of several prominent models in the literat...
CHAMP: A Configurable, Hot-Swappable Edge Architecture for Adaptive Biometric TasksJoel Brogan, Matthew Yohe, David Cornett2025-07-23下载What if you could piece together your own custom biometrics and AI analysis system, a bit like LEGO blocks? We aim to bring that technology to field operators in the field who require flexible, high-p...
Efficient Column-Wise N:M Pruning on RISC-V CPUChi-Wei Chu, Ding-Yong Hong, Jan-Jan Wu2025-07-23下载In deep learning frameworks, weight pruning is a widely used technique for improving computational efficiency by reducing the size of large models.
Eco-Friendly AI: Unleashing Data Power for Green Federated LearningMattia Sabella, Monica Vitali2025-07-23下载The widespread adoption of Artificial Intelligence (AI) and Machine Learning (ML) comes with a significant environmental impact, particularly in terms of energy consumption and carbon emissions.
P3SL: Personalized Privacy-Preserving Split Learning on Heterogeneous Edge DevicesWei Fan, JinYi Yoon, Xiaochang Li, Huajie Shao, Bo Ji2025-07-23下载Split Learning (SL) is an emerging privacy-preserving machine learning technique that enables resource constrained edge devices to participate in model training by partitioning a model into client-sid...
BrownoutServe: SLO-Aware Inference Serving under Bursty Workloads for MoE-based LLMsJianmin Hu, Minxian Xu, Kejiang Ye, Chengzhong Xu2025-07-23下载In recent years, the Mixture-of-Experts (MoE) architecture has been widely applied to large language models (LLMs), providing a promising solution that activates only a subset of the model's parameter...
Auto-scaling Approaches for Microservice Applications: A Survey and TaxonomyMinxian Xu, Junhan Liao, Linfeng Wen, Huaming Wu, Kejiang Ye, Rajkumar Buyya, Chengzhong Xu2025-07-23下载Microservice applications are created as loosely coupled application components and they leverage cloud elasticity to reduce costs and increase development speed.
BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference ServingWanyi Zheng, Minxian Xu, Shengye Song, Kejiang Ye2025-07-23下载Large language models (LLMs) have become increasingly popular in various areas, traditional business gradually shifting from rule-based systems to LLM-based solutions.
PathWeaver: A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor SearchSukjin Kim, Seongyeon Park, Si Ung Noh, Junguk Hong, Taehee Kwon, Hunseong Lim, Jinho Lee2025-07-23下载Graph-based Approximate Nearest Neighbor Search (ANNS) is widely adopted in numerous applications, such as recommendation systems, natural language processing, and computer vision.
Mapple: A Domain-Specific Language for Mapping Distributed ProgramsAnjiang Wei, Rohan Yadav, Hang Song, Wonchan Lee, Ke Wang, Alex Aiken2025-07-23下载Optimizing parallel programs for distributed systems is a complex task, often requiring significant code modifications. Task-based programming systems improve modularity by separating performance deci...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Enabling Scalability in Asynchronous and Bidirectional Communication in LPWANMahbubur Rahman2025-07-23下载LPWANs have become ubiquitous due to their ability to connect sensors over large geographic areas in a single hop. It is, however, very challenging to achieve massive scalability in LPWANs, where nume...
Talk with the Things: Integrating LLMs into IoT NetworksAlakesh Kalita2025-07-23下载The convergence of Large Language Models (LLMs) and Internet of Things (IoT) networks open new opportunities for building intelligent, responsive, and user-friendly systems.
ARCADE: A RAN Diagnosis Methodology in a Hybrid AI Environment for 6G NetworksDaniel Ricardo Cunha Oliveira, Rodrigo Moreira, Flávio de Oliveira Silva2025-07-23下载Artificial Intelligence (AI) plays a key role in developing 6G networks. While current specifications already include Network Data Analytics Function (NWDAF) as a network element responsible for provi...
Frame-Based Zero-Shot Semantic Channel Equalization for AI-Native CommunicationsSimone Fiorellino, Claudio Battiloro, Emilio Calvanese Strinati, Paolo Di Lorenzo2025-07-23下载In future AI-native wireless networks, the presence of mismatches between the latent spaces of independently designed and trained deep neural network (DNN) encoders may impede mutual understanding due...
Symmetric Private Information Retrieval (SPIR) on Graph-Based Replicated SystemsShreya Meel, Sennur Ulukus2025-07-23下载We introduce the problem of symmetric private information retrieval (SPIR) on replicated databases modeled by a simple graph. In this model, each vertex corresponds to a server, and a message is repli...
Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven NetworksIlias Chatzistefanidis, Navid Nikaein2025-07-23下载Large Language Model (LLM)-based autonomous agents are expected to play a vital role in the evolution of 6G networks, by empowering real-time decision-making related to management and service provisio...
HSM and TPM Failures in Cloud: A Real-World Taxonomy and Emerging DefensesShams Shaikh, Trima P. Fernandes e Fizardo2025-07-23下载As cloud infrastructure becomes the backbone of modern organizations, the security of cryptographic key management, especially using Hardware Security Modules (HSMs) and Trusted Platform Modules (TPMs...
A Virtual Quantum Network Prototype for Open AccessRaj Kamleshkumar Madhu, Visuttha Manthamkarn, Zheshen Zhang, Jianqing Liu2025-07-23下载The rise of quantum networks has revolutionized domains such as communication, sensing, and cybersecurity. Despite this progress, current quantum network systems remain limited in scale, are highly ap...
Active Attack Resilience in 5G: A New Take on Authentication and Key AgreementNazatul H. Sultan, Xinlong Guan, Josef Pieprzyk, Wei Ni, Sharif Abuadbba, Hajime Suzuki2025-07-23下载As 5G networks expand into critical infrastructure, secure and efficient user authentication is more important than ever. The 5G-AKA protocol, standardized by 3GPP in TS 33.
Information Entropy-Based Scheduling for Communication-Efficient Decentralized LearningJaiprakash Nagar, Zheng Chen, Marios Kountouris, Photios A. Stavrou2025-07-23下载This paper addresses decentralized stochastic gradient descent (D-SGD) over resource-constrained networks by introducing node-based and link-based scheduling strategies to enhance communication effici...
Custody Transfer and Compressed Status Reporting for Bundle Protocol Version 7Alice Le Bihan, Felix Flentge, Juan A. Fraire2025-07-23下载As space missions increase, there is a growing need to replace point-to-point communication with an efficient and reliable network-centric communication approach.
Our Cars Can Talk: How IoT Brings AI to VehiclesAmod Kant Agrawal2025-07-23下载Bringing AI to vehicles and enabling them as sensing platforms is key to transforming maintenance from reactive to proactive. Now is the time to integrate AI copilots that speak both languages: machin...
Closed-Form and Boundary Expressions for Task-Success Probability in Status-Driven SystemsJianpeng Qi, Chao Liu, Rui Wang, Junyu Dong, Yanwei Yu2025-07-23下载Timely and efficient dissemination of server status is critical in compute-first networking systems, where user tasks arrive dynamically and computing resources are limited and stochastic.
LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV NetworksLijie Zheng, Ji He, Shih Yu Chang, Yulong Shen, Dusit Niyato2025-07-23下载This work tackles the physical layer security (PLS) problem of maximizing the secrecy rate in heterogeneous UAV networks (HetUAVNs) under propulsion energy constraints.

cs.PF - Performance

标题作者发布日期PDF摘要
SimLens for Early Exit in Large Language Models: Eliciting Accurate Latent Predictions with One More TokenMing Ma, Bowen Zheng, Zhongqiao Lin, Tianming Yang2025-07-23下载Intermediate-layer predictions in large language models (LLMs) are informative but hard to decode accurately, especially at early layers. Existing lens-style methods typically rely on direct linear re...

基于 VitePress 构建