Appearance
2025-07-23
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Clo-HDnn: A 4.66 TFLOPS/W and 3.78 TOPS/W Continual On-Device Learning Accelerator with Energy-efficient Hyperdimensional Computing via Progressive Search | Chang Eun Song, Weihong Xu, Keming Fan, Soumil Jain, Gopabandhu Hota, Haichao Yang, Leo Liu, Kerem Akarvardar, Meng-Fan Chang, Carlos H. Diaz, Gert Cauwenberghs, Tajana Rosing, Mingu Kang | 2025-07-23 | 下载 | Clo-HDnn is an on-device learning (ODL) accelerator designed for emerging continual learning (CL) tasks. Clo-HDnn integrates hyperdimensional computing (HDC) along with low-cost Kronecker HD Encoder a... |
| Neuromorphic Computing: A Theoretical Framework for Time, Space, and Energy Scaling | James B Aimone | 2025-07-23 | 下载 | Neuromorphic computing (NMC) is increasingly viewed as a low-power alternative to conventional von Neumann architectures such as central processing units (CPUs) and graphics processing units (GPUs), h... |
| FedChip: Federated LLM for Artificial Intelligence Accelerator Chip Design | Mahmoud Nazzal, Khoa Nguyen, Deepak Vungarala, Ramtin Zand, Shaahin Angizi, Hai Phan, Abdallah Khreishah | 2025-07-23 | 下载 | AI hardware design is advancing rapidly, driven by the promise of design automation to make chip development faster, more efficient, and more accessible to a wide range of users. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Enabling Scalability in Asynchronous and Bidirectional Communication in LPWAN | Mahbubur Rahman | 2025-07-23 | 下载 | LPWANs have become ubiquitous due to their ability to connect sensors over large geographic areas in a single hop. It is, however, very challenging to achieve massive scalability in LPWANs, where nume... |
| PowerTrip: Exploiting Federated Heterogeneous Datacenter Power for Distributed ML Training | Talha Mehboob, Luanzheng Guo, Nathan Tallent, Michael Zink, David Irwin | 2025-07-23 | 下载 | The exponential growth of large-scale AI models has led to computational and power demands that can exceed the capacity of a single data center. |
| Neuromorphic Computing: A Theoretical Framework for Time, Space, and Energy Scaling | James B Aimone | 2025-07-23 | 下载 | Neuromorphic computing (NMC) is increasingly viewed as a low-power alternative to conventional von Neumann architectures such as central processing units (CPUs) and graphics processing units (GPUs), h... |
| Optimizing Edge Gaming Slices through an Enhanced User Plane Function and Analytics in Beyond-5G Networks | Bruno Marques da Silva, Larissa Ferreira Rodrigues Moreira, Flávio de Oliveira Silva, Rodrigo Moreira | 2025-07-23 | 下载 | The latest generation of games and pervasive communication technologies poses challenges in service management and Service-Level Agreement compliance for mobile users. |
| Comparing performance of variational quantum algorithm simulations on HPC systems | Marco De Pascale, Tobias Valentin Bauer, Yaknan John Gambo, Mario Hernández Vera, Stefan Huber, Burak Mete, Amit Jamadagni, Amine Bentellis, Marita Oliv, Luigi Iapichino, Jeanette Miriam Lorenz | 2025-07-23 | 下载 | Variational quantum algorithms are of special importance in the research on quantum computing applications because of their applicability to current Noisy Intermediate-Scale Quantum (NISQ) devices. |
| Enhancing Quantum Federated Learning with Fisher Information-Based Optimization | Amandeep Singh Bhatia, Sabre Kais | 2025-07-23 | 下载 | Federated Learning (FL) has become increasingly popular across different sectors, offering a way for clients to work together to train a global model without sharing sensitive data. |
| Distributed P2P quantile tracking with relative value error | Marco Pulimeno, Italo Epicoco, Massimo Cafaro | 2025-07-23 | 下载 | In this paper we present \textsc{DUDDSketch}, a distributed version of the \textsc{UDDSketch} algorithm for accurate tracking of quantiles. The algorithm is a fully decentralized, gossip-based distrib... |
| Multiprocessor Scheduling with Memory Constraints: Fundamental Properties and Finding Optimal Solutions | Pál András Papp, Toni Böhnlein, A. N. Yzelman | 2025-07-23 | 下载 | We study the problem of scheduling a general computational DAG on multiple processors in a 2-level memory hierarchy. This setting is a natural generalization of several prominent models in the literat... |
| CHAMP: A Configurable, Hot-Swappable Edge Architecture for Adaptive Biometric Tasks | Joel Brogan, Matthew Yohe, David Cornett | 2025-07-23 | 下载 | What if you could piece together your own custom biometrics and AI analysis system, a bit like LEGO blocks? We aim to bring that technology to field operators in the field who require flexible, high-p... |
| Efficient Column-Wise N:M Pruning on RISC-V CPU | Chi-Wei Chu, Ding-Yong Hong, Jan-Jan Wu | 2025-07-23 | 下载 | In deep learning frameworks, weight pruning is a widely used technique for improving computational efficiency by reducing the size of large models. |
| Eco-Friendly AI: Unleashing Data Power for Green Federated Learning | Mattia Sabella, Monica Vitali | 2025-07-23 | 下载 | The widespread adoption of Artificial Intelligence (AI) and Machine Learning (ML) comes with a significant environmental impact, particularly in terms of energy consumption and carbon emissions. |
| P3SL: Personalized Privacy-Preserving Split Learning on Heterogeneous Edge Devices | Wei Fan, JinYi Yoon, Xiaochang Li, Huajie Shao, Bo Ji | 2025-07-23 | 下载 | Split Learning (SL) is an emerging privacy-preserving machine learning technique that enables resource constrained edge devices to participate in model training by partitioning a model into client-sid... |
| BrownoutServe: SLO-Aware Inference Serving under Bursty Workloads for MoE-based LLMs | Jianmin Hu, Minxian Xu, Kejiang Ye, Chengzhong Xu | 2025-07-23 | 下载 | In recent years, the Mixture-of-Experts (MoE) architecture has been widely applied to large language models (LLMs), providing a promising solution that activates only a subset of the model's parameter... |
| Auto-scaling Approaches for Microservice Applications: A Survey and Taxonomy | Minxian Xu, Junhan Liao, Linfeng Wen, Huaming Wu, Kejiang Ye, Rajkumar Buyya, Chengzhong Xu | 2025-07-23 | 下载 | Microservice applications are created as loosely coupled application components and they leverage cloud elasticity to reduce costs and increase development speed. |
| BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference Serving | Wanyi Zheng, Minxian Xu, Shengye Song, Kejiang Ye | 2025-07-23 | 下载 | Large language models (LLMs) have become increasingly popular in various areas, traditional business gradually shifting from rule-based systems to LLM-based solutions. |
| PathWeaver: A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search | Sukjin Kim, Seongyeon Park, Si Ung Noh, Junguk Hong, Taehee Kwon, Hunseong Lim, Jinho Lee | 2025-07-23 | 下载 | Graph-based Approximate Nearest Neighbor Search (ANNS) is widely adopted in numerous applications, such as recommendation systems, natural language processing, and computer vision. |
| Mapple: A Domain-Specific Language for Mapping Distributed Programs | Anjiang Wei, Rohan Yadav, Hang Song, Wonchan Lee, Ke Wang, Alex Aiken | 2025-07-23 | 下载 | Optimizing parallel programs for distributed systems is a complex task, often requiring significant code modifications. Task-based programming systems improve modularity by separating performance deci... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Enabling Scalability in Asynchronous and Bidirectional Communication in LPWAN | Mahbubur Rahman | 2025-07-23 | 下载 | LPWANs have become ubiquitous due to their ability to connect sensors over large geographic areas in a single hop. It is, however, very challenging to achieve massive scalability in LPWANs, where nume... |
| Talk with the Things: Integrating LLMs into IoT Networks | Alakesh Kalita | 2025-07-23 | 下载 | The convergence of Large Language Models (LLMs) and Internet of Things (IoT) networks open new opportunities for building intelligent, responsive, and user-friendly systems. |
| ARCADE: A RAN Diagnosis Methodology in a Hybrid AI Environment for 6G Networks | Daniel Ricardo Cunha Oliveira, Rodrigo Moreira, Flávio de Oliveira Silva | 2025-07-23 | 下载 | Artificial Intelligence (AI) plays a key role in developing 6G networks. While current specifications already include Network Data Analytics Function (NWDAF) as a network element responsible for provi... |
| Frame-Based Zero-Shot Semantic Channel Equalization for AI-Native Communications | Simone Fiorellino, Claudio Battiloro, Emilio Calvanese Strinati, Paolo Di Lorenzo | 2025-07-23 | 下载 | In future AI-native wireless networks, the presence of mismatches between the latent spaces of independently designed and trained deep neural network (DNN) encoders may impede mutual understanding due... |
| Symmetric Private Information Retrieval (SPIR) on Graph-Based Replicated Systems | Shreya Meel, Sennur Ulukus | 2025-07-23 | 下载 | We introduce the problem of symmetric private information retrieval (SPIR) on replicated databases modeled by a simple graph. In this model, each vertex corresponds to a server, and a message is repli... |
| Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks | Ilias Chatzistefanidis, Navid Nikaein | 2025-07-23 | 下载 | Large Language Model (LLM)-based autonomous agents are expected to play a vital role in the evolution of 6G networks, by empowering real-time decision-making related to management and service provisio... |
| HSM and TPM Failures in Cloud: A Real-World Taxonomy and Emerging Defenses | Shams Shaikh, Trima P. Fernandes e Fizardo | 2025-07-23 | 下载 | As cloud infrastructure becomes the backbone of modern organizations, the security of cryptographic key management, especially using Hardware Security Modules (HSMs) and Trusted Platform Modules (TPMs... |
| A Virtual Quantum Network Prototype for Open Access | Raj Kamleshkumar Madhu, Visuttha Manthamkarn, Zheshen Zhang, Jianqing Liu | 2025-07-23 | 下载 | The rise of quantum networks has revolutionized domains such as communication, sensing, and cybersecurity. Despite this progress, current quantum network systems remain limited in scale, are highly ap... |
| Active Attack Resilience in 5G: A New Take on Authentication and Key Agreement | Nazatul H. Sultan, Xinlong Guan, Josef Pieprzyk, Wei Ni, Sharif Abuadbba, Hajime Suzuki | 2025-07-23 | 下载 | As 5G networks expand into critical infrastructure, secure and efficient user authentication is more important than ever. The 5G-AKA protocol, standardized by 3GPP in TS 33. |
| Information Entropy-Based Scheduling for Communication-Efficient Decentralized Learning | Jaiprakash Nagar, Zheng Chen, Marios Kountouris, Photios A. Stavrou | 2025-07-23 | 下载 | This paper addresses decentralized stochastic gradient descent (D-SGD) over resource-constrained networks by introducing node-based and link-based scheduling strategies to enhance communication effici... |
| Custody Transfer and Compressed Status Reporting for Bundle Protocol Version 7 | Alice Le Bihan, Felix Flentge, Juan A. Fraire | 2025-07-23 | 下载 | As space missions increase, there is a growing need to replace point-to-point communication with an efficient and reliable network-centric communication approach. |
| Our Cars Can Talk: How IoT Brings AI to Vehicles | Amod Kant Agrawal | 2025-07-23 | 下载 | Bringing AI to vehicles and enabling them as sensing platforms is key to transforming maintenance from reactive to proactive. Now is the time to integrate AI copilots that speak both languages: machin... |
| Closed-Form and Boundary Expressions for Task-Success Probability in Status-Driven Systems | Jianpeng Qi, Chao Liu, Rui Wang, Junyu Dong, Yanwei Yu | 2025-07-23 | 下载 | Timely and efficient dissemination of server status is critical in compute-first networking systems, where user tasks arrive dynamically and computing resources are limited and stochastic. |
| LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks | Lijie Zheng, Ji He, Shih Yu Chang, Yulong Shen, Dusit Niyato | 2025-07-23 | 下载 | This work tackles the physical layer security (PLS) problem of maximizing the secrecy rate in heterogeneous UAV networks (HetUAVNs) under propulsion energy constraints. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| SimLens for Early Exit in Large Language Models: Eliciting Accurate Latent Predictions with One More Token | Ming Ma, Bowen Zheng, Zhongqiao Lin, Tianming Yang | 2025-07-23 | 下载 | Intermediate-layer predictions in large language models (LLMs) are informative but hard to decode accurately, especially at early layers. Existing lens-style methods typically rely on direct linear re... |