2025-08-05

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
TROOP: At-the-Roofline Performance for Vector Processors on Low Operational Intensity Workloads	Navaneeth Kunhi Purayil, Diyou Shen, Matteo Perotti, Luca Benini	2025-08-05	下载	The fast evolution of Machine Learning (ML) models requires flexible and efficient hardware solutions as hardwired accelerators face rapid obsolescence.
Flexible In-NAND Cryptographic Processing for Secure Flash Storage	Seock-Hwan Noh, Hoyeon Lee, Junkyum Kim, Junsu Im, Jay H. Park, Sungjin Lee, Sam H. Noh, Yeseong Kim, Jaeha Kung	2025-08-05	下载	We present FlashVault, an in-NAND self-encryption architecture that embeds a reconfigurable cryptographic engine into the unused silicon area of a state-of-the-art 4D V-NAND structure.
Rhea: a Framework for Fast Design and Validation of RTL Cache-Coherent Memory Subsystems	Davide Zoni, Andrea Galimberti, Adriano Guarisco	2025-08-05	下载	Designing and validating efficient cache-coherent memory subsystems is a critical yet complex task in the development of modern multi-core system-on-chip architectures.
Towards Memory Specialization: A Case for Long-Term and Short-Term RAM	Peijing Li, Muhammad Shahir Abdurraman, Rachel Cleaveland, Sergey Legtchenko, Philip Levis, Ioan Stefanovici, Thierry Tambe, David Tennenhouse, Caroline Trippel	2025-08-05	下载	Both SRAM and DRAM have stopped scaling: there is no technical roadmap to reduce their cost (per byte/GB). As a result, memory now dominates system cost.
Mamba-X: An End-to-End Vision Mamba Accelerator for Edge Computing Devices	Dongho Yoon, Gungyu Lee, Jaewon Chang, Yunjae Lee, Dongjae Lee, Minsoo Rhu	2025-08-05	下载	Transformers have proven effective in language modeling but are limited by high computational and memory demands that grow quadratically with input sequence length.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Intelligent Sampling of Extreme-Scale Turbulence Datasets for Accurate and Efficient Spatiotemporal Model Training	Wesley Brewer, Murali Meena Gopalakrishnan, Matthias Maiterth, Aditya Kashi, Jong Youl Choi, Pei Zhang, Stephen Nichols, Riccardo Balin, Miles Couchman, Stephen de Bruyn Kops, P. K. Yeung, Daniel Dotson, Rohini Uma-Vaideswaran, Sarp Oral, Feiyi Wang	2025-08-05	下载	With the end of Moore's law and Dennard scaling, efficient training increasingly requires rethinking data volume. Can we train better models with significantly less data via intelligent subsampling? T...
Two-dimensional Sparse Parallelism for Large Scale Deep Learning Recommendation Model Training	Xin Zhang, Quanyu Zhu, Liangbei Xu, Zain Huda, Wang Zhou, Jin Fang, Dennis van der Staay, Yuxi Hu, Jade Nie, Jiyan Yang, Chunzhi Yang	2025-08-05	下载	The increasing complexity of deep learning recommendation models (DLRM) has led to a growing need for large-scale distributed systems that can efficiently train vast amounts of data.
Block: Balancing Load in LLM Serving with Context, Knowledge and Predictive Scheduling	Wei Da, Evangelia Kalyvianaki	2025-08-05	下载	This paper presents Block, a distributed scheduling framework designed to optimize load balancing and auto-provisioning across instances in large language model serving frameworks by leveraging contex...
In-Memory Non-Binary LDPC Decoding	Oscar Ferraz, Vitor Silva, Gabriel Falcao	2025-08-05	下载	Low-density parity-check (LDPC) codes are an important feature of several communication and storage applications, offering a flexible and effective method for error correction.
Understanding the Landscape of Ampere GPU Memory Errors	Zhu Zhu, Yu Sun, Dhatri Parakal, Bo Fang, Steven Farrell, Gregory H. Bauer, Brett Bode, Ian T. Foster, Michael E. Papka, William Gropp, Zhao Zhang, Lishan Yang	2025-08-05	下载	Graphics Processing Units (GPUs) have become a de facto solution for accelerating high-performance computing (HPC) applications. Understanding their memory error behavior is an essential step toward a...
Optimal Simultaneous Byzantine Agreement, Common Knowledge and Limited Information Exchange	Ron van der Meyden	2025-08-05	下载	In order to develop solutions that perform actions as early as possible, analysis of distributed algorithms using epistemic logic has generally concentrated on ``full information protocols'', which ma...
Directives for Function Offloading in 5G Networks Based on a Performance Characteristics Analysis	Falk Dettinger, Matthias Weiß, Daniel Baumann, Martin Sommer, Michael Weyrich	2025-08-05	下载	Cloud-based offloading helps address energy consumption and performance challenges in executing resource-intensive vehicle algorithms. Utilizing 5G, with its low latency and high bandwidth, enables se...
Frontier: Simulating the Next Generation of LLM Inference Systems	Yicheng Feng, Xin Tan, Kin Hang Sew, Yimin Jiang, Yibo Zhu, Hong Xu	2025-08-05	下载	Large Language Model (LLM) inference is growing increasingly complex with the rise of Mixture-of-Experts (MoE) models and disaggregated architectures that decouple components like prefill/decode (PD) ...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Confidence Driven Classification of Application Types in the Presence of Background Network Traffic	Eun Hun Choi, Jasleen Kaur, Vladas Pipiras, Nelson Gomes Rodrigues Antunes, Brendan Massey	2025-08-05	下载	Accurately classifying the application types of network traffic using deep learning models has recently gained popularity. However, we find that these classifiers do not perform well on real-world tra...
Data-Driven Spectrum Demand Prediction: A Spatio-Temporal Framework with Transfer Learning	Amin Farajzadeh, Hongzhao Zheng, Sarah Dumoulin, Trevor Ha, Halim Yanikomeroglu, Amir Ghasemi	2025-08-05	下载	Accurate spectrum demand prediction is crucial for informed spectrum allocation, effective regulatory planning, and fostering sustainable growth in modern wireless communication networks.
CASH: Context-Aware Smart Handover for Reliable UAV Connectivity on Aerial Corridors	Abdul Saboor, Zhuangzhuang Cui, Achiel Colpaert, Evgenii Vinogradov, Sofie Pollin	2025-08-05	下载	Urban Air Mobility (UAM) envisions aerial corridors for Unmanned Aerial Vehicles (UAVs) to reduce ground traffic congestion by supporting 3D mobility, such as air taxis.
What If, But Privately: Private Counterfactual Retrieval	Shreya Meel, Mohamed Nomeir, Pasan Dissanayake, Sanghamitra Dutta, Sennur Ulukus	2025-08-05	下载	Transparency and explainability are two important aspects to be considered when employing black-box machine learning models in high-stake applications.
Decoding and Engineering the Phytobiome Communication for Smart Agriculture	Fatih Gulec, Hamdan Awan, Nigel Wallbridge, Andrew W. Eckford	2025-08-05	下载	Smart agriculture applications, integrating technologies like the Internet of Things and machine learning/artificial intelligence (ML/AI) into agriculture, hold promise to address modern challenges of...
Heterogeneity-Oblivious Robust Federated Learning	Weiyao Zhang, Jinyang Li, Qi Song, Miao Wang, Chungang Lin, Haitong Luo, Xuying Meng, Yujun Zhang	2025-08-05	下载	Federated Learning (FL) remains highly vulnerable to poisoning attacks, especially under real-world hyper-heterogeneity, where clients differ significantly in data distributions, communication capabil...
Agoran: An Agentic Open Marketplace for 6G RAN Automation	Ilias Chatzistefanidis, Navid Nikaein, Andrea Leone, Ali Maatouk, Leandros Tassiulas, Roberto Morabito, Ioannis Pitsiorlas, Marios Kountouris	2025-08-05	下载	Next-generation mobile networks must reconcile the often-conflicting goals of multiple service owners. However, today's network slice controllers remain rigid, policy-bound, and unaware of the busines...
Bidirectional TLS Handshake Caching for Constrained Industrial IoT Scenarios	Jörn Bodenhausen, Simon Mangel, Thomas Vogt, Martin Henze	2025-08-05	下载	While TLS has become the de-facto standard for end-to-end security, its use to secure critical communication in evolving industrial IoT scenarios is severely limited by prevalent resource constraints ...
Directives for Function Offloading in 5G Networks Based on a Performance Characteristics Analysis	Falk Dettinger, Matthias Weiß, Daniel Baumann, Martin Sommer, Michael Weyrich	2025-08-05	下载	Cloud-based offloading helps address energy consumption and performance challenges in executing resource-intensive vehicle algorithms. Utilizing 5G, with its low latency and high bandwidth, enables se...
Energy-efficient Federated Learning for UAV Communications	Chien-Wei Fu, Meng-Lin Ku	2025-08-05	下载	In this paper, we propose an unmanned aerial vehicle (UAV)-assisted federated learning (FL) framework that jointly optimizes UAV trajectory, user participation, power allocation, and data volume contr...
Scalability and Performance Evaluation of IEEE 802.11ah IoT Deployments: A Testbed Approach	Kostas Chounos, Katerina Kyriakou, Thanasis Korakis	2025-08-05	下载	This work focuses on the development and assessment of modern wireless Internet of Things (IoT) architectures, with relevance to emerging 5G and beyond applications.
NANDA Adaptive Resolver: Architecture for Dynamic Resolution of AI Agent Names	John Zinky, Hema Seshadri, Mahesh Lambe, Pradyumna Chari, Ramesh Raskar	2025-08-05	下载	AdaptiveResolver is a dynamic microservice architecture designed to address the limitations of static endpoint resolution for AI agent communication in distributed, heterogeneous environments.
Using the NANDA Index Architecture in Practice: An Enterprise Perspective	Sichao Wang, Ramesh Raskar, Mahesh Lambe, Pradyumna Chari, Rekha Singhal, Shailja Gupta, Rajesh Ranjan, Ken Huang	2025-08-05	下载	The proliferation of autonomous AI agents represents a paradigmatic shift from traditional web architectures toward collaborative intelligent systems requiring sophisticated mechanisms for discovery, ...
Evolution of AI Agent Registry Solutions: Centralized, Enterprise, and Distributed Approaches	Aditi Singh, Abul Ehtesham, Mahesh Lambe, Jared James Grogan, Abhishek Singh, Saket Kumar, Luca Muscariello, Vijoy Pandey, Guillaume Sauvage De Saint Marc, Pradyumna Chari, Ramesh Raskar	2025-08-05	下载	Autonomous AI agents now operate across cloud, enterprise, and decentralized domains, creating demand for registry infrastructures that enable trustworthy discovery, capability negotiation, and identi...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
RX-INT: A Kernel Engine for Real-Time Detection and Analysis of In-Memory Threats	Arjun Juneja	2025-08-05	下载	Malware and cheat developers use fileless execution techniques to evade traditional, signature-based security products. These methods include various types of manual mapping, module stomping, and thre...
MaLV-OS: Rethinking the Operating System Architecture for Machine Learning in Virtualized Clouds	Stella Bitchebe, Oana Balmau	2025-08-05	下载	A large body of research has employed Machine Learning (ML) models to develop learned operating systems (OSes) and kernels. The latter dynamically adapts to the job load and dynamically adjusts resour...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
A Novel Hybrid Optical and STAR IRS System for NTN Communications	Shunyuan Shang, Emna Zedini, Abla Kammoun, Mohamed-Slim Alouini	2025-08-05	下载	This paper proposes a novel non-terrestrial networks (NTNs) system that integrates optical intelligent reflecting surfaces (OIRS) and simultaneous transmitting and reflecting Intelligent reflecting su...