Appearance
2025-08-05
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| TROOP: At-the-Roofline Performance for Vector Processors on Low Operational Intensity Workloads | Navaneeth Kunhi Purayil, Diyou Shen, Matteo Perotti, Luca Benini | 2025-08-05 | 下载 | The fast evolution of Machine Learning (ML) models requires flexible and efficient hardware solutions as hardwired accelerators face rapid obsolescence. |
| Flexible In-NAND Cryptographic Processing for Secure Flash Storage | Seock-Hwan Noh, Hoyeon Lee, Junkyum Kim, Junsu Im, Jay H. Park, Sungjin Lee, Sam H. Noh, Yeseong Kim, Jaeha Kung | 2025-08-05 | 下载 | We present FlashVault, an in-NAND self-encryption architecture that embeds a reconfigurable cryptographic engine into the unused silicon area of a state-of-the-art 4D V-NAND structure. |
| Rhea: a Framework for Fast Design and Validation of RTL Cache-Coherent Memory Subsystems | Davide Zoni, Andrea Galimberti, Adriano Guarisco | 2025-08-05 | 下载 | Designing and validating efficient cache-coherent memory subsystems is a critical yet complex task in the development of modern multi-core system-on-chip architectures. |
| Towards Memory Specialization: A Case for Long-Term and Short-Term RAM | Peijing Li, Muhammad Shahir Abdurraman, Rachel Cleaveland, Sergey Legtchenko, Philip Levis, Ioan Stefanovici, Thierry Tambe, David Tennenhouse, Caroline Trippel | 2025-08-05 | 下载 | Both SRAM and DRAM have stopped scaling: there is no technical roadmap to reduce their cost (per byte/GB). As a result, memory now dominates system cost. |
| Mamba-X: An End-to-End Vision Mamba Accelerator for Edge Computing Devices | Dongho Yoon, Gungyu Lee, Jaewon Chang, Yunjae Lee, Dongjae Lee, Minsoo Rhu | 2025-08-05 | 下载 | Transformers have proven effective in language modeling but are limited by high computational and memory demands that grow quadratically with input sequence length. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Intelligent Sampling of Extreme-Scale Turbulence Datasets for Accurate and Efficient Spatiotemporal Model Training | Wesley Brewer, Murali Meena Gopalakrishnan, Matthias Maiterth, Aditya Kashi, Jong Youl Choi, Pei Zhang, Stephen Nichols, Riccardo Balin, Miles Couchman, Stephen de Bruyn Kops, P. K. Yeung, Daniel Dotson, Rohini Uma-Vaideswaran, Sarp Oral, Feiyi Wang | 2025-08-05 | 下载 | With the end of Moore's law and Dennard scaling, efficient training increasingly requires rethinking data volume. Can we train better models with significantly less data via intelligent subsampling? T... |
| Two-dimensional Sparse Parallelism for Large Scale Deep Learning Recommendation Model Training | Xin Zhang, Quanyu Zhu, Liangbei Xu, Zain Huda, Wang Zhou, Jin Fang, Dennis van der Staay, Yuxi Hu, Jade Nie, Jiyan Yang, Chunzhi Yang | 2025-08-05 | 下载 | The increasing complexity of deep learning recommendation models (DLRM) has led to a growing need for large-scale distributed systems that can efficiently train vast amounts of data. |
| Block: Balancing Load in LLM Serving with Context, Knowledge and Predictive Scheduling | Wei Da, Evangelia Kalyvianaki | 2025-08-05 | 下载 | This paper presents Block, a distributed scheduling framework designed to optimize load balancing and auto-provisioning across instances in large language model serving frameworks by leveraging contex... |
| In-Memory Non-Binary LDPC Decoding | Oscar Ferraz, Vitor Silva, Gabriel Falcao | 2025-08-05 | 下载 | Low-density parity-check (LDPC) codes are an important feature of several communication and storage applications, offering a flexible and effective method for error correction. |
| Understanding the Landscape of Ampere GPU Memory Errors | Zhu Zhu, Yu Sun, Dhatri Parakal, Bo Fang, Steven Farrell, Gregory H. Bauer, Brett Bode, Ian T. Foster, Michael E. Papka, William Gropp, Zhao Zhang, Lishan Yang | 2025-08-05 | 下载 | Graphics Processing Units (GPUs) have become a de facto solution for accelerating high-performance computing (HPC) applications. Understanding their memory error behavior is an essential step toward a... |
| Optimal Simultaneous Byzantine Agreement, Common Knowledge and Limited Information Exchange | Ron van der Meyden | 2025-08-05 | 下载 | In order to develop solutions that perform actions as early as possible, analysis of distributed algorithms using epistemic logic has generally concentrated on ``full information protocols'', which ma... |
| Directives for Function Offloading in 5G Networks Based on a Performance Characteristics Analysis | Falk Dettinger, Matthias Weiß, Daniel Baumann, Martin Sommer, Michael Weyrich | 2025-08-05 | 下载 | Cloud-based offloading helps address energy consumption and performance challenges in executing resource-intensive vehicle algorithms. Utilizing 5G, with its low latency and high bandwidth, enables se... |
| Frontier: Simulating the Next Generation of LLM Inference Systems | Yicheng Feng, Xin Tan, Kin Hang Sew, Yimin Jiang, Yibo Zhu, Hong Xu | 2025-08-05 | 下载 | Large Language Model (LLM) inference is growing increasingly complex with the rise of Mixture-of-Experts (MoE) models and disaggregated architectures that decouple components like prefill/decode (PD) ... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Confidence Driven Classification of Application Types in the Presence of Background Network Traffic | Eun Hun Choi, Jasleen Kaur, Vladas Pipiras, Nelson Gomes Rodrigues Antunes, Brendan Massey | 2025-08-05 | 下载 | Accurately classifying the application types of network traffic using deep learning models has recently gained popularity. However, we find that these classifiers do not perform well on real-world tra... |
| Data-Driven Spectrum Demand Prediction: A Spatio-Temporal Framework with Transfer Learning | Amin Farajzadeh, Hongzhao Zheng, Sarah Dumoulin, Trevor Ha, Halim Yanikomeroglu, Amir Ghasemi | 2025-08-05 | 下载 | Accurate spectrum demand prediction is crucial for informed spectrum allocation, effective regulatory planning, and fostering sustainable growth in modern wireless communication networks. |
| CASH: Context-Aware Smart Handover for Reliable UAV Connectivity on Aerial Corridors | Abdul Saboor, Zhuangzhuang Cui, Achiel Colpaert, Evgenii Vinogradov, Sofie Pollin | 2025-08-05 | 下载 | Urban Air Mobility (UAM) envisions aerial corridors for Unmanned Aerial Vehicles (UAVs) to reduce ground traffic congestion by supporting 3D mobility, such as air taxis. |
| What If, But Privately: Private Counterfactual Retrieval | Shreya Meel, Mohamed Nomeir, Pasan Dissanayake, Sanghamitra Dutta, Sennur Ulukus | 2025-08-05 | 下载 | Transparency and explainability are two important aspects to be considered when employing black-box machine learning models in high-stake applications. |
| Decoding and Engineering the Phytobiome Communication for Smart Agriculture | Fatih Gulec, Hamdan Awan, Nigel Wallbridge, Andrew W. Eckford | 2025-08-05 | 下载 | Smart agriculture applications, integrating technologies like the Internet of Things and machine learning/artificial intelligence (ML/AI) into agriculture, hold promise to address modern challenges of... |
| Heterogeneity-Oblivious Robust Federated Learning | Weiyao Zhang, Jinyang Li, Qi Song, Miao Wang, Chungang Lin, Haitong Luo, Xuying Meng, Yujun Zhang | 2025-08-05 | 下载 | Federated Learning (FL) remains highly vulnerable to poisoning attacks, especially under real-world hyper-heterogeneity, where clients differ significantly in data distributions, communication capabil... |
| Agoran: An Agentic Open Marketplace for 6G RAN Automation | Ilias Chatzistefanidis, Navid Nikaein, Andrea Leone, Ali Maatouk, Leandros Tassiulas, Roberto Morabito, Ioannis Pitsiorlas, Marios Kountouris | 2025-08-05 | 下载 | Next-generation mobile networks must reconcile the often-conflicting goals of multiple service owners. However, today's network slice controllers remain rigid, policy-bound, and unaware of the busines... |
| Bidirectional TLS Handshake Caching for Constrained Industrial IoT Scenarios | Jörn Bodenhausen, Simon Mangel, Thomas Vogt, Martin Henze | 2025-08-05 | 下载 | While TLS has become the de-facto standard for end-to-end security, its use to secure critical communication in evolving industrial IoT scenarios is severely limited by prevalent resource constraints ... |
| Directives for Function Offloading in 5G Networks Based on a Performance Characteristics Analysis | Falk Dettinger, Matthias Weiß, Daniel Baumann, Martin Sommer, Michael Weyrich | 2025-08-05 | 下载 | Cloud-based offloading helps address energy consumption and performance challenges in executing resource-intensive vehicle algorithms. Utilizing 5G, with its low latency and high bandwidth, enables se... |
| Energy-efficient Federated Learning for UAV Communications | Chien-Wei Fu, Meng-Lin Ku | 2025-08-05 | 下载 | In this paper, we propose an unmanned aerial vehicle (UAV)-assisted federated learning (FL) framework that jointly optimizes UAV trajectory, user participation, power allocation, and data volume contr... |
| Scalability and Performance Evaluation of IEEE 802.11ah IoT Deployments: A Testbed Approach | Kostas Chounos, Katerina Kyriakou, Thanasis Korakis | 2025-08-05 | 下载 | This work focuses on the development and assessment of modern wireless Internet of Things (IoT) architectures, with relevance to emerging 5G and beyond applications. |
| NANDA Adaptive Resolver: Architecture for Dynamic Resolution of AI Agent Names | John Zinky, Hema Seshadri, Mahesh Lambe, Pradyumna Chari, Ramesh Raskar | 2025-08-05 | 下载 | AdaptiveResolver is a dynamic microservice architecture designed to address the limitations of static endpoint resolution for AI agent communication in distributed, heterogeneous environments. |
| Using the NANDA Index Architecture in Practice: An Enterprise Perspective | Sichao Wang, Ramesh Raskar, Mahesh Lambe, Pradyumna Chari, Rekha Singhal, Shailja Gupta, Rajesh Ranjan, Ken Huang | 2025-08-05 | 下载 | The proliferation of autonomous AI agents represents a paradigmatic shift from traditional web architectures toward collaborative intelligent systems requiring sophisticated mechanisms for discovery, ... |
| Evolution of AI Agent Registry Solutions: Centralized, Enterprise, and Distributed Approaches | Aditi Singh, Abul Ehtesham, Mahesh Lambe, Jared James Grogan, Abhishek Singh, Saket Kumar, Luca Muscariello, Vijoy Pandey, Guillaume Sauvage De Saint Marc, Pradyumna Chari, Ramesh Raskar | 2025-08-05 | 下载 | Autonomous AI agents now operate across cloud, enterprise, and decentralized domains, creating demand for registry infrastructures that enable trustworthy discovery, capability negotiation, and identi... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| RX-INT: A Kernel Engine for Real-Time Detection and Analysis of In-Memory Threats | Arjun Juneja | 2025-08-05 | 下载 | Malware and cheat developers use fileless execution techniques to evade traditional, signature-based security products. These methods include various types of manual mapping, module stomping, and thre... |
| MaLV-OS: Rethinking the Operating System Architecture for Machine Learning in Virtualized Clouds | Stella Bitchebe, Oana Balmau | 2025-08-05 | 下载 | A large body of research has employed Machine Learning (ML) models to develop learned operating systems (OSes) and kernels. The latter dynamically adapts to the job load and dynamically adjusts resour... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Novel Hybrid Optical and STAR IRS System for NTN Communications | Shunyuan Shang, Emna Zedini, Abla Kammoun, Mohamed-Slim Alouini | 2025-08-05 | 下载 | This paper proposes a novel non-terrestrial networks (NTNs) system that integrates optical intelligent reflecting surfaces (OIRS) and simultaneous transmitting and reflecting Intelligent reflecting su... |