2025-03-27

cs.AR - Hardware Architecture

Title	Authors	Published	PDF	Abstract
CIMPool: Scalable Neural Network Acceleration for Compute-In-Memory using Weight Pools	Shurui Li, Puneet Gupta	2025-03-27	Download	Compute-in-memory (CIM) based neural network accelerators offer a promising solution to the Von Neumann bottleneck by computing directly within memory arrays.
Performance Characterizations and Usage Guidelines of Samsung CXL Memory Module Hybrid Prototype	Jianping Zeng, Shuyi Pei, Da Zhang, Yuchen Zhou, Amir Beygi, Xuebin Yao, Ramdas Kachare, Tong Zhang, Zongwang Li, Marie Nguyen, Rekha Pitchumani, Yang Soek Ki, Changhee Jung	2025-03-27	Download	The growing prevalence of data-intensive workloads, such as artificial intelligence (AI), machine learning (ML), high-performance computing (HPC), in-memory databases, and real-time analytics, has exp...
A Bespoke Design Approach to Low-Power Printed Microprocessors for Machine Learning Applications	Panagiotis Chaidos, Giorgos Armeniakos, Sotirios Xydis, Dimitrios Soudris	2025-03-27	Download	Printed electronics have gained significant traction in recent years, presenting a viable path to integrating computing into everyday items, from disposable products to low-cost healthcare.
A 71.2-μW Speech Recognition Accelerator with Recurrent Spiking Neural Network	Chih-Chyau Yang, Tian-Sheuan Chang	2025-03-27	Download	This paper introduces a 71.2-μW speech recognition accelerator designed for edge devices' real-time applications, emphasizing an ultra low power design.
A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices	Ci-Hao Wu, Tian-Sheuan Chang	2025-03-27	Download	Transformer-based speech enhancement models yield impressive results. However, their heterogeneous and complex structure restricts model compression potential, resulting in greater complexity and redu...
MLDSE: Scaling Design Space Exploration Infrastructure for Multi-Level Hardware	Huanyu Qu, Weihao Zhang, Junfeng Lin, Songchen Ma, Hongyi Li, Luping Shi, Chengzhong Xu	2025-03-27	Download	To efficiently support large-scale NNs, multi-level hardware, leveraging advanced integration and interconnection technologies, has emerged as a promising solution to counter the slowdown of Moore's l...
Extending Silicon Lifetime: A Review of Design Techniques for Reliable Integrated Circuits	Shaik Jani Babu, Fan Hu, Linyu Zhu, Sonal Singhal, Xinfei Guo	2025-03-27	Download	Reliability has become an increasing concern in modern computing. Integrated circuits (ICs) are the backbone of modern computing devices across industries, including artificial intelligence (AI), cons...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Lobster: A GPU-Accelerated Framework for Neurosymbolic Programming	Paul Biberstein, Ziyang Li, Joseph Devietti, Mayur Naik	2025-03-27	Download	Neurosymbolic programs combine deep learning with symbolic reasoning to achieve better data efficiency, interpretability, and generalizability compared to standalone deep learning approaches.
Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time	Zhaojun Nan, Yunchu Han, Sheng Zhou, Zhisheng Niu	2025-03-27	Download	In edge intelligence systems, deep neural network (DNN) partitioning and data offloading can provide real-time task inference for resource-constrained mobile devices.
OCEP: An Ontology-Based Complex Event Processing Framework for Healthcare Decision Support in Big Data Analytics	Ritesh Chandra, Sonali Agarwal, Shashi Shekhar Kumar, Navjot Singh	2025-03-27	Download	The exponential expansion of real-time data streams across multiple domains needs the development of effective event detection, correlation, and decision-making systems.
Ocularone-Bench: Benchmarking DNN Models on GPUs to Assist the Visually Impaired	Suman Raj, Bhavani A Madhabhavi, Kautuk Astu, Arnav A Rajesh, Pratham M, Yogesh Simmhan	2025-03-27	Download	VIP navigation requires multiple DNN models for identification, posture analysis, and depth estimation to ensure safe mobility. Using a hazard vest as a unique identifier enhances visibility while sel...
MLDSE: Scaling Design Space Exploration Infrastructure for Multi-Level Hardware	Huanyu Qu, Weihao Zhang, Junfeng Lin, Songchen Ma, Hongyi Li, Luping Shi, Chengzhong Xu	2025-03-27	Download	To efficiently support large-scale NNs, multi-level hardware, leveraging advanced integration and interconnection technologies, has emerged as a promising solution to counter the slowdown of Moore's l...
Asynchronous BFT Consensus Made Wireless	Shuo Liu, Minghui Xu, Tianyi Sun, Xiuzhen Cheng	2025-03-27	Download	Asynchronous Byzantine fault-tolerant (BFT) consensus protocols, known for their robustness in unpredictable environments without relying on timing assumptions, are becoming increasingly vital for wir...
PilotANN: Memory-Bounded GPU Acceleration for Vector Search	Yuntao Gui, Peiqi Yin, Xiao Yan, Chaorui Zhang, Weixi Zhang, James Cheng	2025-03-27	Download	Approximate Nearest Neighbor Search (ANNS) has become fundamental to modern deep learning applications, having gained particular prominence through its integration into recent generative models that w...
Optimizing Multi-DNN Inference on Mobile Devices through Heterogeneous Processor Co-Execution	Yunquan Gao, Zhiguo Zhang, Praveen Kumar Donta, Chinmaya Kumar Dehury, Xiujun Wang, Dusit Niyato, Qiyang Zhang	2025-03-27	Download	Deep Neural Networks (DNNs) are increasingly deployed across diverse industries, driving demand for mobile device support. However, existing mobile inference frameworks often rely on a single processo...
Cloud Resource Allocation with Convex Optimization	Shayan Boghani, Emin Kirimlioglu, Amrita Moturi, Hao-Ting Tso	2025-03-27	Download	We present a convex optimization framework for overcoming the limitations of Kubernetes Cluster Autoscaler by intelligently allocating diverse cloud resources while minimizing costs and fragmentation.
Solving AI Foundational Model Latency with Telco Infrastructure	Sebastian Barros	2025-03-27	Download	Latency remains a critical bottleneck for deploying foundational artificial intelligence (AI) models, such as large language models (LLMs), in customer-facing, real-time applications.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Reliability and Availability in Virtualized Networks: A Survey on Standards, Modeling Approaches, and Research Challenges	Mario Di Mauro, Walter Cerroni, Fabio Postiglione, Massimo Tornatore, Kishor S. Trivedi	2025-03-27	Download	The rise of Network Function Virtualization (NFV) has transformed network infrastructures by replacing fixed hardware with software-based Virtualized Network Functions (VNFs), enabling greater agility...
Enhancing Mobile Crowdsensing Efficiency: A Coverage-aware Resource Allocation Approach	Yaru Fu, Yue Zhang, Zheng Shi, Yongna Guo, Yalin Liu	2025-03-27	Download	In this study, we investigate the resource management challenges in next-generation mobile crowdsensing networks with the goal of minimizing task completion latency while ensuring coverage performance...
Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base	Satvik Verma, Qun Wang, E. Wes Bethel	2025-03-27	Download	The widespread adoption of Internet of Things (IoT) devices has introduced significant cybersecurity challenges, particularly with the increasing frequency and sophistication of Distributed Denial of ...
Static and Repeated Cooperative Games for the Optimization of the AoI in IoT Networks	David Emanuele Corrado Raphael Catania, Alessandro Buratto, Giovanni Perin	2025-03-27	Download	Wireless sensing and the internet of things (IoT) are nowadays pervasive in 5G and beyond networks, and they are expected to play a crucial role in 6G.
RIS-Measurements for Codebook Design	Paweł Hatka, Marcel Garczyk, Paweł Płaczkiewicz, Dawid Brząkała, Krzysztof Cichoń, Adrian Kliks	2025-03-27	Download	Reconfigurable Intelligent Surfaces (RIS) have gained significant attention for some time. Thanks to the possibility of individual steering of each reflecting element of the boards, they are envisaged...
Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming	Haopeng Wang, Haiwei Dong, Abdulmotaleb El Saddik	2025-03-27	Download	Extended reality (XR) is rapidly advancing, and poised to revolutionize content creation and consumption. In XR, users integrate various sensory inputs to form a cohesive perception of the virtual env...
A Deep Reinforcement Learning-based Approach for Adaptive Handover Protocols	Johannes Voigt, Peter Jiacheng Gu, Peter Rost	2025-03-27	Download	The use of higher frequencies in mobile communication systems leads to smaller cell sizes, resulting in the deployment of more base stations and an increase in handovers to support user mobility.
Optimizing Resource Allocation and Scheduling towards FRMCS and GSM-R networks coexistence in Railway Systems	Mohamed Aziz Aboud, Nawel Zangar, Rami Langar, Marion Berbineau, Jerome Madec	2025-03-27	Download	The actual railway communication system used in Europe for high-speed trains (HST) is called the GSM-R system, which is a communication system based on 2G infrastructure.
Declarative Traffic Engineering for Low-Latency and Reliable Networking	Jacopo Massa, Stefano Forti, Federica Paganelli, Patrizio Dazzi, Antonio Brogi, Alexander Clemm, Toerless Eckert	2025-03-27	Download	Cloud-Edge applications like industrial control systems and connected vehicles demand stringent end-to-end latency guarantees. Among existing data plane candidate solutions for bounded latency network...
DemoQuanDT: A Carrier-Grade QKD Network	P. Horoschenkoff, J. Henrich, R. Böhn, I. Khan, J. Rödiger, M. Gunkel, M. Bauch, J. Benda, P. Bläcker, E. Eichhammer, U. Eismann, G. Frenck, H. Griesser, W. Jontofsohn, N. Kopshoff, S. Röhrich, F. Seidl, N. Schark, E. Sollner, D. von Blanckenburg, A. Heinemann, M. Stiemerling, M. Gärtner	2025-03-27	Download	Quantum Key Distribution Networks (QKDN) enable secure communication even in the age of powerful quantum computers. In the hands of a network operator, which can offer its service to many users, the e...
Solving AI Foundational Model Latency with Telco Infrastructure	Sebastian Barros	2025-03-27	Download	Latency remains a critical bottleneck for deploying foundational artificial intelligence (AI) models, such as large language models (LLMs), in customer-facing, real-time applications.

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection	Ryan Marinelli, Josef Pichlmeier, Tamas Bisztray	2025-03-27	Download	In this work, we propose a metric called Number of Thoughts (NofT) to determine the difficulty of tasks pre-prompting and support Large Language Models (LLMs) in production contexts.
Cloud Resource Allocation with Convex Optimization	Shayan Boghani, Emin Kirimlioglu, Amrita Moturi, Hao-Ting Tso	2025-03-27	Download	We present a convex optimization framework for overcoming the limitations of Kubernetes Cluster Autoscaler by intelligently allocating diverse cloud resources while minimizing costs and fragmentation.