
2025-03-27

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| CIMPool: Scalable Neural Network Acceleration for Compute-In-Memory using Weight Pools | Shurui Li, Puneet Gupta | 2025-03-27 | Download | Compute-in-memory (CIM) based neural network accelerators offer a promising solution to the Von Neumann bottleneck by computing directly within memory arrays. |
| Performance Characterizations and Usage Guidelines of Samsung CXL Memory Module Hybrid Prototype | Jianping Zeng, Shuyi Pei, Da Zhang, Yuchen Zhou, Amir Beygi, Xuebin Yao, Ramdas Kachare, Tong Zhang, Zongwang Li, Marie Nguyen, Rekha Pitchumani, Yang Soek Ki, Changhee Jung | 2025-03-27 | Download | The growing prevalence of data-intensive workloads, such as artificial intelligence (AI), machine learning (ML), high-performance computing (HPC), in-memory databases, and real-time analytics, has exp... |
| A Bespoke Design Approach to Low-Power Printed Microprocessors for Machine Learning Applications | Panagiotis Chaidos, Giorgos Armeniakos, Sotirios Xydis, Dimitrios Soudris | 2025-03-27 | Download | Printed electronics have gained significant traction in recent years, presenting a viable path to integrating computing into everyday items, from disposable products to low-cost healthcare. |
| A 71.2-μW Speech Recognition Accelerator with Recurrent Spiking Neural Network | Chih-Chyau Yang, Tian-Sheuan Chang | 2025-03-27 | Download | This paper introduces a 71.2-μW speech recognition accelerator designed for edge devices' real-time applications, emphasizing an ultra low power design. |
| A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices | Ci-Hao Wu, Tian-Sheuan Chang | 2025-03-27 | Download | Transformer-based speech enhancement models yield impressive results. However, their heterogeneous and complex structure restricts model compression potential, resulting in greater complexity and redu... |
| MLDSE: Scaling Design Space Exploration Infrastructure for Multi-Level Hardware | Huanyu Qu, Weihao Zhang, Junfeng Lin, Songchen Ma, Hongyi Li, Luping Shi, Chengzhong Xu | 2025-03-27 | Download | To efficiently support large-scale NNs, multi-level hardware, leveraging advanced integration and interconnection technologies, has emerged as a promising solution to counter the slowdown of Moore's l... |
| Extending Silicon Lifetime: A Review of Design Techniques for Reliable Integrated Circuits | Shaik Jani Babu, Fan Hu, Linyu Zhu, Sonal Singhal, Xinfei Guo | 2025-03-27 | Download | Reliability has become an increasing concern in modern computing. Integrated circuits (ICs) are the backbone of modern computing devices across industries, including artificial intelligence (AI), cons... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Lobster: A GPU-Accelerated Framework for Neurosymbolic Programming | Paul Biberstein, Ziyang Li, Joseph Devietti, Mayur Naik | 2025-03-27 | Download | Neurosymbolic programs combine deep learning with symbolic reasoning to achieve better data efficiency, interpretability, and generalizability compared to standalone deep learning approaches. |
| Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time | Zhaojun Nan, Yunchu Han, Sheng Zhou, Zhisheng Niu | 2025-03-27 | Download | In edge intelligence systems, deep neural network (DNN) partitioning and data offloading can provide real-time task inference for resource-constrained mobile devices. |
| OCEP: An Ontology-Based Complex Event Processing Framework for Healthcare Decision Support in Big Data Analytics | Ritesh Chandra, Sonali Agarwal, Shashi Shekhar Kumar, Navjot Singh | 2025-03-27 | Download | The exponential expansion of real-time data streams across multiple domains needs the development of effective event detection, correlation, and decision-making systems. |
| Ocularone-Bench: Benchmarking DNN Models on GPUs to Assist the Visually Impaired | Suman Raj, Bhavani A Madhabhavi, Kautuk Astu, Arnav A Rajesh, Pratham M, Yogesh Simmhan | 2025-03-27 | Download | VIP navigation requires multiple DNN models for identification, posture analysis, and depth estimation to ensure safe mobility. Using a hazard vest as a unique identifier enhances visibility while sel... |
| MLDSE: Scaling Design Space Exploration Infrastructure for Multi-Level Hardware | Huanyu Qu, Weihao Zhang, Junfeng Lin, Songchen Ma, Hongyi Li, Luping Shi, Chengzhong Xu | 2025-03-27 | Download | To efficiently support large-scale NNs, multi-level hardware, leveraging advanced integration and interconnection technologies, has emerged as a promising solution to counter the slowdown of Moore's l... |
| Asynchronous BFT Consensus Made Wireless | Shuo Liu, Minghui Xu, Tianyi Sun, Xiuzhen Cheng | 2025-03-27 | Download | Asynchronous Byzantine fault-tolerant (BFT) consensus protocols, known for their robustness in unpredictable environments without relying on timing assumptions, are becoming increasingly vital for wir... |
| PilotANN: Memory-Bounded GPU Acceleration for Vector Search | Yuntao Gui, Peiqi Yin, Xiao Yan, Chaorui Zhang, Weixi Zhang, James Cheng | 2025-03-27 | Download | Approximate Nearest Neighbor Search (ANNS) has become fundamental to modern deep learning applications, having gained particular prominence through its integration into recent generative models that w... |
| Optimizing Multi-DNN Inference on Mobile Devices through Heterogeneous Processor Co-Execution | Yunquan Gao, Zhiguo Zhang, Praveen Kumar Donta, Chinmaya Kumar Dehury, Xiujun Wang, Dusit Niyato, Qiyang Zhang | 2025-03-27 | Download | Deep Neural Networks (DNNs) are increasingly deployed across diverse industries, driving demand for mobile device support. However, existing mobile inference frameworks often rely on a single processo... |
| Cloud Resource Allocation with Convex Optimization | Shayan Boghani, Emin Kirimlioglu, Amrita Moturi, Hao-Ting Tso | 2025-03-27 | Download | We present a convex optimization framework for overcoming the limitations of Kubernetes Cluster Autoscaler by intelligently allocating diverse cloud resources while minimizing costs and fragmentation. |
| Solving AI Foundational Model Latency with Telco Infrastructure | Sebastian Barros | 2025-03-27 | Download | Latency remains a critical bottleneck for deploying foundational artificial intelligence (AI) models, such as large language models (LLMs), in customer-facing, real-time applications. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Reliability and Availability in Virtualized Networks: A Survey on Standards, Modeling Approaches, and Research Challenges | Mario Di Mauro, Walter Cerroni, Fabio Postiglione, Massimo Tornatore, Kishor S. Trivedi | 2025-03-27 | Download | The rise of Network Function Virtualization (NFV) has transformed network infrastructures by replacing fixed hardware with software-based Virtualized Network Functions (VNFs), enabling greater agility... |
| Enhancing Mobile Crowdsensing Efficiency: A Coverage-aware Resource Allocation Approach | Yaru Fu, Yue Zhang, Zheng Shi, Yongna Guo, Yalin Liu | 2025-03-27 | Download | In this study, we investigate the resource management challenges in next-generation mobile crowdsensing networks with the goal of minimizing task completion latency while ensuring coverage performance... |
| Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base | Satvik Verma, Qun Wang, E. Wes Bethel | 2025-03-27 | Download | The widespread adoption of Internet of Things (IoT) devices has introduced significant cybersecurity challenges, particularly with the increasing frequency and sophistication of Distributed Denial of ... |
| Static and Repeated Cooperative Games for the Optimization of the AoI in IoT Networks | David Emanuele Corrado Raphael Catania, Alessandro Buratto, Giovanni Perin | 2025-03-27 | Download | Wireless sensing and the internet of things (IoT) are nowadays pervasive in 5G and beyond networks, and they are expected to play a crucial role in 6G. |
| RIS-Measurements for Codebook Design | Paweł Hatka, Marcel Garczyk, Paweł Płaczkiewicz, Dawid Brząkała, Krzysztof Cichoń, Adrian Kliks | 2025-03-27 | Download | Reconfigurable Intelligent Surfaces (RIS) have gained significant attention for some time. Thanks to the possibility of individual steering of each reflecting element of the boards, they are envisaged... |
| Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming | Haopeng Wang, Haiwei Dong, Abdulmotaleb El Saddik | 2025-03-27 | Download | Extended reality (XR) is rapidly advancing, and poised to revolutionize content creation and consumption. In XR, users integrate various sensory inputs to form a cohesive perception of the virtual env... |
| A Deep Reinforcement Learning-based Approach for Adaptive Handover Protocols | Johannes Voigt, Peter Jiacheng Gu, Peter Rost | 2025-03-27 | Download | The use of higher frequencies in mobile communication systems leads to smaller cell sizes, resulting in the deployment of more base stations and an increase in handovers to support user mobility. |
| Optimizing Resource Allocation and Scheduling towards FRMCS and GSM-R networks coexistence in Railway Systems | Mohamed Aziz Aboud, Nawel Zangar, Rami Langar, Marion Berbineau, Jerome Madec | 2025-03-27 | Download | The actual railway communication system used in Europe for high-speed trains (HST) is called the GSM-R system, which is a communication system based on 2G infrastructure. |
| Declarative Traffic Engineering for Low-Latency and Reliable Networking | Jacopo Massa, Stefano Forti, Federica Paganelli, Patrizio Dazzi, Antonio Brogi, Alexander Clemm, Toerless Eckert | 2025-03-27 | Download | Cloud-Edge applications like industrial control systems and connected vehicles demand stringent end-to-end latency guarantees. Among existing data plane candidate solutions for bounded latency network... |
| DemoQuanDT: A Carrier-Grade QKD Network | P. Horoschenkoff, J. Henrich, R. Böhn, I. Khan, J. Rödiger, M. Gunkel, M. Bauch, J. Benda, P. Bläcker, E. Eichhammer, U. Eismann, G. Frenck, H. Griesser, W. Jontofsohn, N. Kopshoff, S. Röhrich, F. Seidl, N. Schark, E. Sollner, D. von Blanckenburg, A. Heinemann, M. Stiemerling, M. Gärtner | 2025-03-27 | Download | Quantum Key Distribution Networks (QKDN) enable secure communication even in the age of powerful quantum computers. In the hands of a network operator, which can offer its service to many users, the e... |
| Solving AI Foundational Model Latency with Telco Infrastructure | Sebastian Barros | 2025-03-27 | Download | Latency remains a critical bottleneck for deploying foundational artificial intelligence (AI) models, such as large language models (LLMs), in customer-facing, real-time applications. |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection | Ryan Marinelli, Josef Pichlmeier, Tamas Bisztray | 2025-03-27 | Download | In this work, we propose a metric called Number of Thoughts (NofT) to determine the difficulty of tasks pre-prompting and support Large Language Models (LLMs) in production contexts. |
| Cloud Resource Allocation with Convex Optimization | Shayan Boghani, Emin Kirimlioglu, Amrita Moturi, Hao-Ting Tso | 2025-03-27 | Download | We present a convex optimization framework for overcoming the limitations of Kubernetes Cluster Autoscaler by intelligently allocating diverse cloud resources while minimizing costs and fragmentation. |
