2025-03-27
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| CIMPool: Scalable Neural Network Acceleration for Compute-In-Memory using Weight Pools | Shurui Li, Puneet Gupta | 2025-03-27 | Download | Compute-in-memory (CIM) based neural network accelerators offer a promising solution to the Von Neumann bottleneck by computing directly within memory arrays. |
| Performance Characterizations and Usage Guidelines of Samsung CXL Memory Module Hybrid Prototype | Jianping Zeng, Shuyi Pei, Da Zhang, Yuchen Zhou, Amir Beygi, Xuebin Yao, Ramdas Kachare, Tong Zhang, Zongwang Li, Marie Nguyen, Rekha Pitchumani, Yang Soek Ki, Changhee Jung | 2025-03-27 | Download | The growing prevalence of data-intensive workloads, such as artificial intelligence (AI), machine learning (ML), high-performance computing (HPC), in-memory databases, and real-time analytics, has exp... |
| A Bespoke Design Approach to Low-Power Printed Microprocessors for Machine Learning Applications | Panagiotis Chaidos, Giorgos Armeniakos, Sotirios Xydis, Dimitrios Soudris | 2025-03-27 | Download | Printed electronics have gained significant traction in recent years, presenting a viable path to integrating computing into everyday items, from disposable products to low-cost healthcare. |
| A 71.2-μW Speech Recognition Accelerator with Recurrent Spiking Neural Network | Chih-Chyau Yang, Tian-Sheuan Chang | 2025-03-27 | Download | This paper introduces a 71.2-μW speech recognition accelerator designed for edge devices' real-time applications, emphasizing an ultra low power design. |
| A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices | Ci-Hao Wu, Tian-Sheuan Chang | 2025-03-27 | Download | Transformer-based speech enhancement models yield impressive results. However, their heterogeneous and complex structure restricts model compression potential, resulting in greater complexity and redu... |
| MLDSE: Scaling Design Space Exploration Infrastructure for Multi-Level Hardware | Huanyu Qu, Weihao Zhang, Junfeng Lin, Songchen Ma, Hongyi Li, Luping Shi, Chengzhong Xu | 2025-03-27 | Download | To efficiently support large-scale NNs, multi-level hardware, leveraging advanced integration and interconnection technologies, has emerged as a promising solution to counter the slowdown of Moore's l... |
| Extending Silicon Lifetime: A Review of Design Techniques for Reliable Integrated Circuits | Shaik Jani Babu, Fan Hu, Linyu Zhu, Sonal Singhal, Xinfei Guo | 2025-03-27 | Download | Reliability has become an increasing concern in modern computing. Integrated circuits (ICs) are the backbone of modern computing devices across industries, including artificial intelligence (AI), cons... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Lobster: A GPU-Accelerated Framework for Neurosymbolic Programming | Paul Biberstein, Ziyang Li, Joseph Devietti, Mayur Naik | 2025-03-27 | Download | Neurosymbolic programs combine deep learning with symbolic reasoning to achieve better data efficiency, interpretability, and generalizability compared to standalone deep learning approaches. |
| Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time | Zhaojun Nan, Yunchu Han, Sheng Zhou, Zhisheng Niu | 2025-03-27 | Download | In edge intelligence systems, deep neural network (DNN) partitioning and data offloading can provide real-time task inference for resource-constrained mobile devices. |
| OCEP: An Ontology-Based Complex Event Processing Framework for Healthcare Decision Support in Big Data Analytics | Ritesh Chandra, Sonali Agarwal, Shashi Shekhar Kumar, Navjot Singh | 2025-03-27 | Download | The exponential expansion of real-time data streams across multiple domains needs the development of effective event detection, correlation, and decision-making systems. |
| Ocularone-Bench: Benchmarking DNN Models on GPUs to Assist the Visually Impaired | Suman Raj, Bhavani A Madhabhavi, Kautuk Astu, Arnav A Rajesh, Pratham M, Yogesh Simmhan | 2025-03-27 | Download | VIP navigation requires multiple DNN models for identification, posture analysis, and depth estimation to ensure safe mobility. Using a hazard vest as a unique identifier enhances visibility while sel... |
| MLDSE: Scaling Design Space Exploration Infrastructure for Multi-Level Hardware | Huanyu Qu, Weihao Zhang, Junfeng Lin, Songchen Ma, Hongyi Li, Luping Shi, Chengzhong Xu | 2025-03-27 | Download | To efficiently support large-scale NNs, multi-level hardware, leveraging advanced integration and interconnection technologies, has emerged as a promising solution to counter the slowdown of Moore's l... |
| Asynchronous BFT Consensus Made Wireless | Shuo Liu, Minghui Xu, Tianyi Sun, Xiuzhen Cheng | 2025-03-27 | Download | Asynchronous Byzantine fault-tolerant (BFT) consensus protocols, known for their robustness in unpredictable environments without relying on timing assumptions, are becoming increasingly vital for wir... |
| PilotANN: Memory-Bounded GPU Acceleration for Vector Search | Yuntao Gui, Peiqi Yin, Xiao Yan, Chaorui Zhang, Weixi Zhang, James Cheng | 2025-03-27 | Download | Approximate Nearest Neighbor Search (ANNS) has become fundamental to modern deep learning applications, having gained particular prominence through its integration into recent generative models that w... |
| Optimizing Multi-DNN Inference on Mobile Devices through Heterogeneous Processor Co-Execution | Yunquan Gao, Zhiguo Zhang, Praveen Kumar Donta, Chinmaya Kumar Dehury, Xiujun Wang, Dusit Niyato, Qiyang Zhang | 2025-03-27 | Download | Deep Neural Networks (DNNs) are increasingly deployed across diverse industries, driving demand for mobile device support. However, existing mobile inference frameworks often rely on a single processo... |
| Cloud Resource Allocation with Convex Optimization | Shayan Boghani, Emin Kirimlioglu, Amrita Moturi, Hao-Ting Tso | 2025-03-27 | Download | We present a convex optimization framework for overcoming the limitations of Kubernetes Cluster Autoscaler by intelligently allocating diverse cloud resources while minimizing costs and fragmentation. |
| Solving AI Foundational Model Latency with Telco Infrastructure | Sebastian Barros | 2025-03-27 | Download | Latency remains a critical bottleneck for deploying foundational artificial intelligence (AI) models, such as large language models (LLMs), in customer-facing, real-time applications. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Reliability and Availability in Virtualized Networks: A Survey on Standards, Modeling Approaches, and Research Challenges | Mario Di Mauro, Walter Cerroni, Fabio Postiglione, Massimo Tornatore, Kishor S. Trivedi | 2025-03-27 | Download | The rise of Network Function Virtualization (NFV) has transformed network infrastructures by replacing fixed hardware with software-based Virtualized Network Functions (VNFs), enabling greater agility... |
| Enhancing Mobile Crowdsensing Efficiency: A Coverage-aware Resource Allocation Approach | Yaru Fu, Yue Zhang, Zheng Shi, Yongna Guo, Yalin Liu | 2025-03-27 | Download | In this study, we investigate the resource management challenges in next-generation mobile crowdsensing networks with the goal of minimizing task completion latency while ensuring coverage performance... |
| Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base | Satvik Verma, Qun Wang, E. Wes Bethel | 2025-03-27 | Download | The widespread adoption of Internet of Things (IoT) devices has introduced significant cybersecurity challenges, particularly with the increasing frequency and sophistication of Distributed Denial of ... |
| Static and Repeated Cooperative Games for the Optimization of the AoI in IoT Networks | David Emanuele Corrado Raphael Catania, Alessandro Buratto, Giovanni Perin | 2025-03-27 | Download | Wireless sensing and the internet of things (IoT) are nowadays pervasive in 5G and beyond networks, and they are expected to play a crucial role in 6G. |
| RIS-Measurements for Codebook Design | Paweł Hatka, Marcel Garczyk, Paweł Płaczkiewicz, Dawid Brząkała, Krzysztof Cichoń, Adrian Kliks | 2025-03-27 | Download | Reconfigurable Intelligent Surfaces (RIS) have gained significant attention for some time. Thanks to the possibility of individual steering of each reflecting element of the boards, they are envisaged... |
| Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming | Haopeng Wang, Haiwei Dong, Abdulmotaleb El Saddik | 2025-03-27 | Download | Extended reality (XR) is rapidly advancing, and poised to revolutionize content creation and consumption. In XR, users integrate various sensory inputs to form a cohesive perception of the virtual env... |
| A Deep Reinforcement Learning-based Approach for Adaptive Handover Protocols | Johannes Voigt, Peter Jiacheng Gu, Peter Rost | 2025-03-27 | Download | The use of higher frequencies in mobile communication systems leads to smaller cell sizes, resulting in the deployment of more base stations and an increase in handovers to support user mobility. |
| Optimizing Resource Allocation and Scheduling towards FRMCS and GSM-R networks coexistence in Railway Systems | Mohamed Aziz Aboud, Nawel Zangar, Rami Langar, Marion Berbineau, Jerome Madec | 2025-03-27 | Download | The actual railway communication system used in Europe for high-speed trains (HST) is called the GSM-R system, which is a communication system based on 2G infrastructure. |
| Declarative Traffic Engineering for Low-Latency and Reliable Networking | Jacopo Massa, Stefano Forti, Federica Paganelli, Patrizio Dazzi, Antonio Brogi, Alexander Clemm, Toerless Eckert | 2025-03-27 | Download | Cloud-Edge applications like industrial control systems and connected vehicles demand stringent end-to-end latency guarantees. Among existing data plane candidate solutions for bounded latency network... |
| DemoQuanDT: A Carrier-Grade QKD Network | P. Horoschenkoff, J. Henrich, R. Böhn, I. Khan, J. Rödiger, M. Gunkel, M. Bauch, J. Benda, P. Bläcker, E. Eichhammer, U. Eismann, G. Frenck, H. Griesser, W. Jontofsohn, N. Kopshoff, S. Röhrich, F. Seidl, N. Schark, E. Sollner, D. von Blanckenburg, A. Heinemann, M. Stiemerling, M. Gärtner | 2025-03-27 | Download | Quantum Key Distribution Networks (QKDN) enable secure communication even in the age of powerful quantum computers. In the hands of a network operator, which can offer its service to many users, the e... |
| Solving AI Foundational Model Latency with Telco Infrastructure | Sebastian Barros | 2025-03-27 | Download | Latency remains a critical bottleneck for deploying foundational artificial intelligence (AI) models, such as large language models (LLMs), in customer-facing, real-time applications. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection | Ryan Marinelli, Josef Pichlmeier, Tamas Bisztray | 2025-03-27 | Download | In this work, we propose a metric called Number of Thoughts (NofT) to determine the difficulty of tasks pre-prompting and support Large Language Models (LLMs) in production contexts. |
| Cloud Resource Allocation with Convex Optimization | Shayan Boghani, Emin Kirimlioglu, Amrita Moturi, Hao-Ting Tso | 2025-03-27 | Download | We present a convex optimization framework for overcoming the limitations of Kubernetes Cluster Autoscaler by intelligently allocating diverse cloud resources while minimizing costs and fragmentation. |