2026-02-17

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
DARTH-PUM: A Hybrid Processing-Using-Memory Architecture	Ryan Wong, Ben Feinberg, Saugata Ghose	2026-02-17	Download	Analog processing-using-memory (PUM; a.k.a. in-memory computing) makes use of electrical interactions inside memory arrays to perform bulk matrix-vector multiplication (MVM) operations.
Bit-Width-Aware Design Environment for Few-Shot Learning on Edge AI Hardware	R. Kanda, H. L. Blevec, N. Onizawa, M. Leonardon, V. Gripon, T. Hanyu	2026-02-17	Download	In this study, we propose an implementation methodology of real-time few-shot learning on tiny FPGA SoCs such as the PYNQ-Z1 board with arbitrary fixed-point bit-widths.
PhD Thesis Summary: Methods for Reliability Assessment and Enhancement of Deep Neural Network Hardware Accelerators	Mahdi Taheri	2026-02-17	Download	This manuscript summarizes the work and showcases the impact of the doctoral thesis by introducing novel, cost-efficient methods for assessing and enhancing the reliability of DNN hardware accelerator...
DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs	Parth Patne, Mahdi Taheri, Christian Herglotz, Maksim Jenihhin, Milos Krstic, Michael Hübner	2026-02-17	Download	Early-exit deep neural networks enable adaptive inference by terminating computation when sufficient confidence is achieved, reducing cost for edge AI accelerators in resource-constrained settings.
Iterative LLM-Based Assertion Generation Using Syntax-Semantic Representations for Functional Coverage-Guided Verification	Yonghao Wang, Jiaxin Zhou, Yang Yin, Hongqin Lyu, Zhiteng Chao, Wenchao Ding, Jing Ye, Tiancheng Wang, Huawei Li	2026-02-17	Download	While leveraging LLMs to automatically generate SystemVerilog assertions (SVAs) from natural language specifications holds great potential, existing techniques face a key challenge: LLMs often lack su...
Human-AI Interaction: Evaluating LLM Reasoning on Digital Logic Circuit included Graph Problems, in terms of creativity in design and analysis	Yogeswar Reddy Thota, Setareh Rafatirad, Homayoun Houman, Tooraj Nikoubin	2026-02-17	Download	Large Language Models (LLMs) are increasingly used by undergraduate students as on-demand tutors, yet their reliability on circuit- and diagram-based digital logic problems remains unclear.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Scrutinizing Variables for Checkpoint Using Automatic Differentiation	Xin Huang, Weiping Zhang, Shiman Meng, Wubiao Xu, Xiang Fu, Luanzheng Guo, Kento Sato	2026-02-17	Download	Checkpoint/Restart (C/R) saves the running state of the programs periodically, which consumes considerable system resources. We observe that not every piece of data is involved in the computation in t...
Distributed Order Recording Techniques for Efficient Record-and-Replay of Multi-threaded Programs	Xiang Fu, Shiman Meng, Weiping Zhang, Luanzheng Guo, Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz	2026-02-17	Download	After all these years and all these other shared memory programming frameworks, OpenMP is still the most popular one. However, its greater levels of non-deterministic execution make debugging and tes...
Service Orchestration in the Computing Continuum: Structural Challenges and Vision	Boris Sedlak, Víctor Casamayor Pujol, Ildefons Magrans de Abril, Praveen Kumar Donta, Adel N. Toosi, Schahram Dustdar	2026-02-17	Download	The Computing Continuum (CC) integrates different layers of processing infrastructure, from Edge to Cloud, to optimize service quality through ubiquitous and reliable computation.
Tight Communication Bounds for Distributed Algorithms in the Quantum Routing Model	Fabien Dufoulon, Frédéric Magniez, Gopal Pandurangan	2026-02-17	Download	We present new distributed quantum algorithms for fundamental distributed computing problems, namely, leader election, broadcast, Minimum Spanning Tree (MST), and Breadth-First Search (BFS) tree, in a...
On the Geometric Coherence of Global Aggregation in Federated Graph Neural Networks	Chethana Prasad Kabgere, Shylaja SS	2026-02-17	Download	Federated learning over graph-structured data exposes a fundamental mismatch between standard aggregation mechanisms and the operator nature of graph neural networks (GNNs).
PhD Thesis Summary: Methods for Reliability Assessment and Enhancement of Deep Neural Network Hardware Accelerators	Mahdi Taheri	2026-02-17	Download	This manuscript summarizes the work and showcases the impact of the doctoral thesis by introducing novel, cost-efficient methods for assessing and enhancing the reliability of DNN hardware accelerator...
FlashMem: Supporting Modern DNN Workloads on Mobile with GPU Memory Hierarchy Optimizations	Zhihao Shu, Md Musfiqur Rahman Sanim, Hangyu Zheng, Kunxiong Zhu, Miao Yin, Gagan Agrawal, Wei Niu	2026-02-17	Download	The increasing size and complexity of modern deep neural networks (DNNs) pose significant challenges for on-device inference on mobile GPUs, with limited memory and computational resources.
ROIX-Comp: Optimizing X-ray Computed Tomography Imaging Strategy for Data Reduction and Reconstruction	Amarjit Singh, Kento Sato, Kohei Yoshida, Kentaro Uesugi, Yasumasa Joti, Takaki Hatsui, Andrès Rubio Proaño	2026-02-17	Download	In high-performance computing (HPC) environments, particularly in synchrotron radiation facilities, vast amounts of X-ray images are generated.
Co-Design and Evaluation of a CPU-Free MPI GPU Communication Abstraction and Implementation	Patrick G. Bridges, Derek Schafer, Jack Lange, James B. White, Anthony Skjellum, Evan Suggs, Thomas Hines, Purushotham Bangalore, Matthew G. F. Dosanjh, Whit Schonbein	2026-02-17	Download	Removing the CPU from the communication fast path is essential to efficient GPU-based ML and HPC application performance. However, existing GPU communication APIs either continue to rely on the CPU fo...
SCENE OTA-FD: Self-Centering Noncoherent Estimator for Over-the-Air Federated Distillation	Hao Chen, Zavareh Bozorgasl	2026-02-17	Download	We propose SCENE (Self-Centering Noncoherent Estimator), a pilot-free and phase-invariant aggregation primitive for over-the-air federated distillation (OTA-FD).
PiPNN: Ultra-Scalable Graph-Based Nearest Neighbor Indexing	Tobias Rubel, Richard Wen, Laxman Dhulipala, Lars Gottesbüren, Rajesh Jayaram, Jakub Łącki	2026-02-17	Download	The fastest indexes for Approximate Nearest Neighbor Search today are also the slowest to build: graph-based methods like HNSW and Vamana achieve state-of-the-art query performance but have large cons...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Arterial Network Traffic State Prediction with Connected Vehicle Data: An Abnormality-Aware Spatiotemporal Network	Lei Han, Mohamed Abdel-Aty, Yang-Jun Joo	2026-02-17	Download	Emerging connected-vehicle (CV) data shows great potential in urban traffic monitoring and forecasting. However, prior CV-based studies on arterial traffic measures prediction are limited to simulated...
DNN-Enabled Multi-User Beamforming for Throughput Maximization under Adjustable Fairness	Kaifeng Lu, Markus Rupp, Stefan Schwarz	2026-02-17	Download	Ensuring user fairness in wireless communications is a fundamental challenge, as balancing the trade-off between fairness and sum rate leads to a non-convex, multi-objective optimization whose complex...
On the Geometric Coherence of Global Aggregation in Federated Graph Neural Networks	Chethana Prasad Kabgere, Shylaja SS	2026-02-17	Download	Federated learning over graph-structured data exposes a fundamental mismatch between standard aggregation mechanisms and the operator nature of graph neural networks (GNNs).
AI Sessions for Network-Exposed AI-as-a-Service	Mohaned Chraiti, Merve Saimler	2026-02-17	Download	Cloud-based Artificial Intelligence (AI) inference is increasingly latency- and context-sensitive, yet today's AI-as-a-Service is typically consumed as an application-chosen endpoint, leaving the netw...
AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service	Mohaned Chraiti, Merve Saimler	2026-02-17	Download	With AI-as-a-Service (AIaaS) now deployed across multiple providers and model tiers, selecting the appropriate model instance at run time is increasingly outside the end user's knowledge and operation...
High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration	Mohaned Chraiti, Ozgur Ercetin, Merve Saimler	2026-02-17	Download	To support the emergence of AI-as-a-Service (AIaaS), communication service providers (CSPs) are on the verge of a radical transformation, from pure connectivity providers to AIaaS a managed network ser...

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
The Compute ICE-AGE: Invariant Compute Envelope under Addressable Graph Evolution	Raymond Jay Martin	2026-02-17	Download	This paper presents empirical results from a production-grade C++ implementation of a deterministic semantic state substrate derived from prior formal work on Bounded Local Generator Classes (Martin, ...