Skip to content

2026-02-17

cs.AR - Architecture

标题作者发布日期PDF摘要
DARTH-PUM: A Hybrid Processing-Using-Memory ArchitectureRyan Wong, Ben Feinberg, Saugata Ghose2026-02-17下载Analog processing-using-memory (PUM; a.k.a. in-memory computing) makes use of electrical interactions inside memory arrays to perform bulk matrix-vector multiplication (MVM) operations.
Bit-Width-Aware Design Environment for Few-Shot Learning on Edge AI HardwareR. Kanda, H. L. Blevec, N. Onizawa, M. Leonardon, V. Gripon, T. Hanyu2026-02-17下载In this study, we propose an implementation methodology of real-time few-shot learning on tiny FPGA SoCs such as the PYNQ-Z1 board with arbitrary fixed-point bit-widths.
PhD Thesis Summary: Methods for Reliability Assessment and Enhancement of Deep Neural Network Hardware AcceleratorsMahdi Taheri2026-02-17下载This manuscript summarizes the work and showcases the impact of the doctoral thesis by introducing novel, cost-efficient methods for assessing and enhancing the reliability of DNN hardware accelerator...
DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNsParth Patne, Mahdi Taheri, Christian Herglotz, Maksim Jenihhin, Milos Krstic, Michael Hübner2026-02-17下载Early-exit deep neural networks enable adaptive inference by terminating computation when sufficient confidence is achieved, reducing cost for edge AI accelerators in resource-constrained settings.
Iterative LLM-Based Assertion Generation Using Syntax-Semantic Representations for Functional Coverage-Guided VerificationYonghao Wang, Jiaxin Zhou, Yang Yin, Hongqin Lyu, Zhiteng Chao, Wenchao Ding, Jing Ye, Tiancheng Wang, Huawei Li2026-02-17下载While leveraging LLMs to automatically generate SystemVerilog assertions (SVAs) from natural language specifications holds great potential, existing techniques face a key challenge: LLMs often lack su...
Human-AI Interaction: Evaluating LLM Reasoning on Digital Logic Circuit included Graph Problems, in terms of creativity in design and analysisYogeswar Reddy Thota, Setareh Rafatirad, Homayoun Houman, Tooraj Nikoubin2026-02-17下载Large Language Models (LLMs) are increasingly used by undergraduate students as on-demand tutors, yet their reliability on circuit- and diagram-based digital logic problems remains unclear.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Scrutinizing Variables for Checkpoint Using Automatic DifferentiationXin Huang, Weiping Zhang, Shiman Meng, Wubiao Xu, Xiang Fu, Luanzheng Guo, Kento Sato2026-02-17下载Checkpoint/Restart (C/R) saves the running state of the programs periodically, which consumes considerable system resources. We observe that not every piece of data is involved in the computation in t...
Distributed Order Recording Techniques for Efficient Record-and-Replay of Multi-threaded ProgramsXiang Fu, Shiman Meng, Weiping Zhang, Luanzheng Guo, Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz2026-02-17下载After all these years and all these other shared memory programming frameworks, OpenMP is still the most popular one. However, its greater levels of non-deterministic execution makes debugging and tes...
Service Orchestration in the Computing Continuum: Structural Challenges and VisionBoris Sedlak, Víctor Casamayor Pujol, Ildefons Magrans de Abril, Praveen Kumar Donta, Adel N. Toosi, Schahram Dustdar2026-02-17下载The Computing Continuum (CC) integrates different layers of processing infrastructure, from Edge to Cloud, to optimize service quality through ubiquitous and reliable computation.
Tight Communication Bounds for Distributed Algorithms in the Quantum Routing ModelFabien Dufoulon, Frédéric Magniez, Gopal Pandurangan2026-02-17下载We present new distributed quantum algorithms for fundamental distributed computing problems, namely, leader election, broadcast, Minimum Spanning Tree (MST), and Breadth-First Search (BFS) tree, in a...
On the Geometric Coherence of Global Aggregation in Federated Graph Neural NetworksChethana Prasad Kabgere, Shylaja SS2026-02-17下载Federated learning over graph-structured data exposes a fundamental mismatch between standard aggregation mechanisms and the operator nature of graph neural networks (GNNs).
PhD Thesis Summary: Methods for Reliability Assessment and Enhancement of Deep Neural Network Hardware AcceleratorsMahdi Taheri2026-02-17下载This manuscript summarizes the work and showcases the impact of the doctoral thesis by introducing novel, cost-efficient methods for assessing and enhancing the reliability of DNN hardware accelerator...
FlashMem: Supporting Modern DNN Workloads on Mobile with GPU Memory Hierarchy OptimizationsZhihao Shu, Md Musfiqur Rahman Sanim, Hangyu Zheng, Kunxiong Zhu, Miao Yin, Gagan Agrawal, Wei Niu2026-02-17下载The increasing size and complexity of modern deep neural networks (DNNs) pose significant challenges for on-device inference on mobile GPUs, with limited memory and computational resources.
ROIX-Comp: Optimizing X-ray Computed Tomography Imaging Strategy for Data Reduction and ReconstructionAmarjit Singh, Kento Sato, Kohei Yoshida, Kentaro Uesugi, Yasumasa Joti, Takaki Hatsui, Andrès Rubio Proaño2026-02-17下载In high-performance computing (HPC) environments, particularly in synchrotron radiation facilities, vast amounts of X-ray images are generated.
Co-Design and Evaluation of a CPU-Free MPI GPU Communication Abstraction and ImplementationPatrick G. Bridges, Derek Schafer, Jack Lange, James B. White, Anthony Skjellum, Evan Suggs, Thomas Hines, Purushotham Bangalore, Matthew G. F. Dosanjh, Whit Schonbein2026-02-17下载Removing the CPU from the communication fast path is essential to efficient GPU-based ML and HPC application performance. However, existing GPU communication APIs either continue to rely on the CPU fo...
SCENE OTA-FD: Self-Centering Noncoherent Estimator for Over-the-Air Federated DistillationHao Chen, Zavareh Bozorgasl2026-02-17下载We propose SCENE (Self-Centering Noncoherent Estimator), a pilot-free and phase-invariant aggregation primitive for over-the-air federated distillation (OTA-FD).
PiPNN: Ultra-Scalable Graph-Based Nearest Neighbor IndexingTobias Rubel, Richard Wen, Laxman Dhulipala, Lars Gottesbüren, Rajesh Jayaram, Jakub Łącki2026-02-17下载The fastest indexes for Approximate Nearest Neighbor Search today are also the slowest to build: graph-based methods like HNSW and Vamana achieve state-of-the-art query performance but have large cons...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Arterial Network Traffic State Prediction with Connected Vehicle Data: An Abnormality-Aware Spatiotemporal NetworkLei Han, Mohamed Abdel-Aty, Yang-Jun Joo2026-02-17下载Emerging connected-vehicle (CV) data shows great potential in urban traffic monitoring and forecasting. However, prior CV-based studies on arterial traffic measures prediction are limited to simulated...
DNN-Enabled Multi-User Beamforming for Throughput Maximization under Adjustable FairnessKaifeng Lu, Markus Rupp, Stefan Schwarz2026-02-17下载Ensuring user fairness in wireless communications is a fundamental challenge, as balancing the trade-off between fairness and sum rate leads to a non-convex, multi-objective optimization whose complex...
On the Geometric Coherence of Global Aggregation in Federated Graph Neural NetworksChethana Prasad Kabgere, Shylaja SS2026-02-17下载Federated learning over graph-structured data exposes a fundamental mismatch between standard aggregation mechanisms and the operator nature of graph neural networks (GNNs).
AI Sessions for Network-Exposed AI-as-a-ServiceMohaned Chraiti, Merve Saimler2026-02-17下载Cloud-based Artificial Intelligence (AI) inference is increasingly latency- and context-sensitive, yet today's AI-as-a-Service is typically consumed as an application-chosen endpoint, leaving the netw...
AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-ServiceMohaned Chraiti, Merve Saimler2026-02-17下载With AI-as-a-Service (AIaaS) now deployed across multiple providers and model tiers, selecting the appropriate model instance at run time is increasingly outside the end user's knowledge and operation...
High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain OrchestrationMohaned Chraiti, Ozgur Ercetin, Merve Saimler2026-02-17下载To support the emergence of AI-as-a-Service (AIaaS), communication service providers (CSPs) are on the verge of a radical transformation-from pure connectivity providers to AIaaS a managed network ser...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
The Compute ICE-AGE: Invariant Compute Envelope under Addressable Graph EvolutionRaymond Jay Martin2026-02-17下载This paper presents empirical results from a production-grade C++ implementation of a deterministic semantic state substrate derived from prior formal work on Bounded Local Generator Classes (Martin, ...

基于 VitePress 构建