Skip to content

2026-02-16

cs.AR - Architecture

标题作者发布日期PDF摘要
The Turbo-Charged Mapper: Fast and Optimal Mapping for Accelerator Modeling and EvaluationMichael Gilbert, Tanner Andrulis, Vivienne Sze, Joel S. Emer2026-02-16下载The energy and latency of an accelerator running a deep neural network (DNN) depend on how the computation and data movement are scheduled in the accelerator (i.e., mapping).
Fast and Fusiest: An Optimal Fusion-Aware Mapper for Accelerator Modeling and EvaluationTanner Andrulis, Michael Gilbert, Vivienne Sze, Joel S. Emer2026-02-16下载The latency and energy of tensor algebra accelerators depend on how data movement and operations are scheduled (i.e., mapped) onto accelerators, so determining the potential of an accelerator architec...
Qute: Towards Quantum-Native DatabaseMuzhi Chen, Xuanhe Zhou, Wei Zhou, Bangrui Xu, Surui Tang, Guoliang Li, Bingsheng He, Yeye He, Yitong Song, Fan Wu2026-02-16下载This paper envisions a quantum database (Qute) that treats quantum computation as a first-class execution option. Unlike prior simulation-based methods that either run quantum algorithms on classical ...
RNM-TD3: N:M Semi-structured Sparse Reinforcement Learning From ScratchIsam Vrce, Andreas Kassler, Gökçe Aydos2026-02-16下载Sparsity is a well-studied technique for compressing deep neural networks (DNNs) without compromising performance. In deep reinforcement learning (DRL), neural networks with up to 5% of their original...
Scope: A Scalable Merged Pipeline Framework for Multi-Chip-Module NN AcceleratorsZongle Huang, Hongyang Jia, Kaiwei Zou, Yongpan Liu2026-02-16下载Neural network (NN) accelerators with multi-chip-module (MCM) architectures enable integration of massive computation capability; however, they face challenges of computing resource underutilization a...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Distributed Semi-Speculative Parallel Anisotropic Mesh AdaptationKevin Garner, Polykarpos Thomadakis, Nikos Chrisochoides2026-02-16下载This paper presents a distributed memory method for anisotropic mesh adaptation that is designed to avoid the use of collective communication and global synchronization techniques.
Atomix: Timely, Transactional Tool Use for Reliable Agentic WorkflowsBardia Mohammadi, Nearchos Potamitis, Lars Klein, Akhil Arora, Laurent Bindschaedler2026-02-16下载LLM agents increasingly act on external systems, yet tool effects are immediate. Under failures, speculation, or contention, losing branches can leak unintended side effects with no safe rollback.
Evaluation of Dynamic Vector Bin Packing for Virtual Machine PlacementZong Yu Lee, Xueyan Tang2026-02-16下载Virtual machine placement is a crucial challenge in cloud computing for efficiently utilizing physical machine resources in data centers. Virtual machine placement can be formulated as a MinUsageTime ...
Efficient Multi-round LLM Inference over Disaggregated ServingWenhao He, Youhe Jiang, Penghao Zhao, Quanqing Xu, Eiko Yoneki, Bin Cui, Fangcheng Fu2026-02-16下载With the rapid evolution of Large Language Models (LLMs), multi-round workflows, such as autonomous agents and iterative retrieval, have become increasingly prevalent.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
A Scan-Based Analysis of Internet-Exposed IoT Devices Using Shodan DataRichelle Williams, Fernando Koch2026-02-16下载An open measurement problem in IoT security is whether scan-observable network configurations encode population-level exposure risk beyond individual devices.
Data-Driven Optimization of Multi-Generational Cellular Networks: A Performance Classification Framework for Strategic Infrastructure ManagementMaryam Sabahat, M. Umar Khan2026-02-16下载The exponential growth in mobile data demand necessitates intelligent management of telecommunications infrastructure to ensure Quality of Service (QoS) and operational efficiency.
Exploring Performance Tradeoffs in Age-Aware Remote Monitoring with SatellitesSunjung Kang, Vishrant Tripathi, Christopher G. Brinton2026-02-16下载We investigate a remote monitoring framework with multiple sensing modalities including IoT sensors on the ground, mobile UAVs in the air, and a periodically available satellite constellation.
Instruction-Set Architecture for Programmable NV-Center Quantum Repeater NodesVinay Kumar, Claudio Cicconetti, Riccardo Bassoli, Marco Conti, Andrea Passarella2026-02-16下载Programmability is increasingly central in emerging quantum network software stacks, yet the node-internal controller-to-hardware interface for quantum repeater devices remains under-specified.
When Scaling Fails: Network and Fabric Effects on Distributed GPU Training PerformanceDinesh Gopalan, Ratul Ali2026-02-16下载Scaling distributed GPU training is commonly assumed to yield predictable performance gains as additional nodes are added. In practice, many large-scale deployments encounter diminishing returns and u...
ASA: Adaptive Smart Agent Federated Learning via Device-Aware Clustering for Heterogeneous IoTAli Salimi, Saadat Izadi, Mahmood Ahmadi, Hadi Tabatabaee Malazi2026-02-16下载Federated learning (FL) has become a promising answer to facilitating privacy-preserving collaborative learning in distributed IoT devices. However, device heterogeneity is a key challenge because IoT...
A Q-Learning Approach for Dynamic Resource Management in Three-Tier Vehicular Fog ComputingBahar Mojtabaei Ranani, Mahmood Ahmadi, Sajad Ahmadian2026-02-16下载In this paper, a method for predicting the resources required for an intelligent vehicle client using a three-layer vehicular computing architecture is proposed.
Bitcoin Under Stress: Measuring Infrastructure Resilience 2014-2025Wenbin Wu, Alexander Neumueller2026-02-16下载Bitcoin's design promises resilience through decentralization, yet the physical infrastructure supporting the network creates hidden dependencies.
LiSFC-Search: Lifelong Search for Network SFC Optimization under Non-stationary DriftsZuyuan Zhang, Vaneet Aggarwal, Tian Lan2026-02-16下载Edge-cloud convergence is reshaping service provisioning across 5G/6G and computing power networks (CPNs). Service function chaining (SFC) requires continuously placing and scheduling virtual network ...

cs.PF - Performance

标题作者发布日期PDF摘要
Decomposing Docker Container Startup Performance: A Three-Tier Measurement Study on Heterogeneous InfrastructureShamsher Khan2026-02-16下载Container startup latency is a critical performance metric for CI/CD pipelines, serverless computing, and auto-scaling systems, yet practitioners lack empirical guidance on how infrastructure choices ...

基于 VitePress 构建