Skip to content

2026-02-05

cs.AR - Architecture

标题作者发布日期PDF摘要
D-Legion: A Scalable Many-Core Architecture for Accelerating Matrix Multiplication in Quantized LLMsAhmed J. Abdelmaksoud, Cristian Sestito, Shiwei Wang, Themis Prodromakis2026-02-05下载The performance gains obtained by large language models (LLMs) are closely linked to their substantial computational and memory requirements. Quantized LLMs offer significant advantages with extremely...
Balancing FP8 Computation Accuracy and Efficiency on Digital CIM via Shift-Aware On-the-fly Aligned-Mantissa Bitwidth PredictionLiang Zhao, Kunming Shao, Zhipeng Liao, Xijie Huang, Tim Kwang-Ting Cheng, Chi-Ying Tsui, Yi Zou2026-02-05下载FP8 low-precision formats have gained significant adoption in Transformer inference and training. However, existing digital compute-in-memory (DCIM) architectures face challenges in supporting variabl...
Quantum Sequential CircuitsD. -S. Wang2026-02-05下载This work introduces and characterizes quantum sequential circuits (QSCs) as a hardware-oriented paradigm for quantum computing, built upon a novel foundational element termed the quantum transistor.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Location-Aware Dispersion on Anonymous GraphsHimani, Supantha Pandit, Gokarna Sharma2026-02-05下载The well-studied DISPERSION problem is a fundamental coordination problem in distributed robotics, where a set of mobile robots must relocate so that each occupies a distinct node of a network.
The Quantum Message Complexity of Distributed Wake-Up with AdvicePeter Robinson, Ming Ming Tan2026-02-05下载We consider the distributed wake-up problem with advice, where nodes are equipped with initial knowledge about the network at large. After the adversary awakens a subset of nodes, an oracle computes a...
Intent-driven Diffusion-based Path for Mobile Data Collector in IoT-enabled Dense WSNsUma Mahesh Boda, Mallikharjuna Rao Nuka2026-02-05下载Mobile data collection using controllable sinks is an effective approach to improve energy efficiency and data freshness in densely deployed wireless sensor networks (WSNs).
TimelyFreeze: Adaptive Parameter Freezing Mechanism for Pipeline ParallelismSeonghye Cho, Jaemin Han, Hyunjin Kim, Euisoo Jung, Jae-Gil Lee2026-02-05下载Pipeline parallelism enables training models that exceed single-device memory, but practical throughput remains limited by pipeline bubbles. Although parameter freezing can improve training throughput...
FedRandom: Sampling Consistent and Accurate Contribution Values in Federated LearningArno Geimer, Beltran Fiz Pontiveros, Radu State2026-02-05下载Federated Learning is a privacy-preserving decentralized approach for Machine Learning tasks. In industry deployments characterized by a limited number of entities possessing abundant data, the signif...
Smoothed aggregation algebraic multigrid for problems with heterogeneous and anisotropic materialsMax Firmbach, Malachi Phillips, Christian Glusa, Alexander Popp, Christopher M. Siefert, Matthias Mayr2026-02-05下载This paper introduces a material-aware strength-of-connection measure for smoothed aggregation algebraic multigrid methods, aimed at improving robustness for scalar partial differential equations with...
Sovereign-by-Design A Reference Architecture for AI and Blockchain Enabled SystemsMatteo Esposito, Lodovica Marchesi, Roberto Tonelli, Valentina Lenarduzzi2026-02-05下载Digital sovereignty has emerged as a central concern for modern software-intensive systems, driven by the dominance of non-sovereign cloud infrastructures, the rapid adoption of Generative AI, and inc...
Emergence-as-Code for Self-Governing Reliable SystemsAnatoly A. Krasnovsky2026-02-05下载SLO-as-code has made per-service} reliability declarative, but user experience is defined by journeys whose reliability is an emergent property of microservice topology, routing, redundancy, timeouts/...
Reaching Univalency with Subquadratic CommunicationAndrew Lewis-Pye2026-02-05下载The Dolev-Reischuk lower bound establishes that any deterministic Byzantine Agreement (BA) protocol for nn processors tolerating ff faults requires Ω(f^2+n) messages.
Proteus: Append-Only Ledgers for (Mostly) Trusted Execution EnvironmentsShubham Mishra, João Gonçalves, Chawinphat Tankuranand, Neil Giridharan, Natacha Crooks, Heidi Howard, Chris Jensen2026-02-05下载Distributed ledgers are increasingly relied upon by industry to provide trustworthy accountability, strong integrity protection, and high availability for critical data without centralizing trust.
MergePipe: A Budget-Aware Parameter Management System for Scalable LLM MergingYuanyi Wang, Yanggan Gu, Zihao Wang, Kunxi Li, Yifan Yang, Zhaoyi Yan, Congkai Xie, Jianmin Wu, Hongxia Yang2026-02-05下载Large language model (LLM) merging has become a key technique in modern LLM development pipelines, enabling the integration of multiple task- or domain-specific expert models without retraining.
ORACL: Optimized Reasoning for Autoscaling via Chain of Thought with LLMs for MicroservicesHaoyu Bai, Muhammed Tawfiqul Islam, Minxian Xu, Rajkumar Buyya2026-02-05下载Applications are moving away from monolithic designs to microservice and serverless architectures, where fleets of lightweight and independently deployable components run on public clouds.
From Sequential to Parallel: Reformulating Dynamic Programming as GPU Kernels for Large-Scale Stochastic Combinatorial OptimizationJingyi Zhao, Linxin Yang, Haohua Zhang, Qile He, Tian Ding2026-02-05下载A major bottleneck in scenario-based Sample Average Approximation (SAA) for stochastic programming (SP) is the cost of solving an exact second-stage problem for every scenario, especially when each sc...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Interpreting Manifolds and Graph Neural Embeddings from Internet of Things Traffic FlowsEnrique Feito-Casares, Francisco M. Melgarejo-Meseguer, Elena Casiraghi, Giorgio Valentini, José-Luis Rojo-Álvarez2026-02-05下载The rapid expansion of Internet of Things (IoT) ecosystems has led to increasingly complex and heterogeneous network topologies. Traditional network monitoring and visualization tools rely on aggregat...
Intent-driven Diffusion-based Path for Mobile Data Collector in IoT-enabled Dense WSNsUma Mahesh Boda, Mallikharjuna Rao Nuka2026-02-05下载Mobile data collection using controllable sinks is an effective approach to improve energy efficiency and data freshness in densely deployed wireless sensor networks (WSNs).
Data analysis of cloud virtualization experimentsPedro R. X. do Carmo, Eduardo Freitas, Assis T. de Oliveira Filho, Judith Kelner, Djamel Sadok2026-02-05下载The cloud computing paradigm underlines data center and telecommunication infrastructure design. Heavily leveraging virtualization, it slices hardware and software resources into smaller software unit...
Statistical Verification of Medium-Access Parameterization for Power-Grid Edge Ad Hoc Sensor NetworksHaitian Wang, Xia Cheng, Yiren Wang, Xinyu Wang, Zichen Geng, Xian Zhang, Yihao Ding2026-02-05下载The widespread deployment of power grid ad hoc sensor networks based on IEEE 802.15.4 raises reliability challenges when nodes selfishly adapt CSMA/CA parameters to maximize individual performance.
Wi-Fi Radar via Over-the-Air Referencing: Bridging Wi-Fi Sensing and Bistatic RadarKoji Yamamoto2026-02-05下载Wi-Fi channel state information (CSI), which is originally acquired for communication purposes, has recently been reused for sensing and radar-like functionalities.
Causal Online Learning of Safe Regions in Cloud Radio Access NetworksKim Hammar, Tansu Alpcan, Emil Lupu2026-02-05下载Cloud radio access networks (RANs) enable cost-effective management of mobile networks by dynamically scaling their capacity on demand. However, deploying adaptive controllers to implement such dynami...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Taking the Leap: Efficient and Reliable Fine-Grained NUMA Migration in User-spaceFelix Schuhknecht, Nick Rassau2026-02-05下载Modern multi-socket architectures offer a single virtual address space, but physically divide main-memory across multiple regions, where each region is attached to a CPU and its cores.

cs.PF - Performance

标题作者发布日期PDF摘要
End-to-End Throughput Benchmarking of Portable Deterministic CNN-Based Signal Processing PipelinesChristiaan Boerkamp, Akhil John Thomas2026-02-05下载This paper presents a benchmarking methodology for evaluating end-to-end performance of deterministic signal-processing pipelines expressed using CNN-compatible primitives.
Protean Compiler: An Agile Framework to Drive Fine-grain Phase OrderingAmir H. Ashouri, Shayan Shirahmad Gale Bagi, Kavin Satheeskumar, Tejas Srikanth, Jonathan Zhao, Ibrahim Saidoun, Ziwen Wang, Bryan Chan, Tomasz S. Czajkowski2026-02-05下载The phase ordering problem has been a long-standing challenge since the late 1970s, yet it remains an open problem due to having a vast optimization space and an unbounded nature, making it an open-en...
SweetSpot: An Analytical Model for Predicting Energy Efficiency of LLM InferenceHiari Pizzini Cavagna, Andrea Proia, Giacomo Madella, Giovanni B. Esposito, Francesco Antici, Daniele Cesarini, Zeynep Kiziltan, Andrea Bartolini2026-02-05下载Large Language Models (LLMs) inference is central to modern AI applications, dominating worldwide datacenter workloads, making it critical to predict its energy footprint.
Wasure: A Modular Toolkit for Comprehensive WebAssembly BenchmarkingRiccardo Carissimi, Ben L. Titzer2026-02-05下载WebAssembly (Wasm) has become a key compilation target for portable and efficient execution across diverse platforms. Benchmarking its performance, however, is a multi-dimensional challenge: it depend...
Emergence-as-Code for Self-Governing Reliable SystemsAnatoly A. Krasnovsky2026-02-05下载SLO-as-code has made per-service} reliability declarative, but user experience is defined by journeys whose reliability is an emergent property of microservice topology, routing, redundancy, timeouts/...

基于 VitePress 构建