Skip to content

2026-02-23

cs.AR - Architecture

标题作者发布日期PDF摘要
GauS: Differentiable Scheduling Optimization via Gaussian ReparameterizationYaohui Cai, Vesal Bakhtazad, Cunxi Yu, Zhiru Zhang2026-02-23下载Efficient operator scheduling is a fundamental challenge in software compilation and hardware synthesis. While recent differentiable approaches have sought to replace traditional ones like exact solve...
Compute System Organization for High Frequency High Order Wavefront Sensing and ControlBarry Lyu, Vaibhavi Manjarekar, Nathaniel Bleier2026-02-23下载Maintaining long-term wavefront stability is critical for the Habitable Worlds Observatory (HWO), which targets contrasts approaching 101010^{-10} and therefore requires continuous dark-zone maintenance...
CQ-CiM: Hardware-Aware Embedding Shaping for Robust CiM-Based RetrievalXinzhao Li, Alptekin Vardar, Franz Müller, Navya Goli, Umamaheswara Rao Tida, Kai Ni, Xiaobo Sharon Hu, Thomas Kämpfe, Ruiyang Qin2026-02-23下载Deploying Retrieval-Augmented Generation (RAG) on edge devices is in high demand, but is hindered by the latency of massive data movement and computation on traditional architectures.
Extending CPU-less parallel execution of lambda calculus in digital logic with lists and arithmeticHarry Fitchett, Jasmine Ritchie, Charles Fox2026-02-23下载Computer architecture is searching for new ways to make use of increasingly available digital logic without the serial bottlenecks of CPU-based design.
Interconnect-Aware Logic Resynthesis for Multi-Die FPGAsXiaoke Wang, Raveena Raikar, Markus Rein, Ruiqi Chen, Chang Meng, Dirk Stroobandt2026-02-23下载Multi-die FPGAs enable device scaling beyond reticle limits but introduce severe interconnect overhead across die boundaries. Inter-die connections, commonly referred to as super-long lines (SLLs), in...
Hardware-Friendly Randomization: Enabling Random-Access and Minimal Wiring in FHE Accelerators with Low Total CostIlan Rosenfeld, Noam Kleinburd, Hillel Chapman, Dror Reuven2026-02-23下载The Ring-Learning With Errors (RLWE) problem forms the backbone of highly efficient Fully Homomorphic Encryption (FHE) schemes. A significant component of the RLWE public key and ciphertext of the for...
Fair and Square: Replacing One Real Multiplication with a Single Square and One Complex Multiplication with Three Squares When Performing Matrix Multiplication and ConvolutionsVincenzo Liguori2026-02-23下载This paper shows that, for matrix multiplications and convolutions, it is possible to asymptotically replace each real multiplication with a single squaring operation.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
The Tragedy of Chain CommonsIgnacio Amores-Sesar, Mirza Ahad Baig, Seth Gilbert, Ray Neiheiser, Michelle X. Yeo2026-02-23下载Byzantine Fault Tolerant (BFT) consensus forms the foundation of many modern blockchains striving for both high throughput and low latency. A growing bottleneck is transaction execution and validation...
Mitigating Artifacts in Pre-quantization Based Scientific Data Compressors with Quantization-aware InterpolationPu Jiao, Sheng Di, Jiannan Tian, Mingze Xia, Xuan Wu, Yang Zhang, Xin Liang, Franck Cappello2026-02-23下载Error-bounded lossy compression has been regarded as a promising way to address the ever-increasing amount of scientific data in today's high-performance computing systems.
A Context-Aware Knowledge Graph Platform for Stream Processing in Industrial IoTMonica Marconi Sciarroni, Emanuele Storti2026-02-23下载Industrial IoT ecosystems bring together sensors, machines and smart devices operating collaboratively across industrial environments. These systems generate large volumes of heterogeneous, high-veloc...
Linear Reservoir: A Diagonalization-Based OptimizationRomain de Coudenhove, Yannis Bendi-Ouis, Anthony Strock, Xavier Hinaut2026-02-23下载We introduce a diagonalization-based optimization for Linear Echo State Networks (ESNs) that reduces the per-step computational complexity of reservoir state updates from O(N^2) to O(N).
A Risk-Aware UAV-Edge Service Framework for Wildfire Monitoring and Emergency ResponseYulun Huang, Zhiyu Wang, Rajkumar Buyya2026-02-23下载Wildfire monitoring demands timely data collection and processing for early detection and rapid response. UAV-assisted edge computing is a promising approach, but jointly minimizing end-to-end service...
GPU-Resident Gaussian Process Regression Leveraging Asynchronous Tasks with HPXHenrik Möllmann, Dirk Pflüger, Alexander Strack2026-02-23下载Gaussian processes (GPs) are a widely used regression tool, but the cubic complexity of exact solvers limits their scalability. To address this challenge, we extend the GPRat library by incorporating ...
Why iCloud Fails: The Category Mistake of Cloud SynchronizationPaul Borrill2026-02-23下载iCloud Drive presents a filesystem interface but implements cloud synchronization semantics that diverge from POSIX in fundamental ways. This divergence is not an implementation bug; it is a Category ...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Dynamic Model Routing and Cascading for Efficient LLM Inference: A SurveyYasmin Moslem, John D. Kelleher2026-02-23下载The rapid growth of large language models (LLMs) with diverse capabilities, costs, and domains has created a critical need for intelligent model selection at inference time.
Agentic AI for Scalable and Robust Optical Systems ControlZehao Wang, Mingzhe Han, Wei Cheng, Yue-Kai Huang, Philip Ji, Denton Wu, Mahdi Safari, Flemming Holtorf, Kenaish AlQubaisi, Norbert M. Linke, Danyang Zhuo, Yiran Chen, Ting Wang, Dirk Englund, Tingjun Chen2026-02-23下载We present AgentOptics, an agentic AI framework for high-fidelity, autonomous optical system control built on the Model Context Protocol (MCP).
Adaptive Underwater Acoustic Communications with Limited Feedback: An AoI-Aware Hierarchical Bandit ApproachFabio Busacca, Andrea Panebianco, Yin Sun2026-02-23下载Underwater Acoustic (UWA) networks are vital for remote sensing and ocean exploration but face inherent challenges such as limited bandwidth, long propagation delays, and highly dynamic channels.
Digital Twin--Driven Adaptive Wavelet Strategy for Efficient 6G Backbone Network TelemetryAlexandre Barbosa de Lima, Xavier Hesselbach, José Roberto de Almeida Amazonas2026-02-23下载Classical orthogonal wavelets guarantee perfect reconstruction but rely on fixed bases optimized for polynomial smoothness, achieving suboptimal compression on signals with fractal spectral signatures...
A Quantum Internet Protocol Suite Beyond LayeringAngela Sara Cacciapuoti, Marcello Caleffi2026-02-23下载Layering, the protocol organization principle underpinning the classical Internet, is ill-suited to the Quantum Internet, built around entanglement, which is non-local and stateful.
BeamVLM for Low-altitude Economy: Generative Beam Prediction via Vision-language ModelsChenran Kou, Changsheng You, Mingjiang Wu, Dingzhu Wen, Zezhong Zhang, Chengwen Xing2026-02-23下载For low-altitude economy (LAE), fast and accurate beam prediction between high-mobility unmanned aerial vehicles (UAVs) and ground base stations is of paramount importance, which ensures seamless cove...
vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality ModelsXunzhuo Liu, Huamin Chen, Samzong Lu, Yossi Ovadia, Guohong Wen, Hao Wu, Zhengda Tan, Jintao Zhang, Senan Zedan, Yehudit Kerido, Liav Weiss, Haichen Zhang, Bishen Yu, Asaad Balum, Noa Limoy, Abdallah Samara, Baofa Fan, Brent Salisbury, Ryan Cook, Zhijie Wang, Qiping Pan, Rehan Khan, Avishek Goswami, Houston H. Zhang, Shuyi Wang, Ziang Tang, Fang Han, Zohaib Hassan, Jianqiao Zheng, Avinash Changrani2026-02-23下载As large language models (LLMs) diversify across modalities, capabilities, and cost profiles, the problem of intelligent request routing -- selecting the right model for each query at inference time -...
AI-Powered Conflict Management in Open RAN: Detection, Classification, and MitigationAbdul Wadud, Nima Afraz, Fatemeh Golpayegani2026-02-23下载Open Radio Access Network (RAN) was designed with native Artificial Intelligence (AI) as a core pillar, enabling AI- driven xApps and rApps to dynamically optimize network performance.
Traffic-Aware Configuration of OPC UA PubSub in Industrial Automation NetworksKasra Ekrad, Bjarne Johansson, Inés Alvarez Vadillo, Saad Mubeen, Mohammad Ashjaei2026-02-23下载Interoperability across industrial automation systems is a cornerstone of Industry 4.0. To address this need, the OPC Unified Architecture (OPC UA) Publish-Subscribe (PubSub) model offers a promising ...
Spritz: Path-Aware Load Balancing in Low-Diameter NetworksTommaso Bonato, Ales Kubicek, Abdul Kabbani, Ahmad Ghalayini, Maciej Besta, Torsten Hoefler2026-02-23下载Low-diameter topologies such as Dragonfly and Slim Fly are increasingly adopted in HPC and datacenter networks, yet existing load balancing techniques either rely on proprietary in-network mechanisms ...
EMS-FL: Federated Tuning of Mixture-of-Experts in Satellite-Terrestrial Networks via Expert-Driven Model SplittingAngzi Xu, Zezhong Zhang, Zhi Liu, Shuguang Cui2026-02-23下载The rapid advancement of large AI models imposes stringent demands on data volume and computational resources. Federated learning, though designed to exploit distributed data and computational resourc...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Right to History: A Sovereignty Kernel for Verifiable AI Agent ExecutionJing Zhang2026-02-23下载AI agents increasingly act on behalf of humans, yet no existing system provides a tamper-evident, independently verifiable record of what they did.
Why iCloud Fails: The Category Mistake of Cloud SynchronizationPaul Borrill2026-02-23下载iCloud Drive presents a filesystem interface but implements cloud synchronization semantics that diverge from POSIX in fundamental ways. This divergence is not an implementation bug; it is a Category ...

cs.PF - Performance

标题作者发布日期PDF摘要
Dynamic Model Routing and Cascading for Efficient LLM Inference: A SurveyYasmin Moslem, John D. Kelleher2026-02-23下载The rapid growth of large language models (LLMs) with diverse capabilities, costs, and domains has created a critical need for intelligent model selection at inference time.
QuickGrasp: Responsive Video-Language Querying Service via Accelerated Tokenization and Edge-Augmented InferenceMiao Zhang, Ruixiao Zhang, Jianxin Shi, Hengzhi Wang, Hao Fang, Jiangchuan Liu2026-02-23下载Video-language models (VLMs) are reshaping video querying services, bringing unified solutions to complex perception and reasoning tasks. However, deploying large VLMs in real-world systems remains ch...

基于 VitePress 构建