
2026-01-15

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Mugi: Value Level Parallelism For Efficient LLMs | Daniel Price, Prabhu Vellaisamy, John Shen, Di Wu | 2026-01-15 | Download | Value level parallelism (VLP) has been proposed to improve the efficiency of large-batch, low-precision general matrix multiply (GEMM) between symmetric activations and weights. |
| Converting Binary Floating-Point Numbers to Shortest Decimal Strings: An Experimental Review | Jaël Champagne Gareau, Daniel Lemire | 2026-01-15 | Download | When sharing or logging numerical data, we must convert binary floating-point numbers into their decimal string representations. For example, the number π might become 3.1415927. |
| Architectural Classification of XR Workloads: Cross-Layer Archetypes and Implications | Xinyu Shi, Simei Yang, Francky Catthoor | 2026-01-15 | Download | Edge and mobile platforms for augmented and virtual reality, collectively referred to as extended reality (XR), must deliver deterministic ultra-low-latency performance under stringent power and area c... |
| FaTRQ: Tiered Residual Quantization for LLM Vector Search in Far-Memory-Aware ANNS Systems | Tianqi Zhang, Flavio Ponzina, Tajana Rosing | 2026-01-15 | Download | Approximate Nearest-Neighbor Search (ANNS) is a key technique in retrieval-augmented generation (RAG), enabling rapid identification of the most relevant high-dimensional embeddings from massive vecto... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Context Lake: A System Class Defined by Decision Coherence | Xiaowei Jiang | 2026-01-15 | Download | AI agents are increasingly the primary consumers of data, operating continuously to make concurrent, irreversible decisions. Traditional data systems designed for human analysis cycles become correctn... |
| Breaking the Storage-Bandwidth Tradeoff in Distributed Storage with Quantum Entanglement | Lei Hu, Mohamed Nomeir, Alptug Aytekin, Sennur Ulukus | 2026-01-15 | Download | This work investigates the use of quantum resources in distributed storage systems. Consider an $(n,k,d)$ distributed storage system in which a file is stored across $n$ nodes such that any $k$ nodes ... |
| Mitigating GIL Bottlenecks in Edge AI Systems | Mridankan Mandal, Smit Sanjay Shende | 2026-01-15 | Download | Deploying Python-based AI agents on resource-constrained edge devices presents a critical runtime optimization challenge: high thread counts are needed to mask I/O latency, yet Python's Global Interpr... |
| WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching | Xiangchen Li, Jiakun Fan, Qingyuan Wang, Dimitrios Spatharakis, Saeid Ghafouri, Hans Vandierendonck, Deepu John, Bo Ji, Ali R. Butt, Dimitrios S. Nikolopoulos | 2026-01-15 | Download | As Large Language Models (LLMs) become increasingly accessible to end users, an ever-growing number of inference requests are initiated from edge devices and computed on centralized GPU clusters. |
| Chebyshev Accelerated Subspace Eigensolver for Pseudo-hermitian Hamiltonians | Edoardo Di Napoli, Clément Richefort, Xinzhe Wu | 2026-01-15 | Download | Studying the optoelectronic structure of materials can require the computation of up to several thousands of the smallest eigenpairs of a pseudo-hermitian Hamiltonian. |
| SCRamble: Adaptive Decentralized Overlay Construction for Blockchain Networks | Evangelos Kolyvas, Alexandros Antonov, Spyros Voulgaris | 2026-01-15 | Download | Despite being under development for over 15 years, transaction throughput remains one of the key challenges confronting blockchains, which typically has a cap of a limited number of transactions per s... |
| Distributed Linearly Separable Computation with Arbitrary Heterogeneous Data Assignment | Ziting Zhang, Kai Wan, Minquan Cheng, Shuo Shao, Giuseppe Caire | 2026-01-15 | Download | Distributed linearly separable computation is a fundamental problem in large-scale distributed systems, requiring the computation of linearly separable functions over different datasets across distrib... |
| Fuzzychain-edge: A novel Fuzzy logic-based adaptive Access control model for Blockchain in Edge Computing | Khushbakht Farooq, Muhammad Ibrahim, Irsa Manzoor, Mukhtaj Khan, Wei Song | 2026-01-15 | Download | The rapid integration of IoT with edge computing has revolutionized various domains, particularly healthcare, by enabling real-time data sharing, remote monitoring, and decision-making. |
| A Forward Simulation-Based Hierarchy of Linearizable Concurrent Objects | Chao Wang, Ruijia Li, Yang Zhou, Peng Wu, Yi Lv, Jianwei Liao, Jim Woodcock, Zhiming Liu | 2026-01-15 | Download | In this paper, we systematically investigate the connection between linearizable objects and forward simulation. We prove that the sets of linearizable objects satisfying wait-freedom (resp. |
| Fundamental Limits of Coded Polynomial Aggregation | Xi Zhong, Jörg Kliewer, Mingyue Ji | 2026-01-15 | Download | Coded polynomial aggregation (CPA) enables the master to directly recover a weighted aggregation of polynomial evaluations without individually decoding each term, thereby reducing the number of requi... |
| Clustering-Based User Selection in Federated Learning: Metadata Exploitation for 3GPP Networks | Ce Zheng, Shiyao Ma, Ke Zhang, Chen Sun, Wenqi Zhang | 2026-01-15 | Download | Federated learning (FL) enables collaborative model training without sharing raw user data, but conventional simulations often rely on unrealistic data partitioning and current user selection methods ... |
| Federated Unlearning in Edge Networks: A Survey of Fundamentals, Challenges, Practical Applications and Future Directions | Jer Shyuan Ng, Wathsara Daluwatta, Shehan Edirimannage, Charitha Elvitigala, Asitha Kottahachchi Kankanamge Don, Ibrahim Khalil, Heng Zhang, Dusit Niyato | 2026-01-15 | Download | The proliferation of connected devices and privacy-sensitive applications has accelerated the adoption of Federated Learning (FL), a decentralized paradigm that enables collaborative model training wi... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| An Efficient and Explainable KAN Framework for Wireless Radiation Field Prediction | Jingzhou Shen, Xuyu Wang | 2026-01-15 | Download | Modeling wireless channels accurately remains a challenge due to environmental variations and signal uncertainties. Recent neural networks can learn radio frequency (RF) signal propagation patterns, b... |
| Breaking the Storage-Bandwidth Tradeoff in Distributed Storage with Quantum Entanglement | Lei Hu, Mohamed Nomeir, Alptug Aytekin, Sennur Ulukus | 2026-01-15 | Download | This work investigates the use of quantum resources in distributed storage systems. Consider an $(n,k,d)$ distributed storage system in which a file is stored across $n$ nodes such that any $k$ nodes ... |
| A user subscription model in mobile radio access networks with network slicing | José-Ramón Vidal, Luis Guijarro, Vicent Pla | 2026-01-15 | Download | Network slicing is an architectural enabling technology that logically decouples the current cellular networks into infrastructure providers (InPs) and Network Slice Tenants (NSTs). |
| Enhancing Mobile Ad Hoc Networks (MANETs) with Software-Defined Networking (SDN): A Balanced Approach | Riccardo Fonti, Andrea Piroddi | 2026-01-15 | Download | Mobile Ad Hoc Networks (MANETs) are decentralized wireless networks, characterized by their dynamic topologies and node mobility. In the era of cutting-edge technologies, integrating Software-Defined ... |
| SDN-Driven Innovations in MANETs and IoT: A Path to Smarter Networks | Andrea Piroddi, Riccardo Fonti | 2026-01-15 | Download | Mobile Ad Hoc Networks (MANETs) and Internet of Things (IoT) networks operate in decentralized and dynamic environments, making them ideal for scenarios lacking traditional infrastructure. |
| SCRamble: Adaptive Decentralized Overlay Construction for Blockchain Networks | Evangelos Kolyvas, Alexandros Antonov, Spyros Voulgaris | 2026-01-15 | Download | Despite being under development for over 15 years, transaction throughput remains one of the key challenges confronting blockchains, which typically has a cap of a limited number of transactions per s... |
| Queueing-Aware Optimization of Reasoning Tokens for Accuracy-Latency Trade-offs in LLM Servers | Emre Ozbas, Melih Bastopcu | 2026-01-15 | Download | We consider a single large language model (LLM) server that serves a heterogeneous stream of queries belonging to $N$ distinct task types. Queries arrive according to a Poisson process, and each type ... |
| Bias in the Shadows: Explore Shortcuts in Encrypted Network Traffic Classification | Chuyi Wang, Xiaohui Xie, Tongze Wang, Yong Cui | 2026-01-15 | Download | Pre-trained models operating directly on raw bytes have achieved promising performance in encrypted network traffic classification (NTC), but often suffer from shortcut learning, relying on spurious co... |
| Starfield: Demand-Aware Satellite Topology Design for Low-Earth Orbit Mega Constellations | Shayan Hamidi Dehshali, Tzu-Hsuan Liao, Shaileshh Bojja Venkatakrishnan | 2026-01-15 | Download | Low-Earth orbit (LEO) mega-constellations are emerging as high-capacity backbones for next-generation Internet. Deployment of laser terminals enables high-bandwidth, low-latency inter-satellite links ... |
| Large Language Model (LLM)-enabled Reinforcement Learning for Wireless Network Optimization | Jie Zheng, Ruichen Zhang, Dusit Niyato, Haijun Zhang, Jiacheng Wang, Hongyang Du, Jiawen Kang, Zehui Xiong | 2026-01-15 | Download | Enhancing future wireless networks presents a significant challenge for networking systems due to diverse user demands and the emergence of 6G technology. |

cs.OS - Operating Systems

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Mitigating GIL Bottlenecks in Edge AI Systems | Mridankan Mandal, Smit Sanjay Shende | 2026-01-15 | Download | Deploying Python-based AI agents on resource-constrained edge devices presents a critical runtime optimization challenge: high thread counts are needed to mask I/O latency, yet Python's Global Interpr... |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Balanced allocation: considerations from large scale service environments | Amer Diwan, Prabhakar Raghavan, Eli Upfal | 2026-01-15 | Download | We study d-way balanced allocation, which assigns each incoming job to the lightest loaded among d randomly chosen servers. While prior work has extensively studied the performance of the basic scheme... |
| Mitigating GIL Bottlenecks in Edge AI Systems | Mridankan Mandal, Smit Sanjay Shende | 2026-01-15 | Download | Deploying Python-based AI agents on resource-constrained edge devices presents a critical runtime optimization challenge: high thread counts are needed to mask I/O latency, yet Python's Global Interpr... |
| Long-term Monitoring of Kernel and Hardware Events to Understand Latency Variance | Fang Zhou, Yuyang Huang, Miao Yu, Sixiang Ma, Tongping Liu, Yang Wang | 2026-01-15 | Download | This paper presents our experience to understand latency variance caused by kernel and hardware events, which are often invisible at the application level. |
| Emergency Department Patient Flow Optimization with an Alternative Care Threshold Policy | Sahba Baniasadi, Paul M. Griffin, Prakash Chakraborty | 2026-01-15 | Download | Emergency department (ED) overcrowding and patient boarding represent critical systemic challenges that compromise care quality. We propose a threshold-based admission policy that redirects non-urgent... |
