Appearance
2026-01-15
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Mugi: Value Level Parallelism For Efficient LLMs | Daniel Price, Prabhu Vellaisamy, John Shen, Di Wu | 2026-01-15 | 下载 | Value level parallelism (VLP) has been proposed to improve the efficiency of large-batch, low-precision general matrix multiply (GEMM) between symmetric activations and weights. |
| Converting Binary Floating-Point Numbers to Shortest Decimal Strings: An Experimental Review | Jaël Champagne Gareau, Daniel Lemire | 2026-01-15 | 下载 | When sharing or logging numerical data, we must convert binary floating-point numbers into their decimal string representations. For example, the number π might become 3.1415927. |
| Architectural Classification of XR Workloads: Cross-Layer Archetypes and Implications | Xinyu Shi, Simei Yang, Francky Catthoor | 2026-01-15 | 下载 | Edge and mobile platforms for augmented and virtual reality, collectively referred to as extended reality (XR) must deliver deterministic ultra-low-latency performance under stringent power and area c... |
| FaTRQ: Tiered Residual Quantization for LLM Vector Search in Far-Memory-Aware ANNS Systems | Tianqi Zhang, Flavio Ponzina, Tajana Rosing | 2026-01-15 | 下载 | Approximate Nearest-Neighbor Search (ANNS) is a key technique in retrieval-augmented generation (RAG), enabling rapid identification of the most relevant high-dimensional embeddings from massive vecto... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Context Lake: A System Class Defined by Decision Coherence | Xiaowei Jiang | 2026-01-15 | 下载 | AI agents are increasingly the primary consumers of data, operating continuously to make concurrent, irreversible decisions. Traditional data systems designed for human analysis cycles become correctn... |
| Breaking the Storage-Bandwidth Tradeoff in Distributed Storage with Quantum Entanglement | Lei Hu, Mohamed Nomeir, Alptug Aytekin, Sennur Ulukus | 2026-01-15 | 下载 | This work investigates the use of quantum resources in distributed storage systems. Consider an distributed storage system in which a file is stored across nodes such that any nodes ... |
| Mitigating GIL Bottlenecks in Edge AI Systems | Mridankan Mandal, Smit Sanjay Shende | 2026-01-15 | 下载 | Deploying Python-based AI agents on resource-constrained edge devices presents a critical runtime optimization challenge: high thread counts are needed to mask I/O latency, yet Python's Global Interpr... |
| WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching | Xiangchen Li, Jiakun Fan, Qingyuan Wang, Dimitrios Spatharakis, Saeid Ghafouri, Hans Vandierendonck, Deepu John, Bo Ji, Ali R. Butt, Dimitrios S. Nikolopoulos | 2026-01-15 | 下载 | As Large Language Models (LLMs) become increasingly accessible to end users, an ever-growing number of inference requests are initiated from edge devices and computed on centralized GPU clusters. |
| Chebyshev Accelerated Subspsace Eigensolver for Pseudo-hermitian Hamiltonians | Edoardo Di Napoli, Clément Richefort, Xinzhe Wu | 2026-01-15 | 下载 | Studying the optoelectronic structure of materials can require the computation of up to several thousands of the smallest eigenpairs of a pseudo-hermitian Hamiltonian. |
| SCRamble: Adaptive Decentralized Overlay Construction for Blockchain Networks | Evangelos Kolyvas, Alexandros Antonov, Spyros Voulgaris | 2026-01-15 | 下载 | Despite being under development for over 15 years, transaction throughput remains one of the key challenges confronting blockchains, which typically has a cap of a limited number of transactions per s... |
| Distributed Linearly Separable Computation with Arbitrary Heterogeneous Data Assignment | Ziting Zhang, Kai Wan, Minquan Cheng, Shuo Shao, Giuseppe Caire | 2026-01-15 | 下载 | Distributed linearly separable computation is a fundamental problem in large-scale distributed systems, requiring the computation of linearly separable functions over different datasets across distrib... |
| Fuzzychain-edge: A novel Fuzzy logic-based adaptive Access control model for Blockchain in Edge Computing | Khushbakht Farooq, Muhammad Ibrahim, Irsa Manzoor, Mukhtaj Khan, Wei Song | 2026-01-15 | 下载 | The rapid integration of IoT with edge computing has revolutionized various domains, particularly healthcare, by enabling real-time data sharing, remote monitoring, and decision-making. |
| A Forward Simulation-Based Hierarchy of Linearizable Concurrent Objects | Chao Wang, Ruijia Li, Yang Zhou, Peng Wu, Yi Lv, Jianwei Liao, Jim Woodcock, Zhiming Liu | 2026-01-15 | 下载 | In this paper, we systematically investigate the connection between linearizable objects and forward simulation. We prove that the sets of linearizable objects satisfying wait-freedom (resp. |
| Fundamental Limits of Coded Polynomial Aggregation | Xi Zhong, Jörg Kliewer, Mingyue Ji | 2026-01-15 | 下载 | Coded polynomial aggregation (CPA) enables the master to directly recover a weighted aggregation of polynomial evaluations without individually decoding each term, thereby reducing the number of requi... |
| Clustering-Based User Selection in Federated Learning: Metadata Exploitation for 3GPP Networks | Ce Zheng, Shiyao Ma, Ke Zhang, Chen Sun, Wenqi Zhang | 2026-01-15 | 下载 | Federated learning (FL) enables collaborative model training without sharing raw user data, but conventional simulations often rely on unrealistic data partitioning and current user selection methods ... |
| Federated Unlearning in Edge Networks: A Survey of Fundamentals, Challenges, Practical Applications and Future Directions | Jer Shyuan Ng, Wathsara Daluwatta, Shehan Edirimannage, Charitha Elvitigala, Asitha Kottahachchi Kankanamge Don, Ibrahim Khalil, Heng Zhang, Dusit Niyato | 2026-01-15 | 下载 | The proliferation of connected devices and privacy-sensitive applications has accelerated the adoption of Federated Learning (FL), a decentralized paradigm that enables collaborative model training wi... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| An Efficient and Explainable KAN Framework for Wireless Radiation Field Prediction | Jingzhou Shen, Xuyu Wang | 2026-01-15 | 下载 | Modeling wireless channels accurately remains a challenge due to environmental variations and signal uncertainties. Recent neural networks can learn radio frequency~(RF) signal propagation patterns, b... |
| Breaking the Storage-Bandwidth Tradeoff in Distributed Storage with Quantum Entanglement | Lei Hu, Mohamed Nomeir, Alptug Aytekin, Sennur Ulukus | 2026-01-15 | 下载 | This work investigates the use of quantum resources in distributed storage systems. Consider an distributed storage system in which a file is stored across nodes such that any nodes ... |
| A user subscription model in mobile radio access networks with network slicing | José-Ramón Vidal, Luis Guijarro, Vicent Pla | 2026-01-15 | 下载 | Network slicing is an architectural enabling technology that logically decouples the current cellular networks into infrastructure providers (InPs) and Network Slice Tenants (NSTs). |
| Enhancing Mobile Ad Hoc Networks (MANETs) with Software-Defined Networking (SDN): A Balanced Approach | Riccardo Fonti, Andrea Piroddi | 2026-01-15 | 下载 | Mobile Ad Hoc Networks (MANETs) are decentralized wireless networks, characterized by their dynamic topologies and node mobility. In the era of cutting-edge technologies, integrating Software-Defined ... |
| SDN-Driven Innovations in MANETs and IoT: A Path to Smarter Networks | Andrea Piroddi, Riccardo Fonti | 2026-01-15 | 下载 | Mobile Ad Hoc Networks (MANETs) and Internet of Things (IoT) networks operate in decentralized and dynamic environments, making them ideal for scenarios lacking traditional infrastructure. |
| SCRamble: Adaptive Decentralized Overlay Construction for Blockchain Networks | Evangelos Kolyvas, Alexandros Antonov, Spyros Voulgaris | 2026-01-15 | 下载 | Despite being under development for over 15 years, transaction throughput remains one of the key challenges confronting blockchains, which typically has a cap of a limited number of transactions per s... |
| Queueing-Aware Optimization of Reasoning Tokens for Accuracy-Latency Trade-offs in LLM Servers | Emre Ozbas, Melih Bastopcu | 2026-01-15 | 下载 | We consider a single large language model (LLM) server that serves a heterogeneous stream of queries belonging to distinct task types. Queries arrive according to a Poisson process, and each type ... |
| Bias in the Shadows: Explore Shortcuts in Encrypted Network Traffic Classification | Chuyi Wang, Xiaohui Xie, Tongze Wang, Yong Cui | 2026-01-15 | 下载 | Pre-trained models operating directly on raw bytes have achieved promising performance in encrypted network traffic classification (NTC), but often suffer from shortcut learning-relying on spurious co... |
| Starfield: Demand-Aware Satellite Topology Design for Low-Earth Orbit Mega Constellations | Shayan Hamidi Dehshali, Tzu-Hsuan Liao, Shaileshh Bojja Venkatakrishnan | 2026-01-15 | 下载 | Low-Earth orbit (LEO) mega-constellations are emerging as high-capacity backbones for next-generation Internet. Deployment of laser terminals enables high-bandwidth, low-latency inter-satellite links ... |
| Large Language Model (LLM)-enabled Reinforcement Learning for Wireless Network Optimization | Jie Zheng, Ruichen Zhang, Dusit Niyato, Haijun Zhang, Jiacheng Wang, Hongyang Du, Jiawen Kang, Zehui Xiong | 2026-01-15 | 下载 | Enhancing future wireless networks presents a significant challenge for networking systems due to diverse user demands and the emergence of 6G technology. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Mitigating GIL Bottlenecks in Edge AI Systems | Mridankan Mandal, Smit Sanjay Shende | 2026-01-15 | 下载 | Deploying Python-based AI agents on resource-constrained edge devices presents a critical runtime optimization challenge: high thread counts are needed to mask I/O latency, yet Python's Global Interpr... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Balanced allocation: considerations from large scale service environments | Amer Diwan, Prabhakar Raghavan, Eli Upfal | 2026-01-15 | 下载 | We study d-way balanced allocation, which assigns each incoming job to the lightest loaded among d randomly chosen servers. While prior work has extensively studied the performance of the basic scheme... |
| Mitigating GIL Bottlenecks in Edge AI Systems | Mridankan Mandal, Smit Sanjay Shende | 2026-01-15 | 下载 | Deploying Python-based AI agents on resource-constrained edge devices presents a critical runtime optimization challenge: high thread counts are needed to mask I/O latency, yet Python's Global Interpr... |
| Long-term Monitoring of Kernel and Hardware Events to Understand Latency Variance | Fang Zhou, Yuyang Huang, Miao Yu, Sixiang Ma, Tongping Liu, Yang Wang | 2026-01-15 | 下载 | This paper presents our experience to understand latency variance caused by kernel and hardware events, which are often invisible at the application level. |
| Emergency Department Patient Flow Optimization with an Alternative Care Threshold Policy | Sahba Baniasadi, Paul M. Griffin, Prakash Chakraborty | 2026-01-15 | 下载 | Emergency department (ED) overcrowding and patient boarding represent critical systemic challenges that compromise care quality. We propose a threshold-based admission policy that redirects non-urgent... |