Skip to content

2026-01-06

cs.AR - Architecture

标题作者发布日期PDF摘要
SpANNS: Optimizing Approximate Nearest Neighbor Search for Sparse Vectors Using Near Memory ProcessingTianqi Zhang, Flavio Ponzina, Tajana Rosing2026-01-06下载Approximate Nearest Neighbor Search (ANNS) is a fundamental operation in vector databases, enabling efficient similarity search in high-dimensional spaces.
Bare-Metal Tensor Virtualization: Overcoming the Memory Wall in Edge-AI Inference on ARM64Bugra Kilictas, Faruk Alpay2026-01-06下载The deployment of Large Language Models (LLMs) on edge devices is fundamentally constrained by the "Memory Wall" the bottleneck where data movement latency outstrips arithmetic throughput.
Advancing Assistive Robotics: Multi-Modal Navigation and Biophysical Monitoring for Next-Generation WheelchairsMd. Anowar Hossain, Mohd. Ehsanul Hoque2026-01-06下载Assistive electric-powered wheelchairs (EPWs) have become essential mobility aids for people with disabilities such as amyotrophic lateral sclerosis (ALS), post-stroke hemiplegia, and dementia-related...
Sparsity-Aware Streaming SNN Accelerator with Output-Channel Dataflow for Automatic Modulation ClassificationKuilian Yang, Li Zhang, Ahmed M. Eltawil, Khaled Nabil Salama2026-01-06下载The rapid advancement of wireless communication technologies, including 5G, emerging 6G networks, and the large-scale deployment of the Internet of Things (IoT), has intensified the need for efficient...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Enhancing Model Context Protocol (MCP) with Context-Aware Server CollaborationMeenakshi Amulya Jayanti, X. Y. Han2026-01-06下载The Model Context Protocol (MCP) (MCP Community, 2025) has emerged as a widely used framework for enabling LLM-based agents to communicate with external tools and services.
Revisiting Speculative Leaderless Protocols for Low-Latency BFT ReplicationDaniel Qian, Xiyu Hao, Jinkun Geng, Yuncheng Yao, Aurojit Panda, Jinyang Li, Anirudh Sivaraman2026-01-06下载As Byzantine Fault Tolerant (BFT) protocols begin to be used in permissioned blockchains for user-facing applications such as payments, it is crucial that they provide low latency.
Software-Defined Agentic ServingSaurabh Agarwal, Marco Laju, Jayanth Srinivasa, Myungjin Lee, Aditya Akella2026-01-06下载As multi-agent LLM pipelines grow in complexity, existing serving paradigms fail to adapt to the dynamic serving conditions. We argue that agentic serving systems should be programmable and system-awa...
Exploring Blockchain Interoperability: Frameworks, Use Cases, and Future ChallengesStanly Wilson, Kwabena Adu-Duodu, Yinhao Li, Ellis Solaiman, Omer Rana, Rajiv Ranjan2026-01-06下载Trust between entities in any scenario without a trusted third party is very difficult, and trust is exactly what blockchain aims to bring into the digital world with its basic features.
Proceedings of the 1st International Workshop on Low Carbon Computing (LOCO 2024)Wim Vanderbauwhede, Lauritz Thamsen, José Cano2026-01-06下载This is the proceedings of the 1st International Workshop on Low Carbon Computing (LOCO 2024).
Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over UnslothArjun S. Nair2026-01-06下载Large language model fine-tuning is bottlenecked by memory: a 7B parameter model requires 84GB--14GB for weights, 14GB for gradients, and 56GB for FP32 optimizer states--exceeding even A100-40GB capac...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
On the Capacity Region of Individual Key Rates in Vector Linear Secure AggregationLei Hu, Sennur Ulukus2026-01-06下载We provide new insights into an open problem recently posed by Yuan-Sun [ISIT 2025], concerning the minimum individual key rate required in the vector linear secure aggregation problem.
inRAN: Interpretable Online Bayesian Learning for Network Automation in Open Radio Access NetworksMing Zhao, Yuru Zhang, Qiang Liu, Ahan Kak, Nakjung Choi2026-01-06下载Emerging AI/ML techniques have been showing great potential in automating network control in open radio access networks (Open RAN). However, existing approaches heavily rely on blackbox policies param...
oneTwin: Online Digital Network Twin via Neural Radio Radiance FieldYuru Zhang, Ming Zhao, Qiang Liu, Nakjung Choi2026-01-06下载Digital network twin is a promising technology that replicates real-world networks in real-time and assists with the design, operation, and management of next-generation networks.
TaNG: Modeling Packet Classification with TSS-assisted Neural Networks on GPUsZhengyu Liao, Shiyou Qian2026-01-06下载Packet classification is a core function in software-defined networks, and learning-based methods have recently shown significant throughput gains on large-scale rulesets.
Multi-Modal Data-Enhanced Foundation Models for Prediction and Control in Wireless Networks: A SurveyHan Zhang, Mohammad Farzanullah, Mohammad Ghassemi, Akram Bin Sediq, Ali Afana, Melike Erol-Kantarci2026-01-06下载Foundation models (FMs) are recognized as a transformative breakthrough that has started to reshape the future of artificial intelligence (AI) across both academia and industry.
Eco-WakeLoc: An Energy-Neutral and Cooperative UWB Real-Time Locating SystemSilvano Cortesi, Lukas Schulthess, Davide Plozza, Christian Vogt, Michele Magno2026-01-06下载Indoor localization systems face a fundamental trade-off between efficiency and responsiveness, which is especially important for emerging use cases such as mobile robots operating in GPS-denied envir...
Probabilistic Time Slot Leasing in TDMA-Based IoT Networks for Enhanced Channel UtilizationHicham Lakhlef, Mohamed Ali Zormati, Khaled Abid, Toufik Ahmed2026-01-06下载In large-scale resource-constrained wireless networks, such as those prevalent in the Internet of Things (IoT), efficient communication scheduling remains a critical challenge.
Which Deep Learner? A Systematic Evaluation of Advanced Deep Forecasting Models Accuracy and Efficiency for Network Traffic PredictionEilaf MA Babai, Aalaa MA Babai, Koji Okamura2026-01-06下载Network traffic prediction is essential for automating modern network management. It is a difficult time series forecasting (TSF) problem that has been addressed by Deep Learning (DL) models due to th...

cs.PF - Performance

标题作者发布日期PDF摘要
Rapid Augmentations for Time Series (RATS): A High-Performance Library for Time Series AugmentationWadie Skaf, Felix Kern, Aryamaan Basu Roy, Tejas Pradhan, Roman Kalkreuth, Holger Hoos2026-01-06下载Time series augmentation is critical for training robust deep learning models, particularly in domains where labelled data is scarce and expensive to obtain.
Scalable Tree Ensemble Proximities in PythonAdrien Aumon, Guy Wolf, Kevin R. Moon, Jake S. Rhodes2026-01-06下载Tree ensemble methods such as Random Forests naturally induce supervised similarity measures through their decision tree structure, but existing implementations of proximities derived from tree ensemb...
Embedding Retrofitting: Data Engineering for better RAGAnantha Sharma2026-01-06下载Embedding retrofitting adjusts pre-trained word vectors using knowledge graph constraints to improve domain-specific retrieval. However, the effectiveness of retrofitting depends critically on knowled...

基于 VitePress 构建