Skip to content

2026-02-19

cs.AR - Architecture

标题作者发布日期PDF摘要
When Models Ignore Definitions: Measuring Semantic Override Hallucinations in LLM ReasoningYogeswar Reddy Thota, Setareh Rafatirad, Homayoun Houman, Tooraj Nikoubin2026-02-19下载Large language models (LLMs) demonstrate strong performance on standard digital logic and Boolean reasoning tasks, yet their reliability under locally redefined semantics remains poorly understood.
ARKV: Adaptive and Resource-Efficient KV Cache Management under Limited Memory Budget for Long-Context Inference in LLMsJianlong Lei, Shashikant Ilager2026-02-19下载Large Language Models (LLMs) are increasingly deployed in scenarios demanding ultra-long context reasoning, such as agentic workflows and deep research understanding.
SimulatorCoder: DNN Accelerator Simulator Code Generation and Optimization via Large Language ModelsYuhuan Xia, Tun Li, Hongji Zhou, Xianfa Zhou, Chong Chen, Ruiyu Zhang2026-02-19下载This paper presents SimulatorCoder, an agent powered by large language models (LLMs), designed to generate and optimize deep neural network (DNN) accelerator simulators based on natural language descr...
A Data-Driven Dynamic Execution Orchestration ArchitectureZhenyu Bai, Pranav Dangi, Rohan Juneja, Zhaoying Li, Zhanglu Yan, Huiying Lan, Tulika Mitra2026-02-19下载Domain-specific accelerators deliver exceptional performance on their target workloads through fabrication-time orchestrated datapaths. However, such specialized architectures often exhibit performanc...
Low-Cost IoT-Enabled Tele-ECG Monitoring for Resource-Constrained Settings: System Design and PrototypeSeemron Neupane, Aashish Ghimire2026-02-19下载With the availability of automation machinery and its superiority, are being slothful and inviting many diseases to invade them. The world still has so many places where people lack basic health facil...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Distributed Triangle Enumeration in HypergraphsDuncan Adamson, Will Rosenbaum, Paul G. Spirakis2026-02-19下载In the last decade, subgraph detection and enumeration have emerged as a central problem in distributed graph algorithms. This is largely due to the theoretical challenges and practical applications o...
GPU Memory and Utilization Estimation for Training-Aware Resource Management: Opportunities and LimitationsEhsan Yousefzadeh-Asl-Miandoab, Reza Karimzadeh, Danyal Yorulmaz, Bulat Ibragimov, Pınar Tözün2026-02-19下载Collocating deep learning training tasks improves GPU utilization but risks resource contention, severe slowdowns, and out-of-memory (OOM) failures.
Faster Parallel Batch-Dynamic Algorithms for Low Out-Degree OrientationGuy Blelloch, Andrew Brady, Laxman Dhulipala, Jeremy Fineman, Kishen Gowda, Chase Hutton2026-02-19下载A low out-degree orientation directs each edge of an undirected graph with the goal of minimizing the maximum out-degree of a vertex. In the parallel batch-dynamic setting, one can insert or delete ba...
Collaborative Processing for Multi-Tenant Inference on Memory-Constrained Edge TPUsNathan Ng, Walid A. Hanafy, Prashanthi Kadambi, Balachandra Sunil, Ayush Gupta, David Irwin, Yogesh Simmhan, Prashant Shenoy2026-02-19下载IoT applications are increasingly relying on on-device AI accelerators to ensure high performance, especially in limited connectivity and safety-critical scenarios.
Message-Oriented Middleware Systems: Technology OverviewWael Al-Manasrah, Zuhair AlSader, Tim Brecht, Ahmed Alquraan, Samer Al-Kiswany2026-02-19下载We present a comprehensive characterization study of open-source message-oriented middleware (MOM) systems. We followed a rigorous methodology to select and study ten popular and diverse MOM systems.
Catastrophic Forgetting Resilient One-Shot Incremental Federated LearningObaidullah Zaland, Zulfiqar Ahmad Khan, Monowar Bhuyan2026-02-19下载Modern big-data systems generate massive, heterogeneous, and geographically dispersed streams that are large-scale and privacy-sensitive, making centralization challenging.
Guarding the Middle: Protecting Intermediate Representations in Federated Split LearningObaidullah Zaland, Sajib Mistry, Monowar Bhuyan2026-02-19下载Big data scenarios, where massive, heterogeneous datasets are distributed across clients, demand scalable, privacy-preserving learning methods.
Exploring Novel Data Storage Approaches for Large-Scale Numerical Weather PredictionNicolau Manubens Gil2026-02-19下载Driven by scientific and industry ambition, HPC and AI applications such as operational Numerical Weather Prediction (NWP) require processing and storing ever-increasing data volumes as fast as possib...
TopoSZp: Lightweight Topology-Aware Error-controlled Compression for Scientific DataTripti Agarwal, Sheng Di, Xin Liang, Zhaoyuan Su, Yuxiao Li, Ganesh Gopalakrishnan, Hanqi Guo, Franck Cappello2026-02-19下载Error-bounded lossy compression is essential for managing the massive data volumes produced by large-scale HPC simulations. While state-of-the-art compressors such as SZ and ZFP provide strong numeric...
Informative Trains: A Memory-Efficient Journey to a Self-Stabilizing Leader Election Algorithm in Anonymous GraphsLelia Blin, Sylvain Gay, Isabella Ziccardi2026-02-19下载We study the self-stabilizing leader election problem in anonymous nn-nodes networks. Achieving self-stabilization with low space memory complexity is particularly challenging, and designing space-op...
ARKV: Adaptive and Resource-Efficient KV Cache Management under Limited Memory Budget for Long-Context Inference in LLMsJianlong Lei, Shashikant Ilager2026-02-19下载Large Language Models (LLMs) are increasingly deployed in scenarios demanding ultra-long context reasoning, such as agentic workflows and deep research understanding.
Do GPUs Really Need New Tabular File Formats?Jigao Luo, Qi Chen, Carsten Binnig2026-02-19下载Parquet is the de facto columnar file format in modern analytical systems, yet its configuration guidelines have largely been shaped by CPU-centric execution models.
Evaluating Malleable Job Scheduling in HPC Clusters using Real-World WorkloadsPatrick Zojer, Jonas Posner, Taylan Özden2026-02-19下载Optimizing resource utilization in high-performance computing (HPC) clusters is essential for maximizing both system efficiency and user satisfaction.
Visual Insights into Agentic Optimization of Pervasive Stream Processing ServicesBoris Sedlak, Víctor Casamayor Pujol, Schahram Dustdar2026-02-19下载Processing sensory data close to the data source, often involving Edge devices, promises low latency for pervasive applications, like smart cities.
A Framework for Hybrid Collective Inference in Distributed Sensor NetworksAndrew Nash, Dirk Pesch, Krishnendu Guha2026-02-19下载With the ever-increasing range of applications of Internet in Things (IoT) and sensor networks, challenges are emerging in various categories of classification tasks.
Trivance: Latency-Optimal AllReduce by Shortcutting Multiport NetworksAnton Juerss, Vamsi Addanki, Stefan Schmid2026-02-19下载AllReduce is a fundamental collective operation in distributed computing and a key performance bottleneck for large-scale training and inference.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
EDRP: Enhanced Dynamic Relay Point Protocol for Data Dissemination in Multi-hop Wireless IoT NetworksJothi Prasanna Shanmuga Sundaram, Magzhan Gabidolla, Luis Fujarte, Shawn Duong, Jianlin Guo, Toshiaki Koike-Akino, Pu, Wang, Kieran Parsons, Philip V. Orlik, Takenori Sumi, Yukimasa Nagai, Miguel A. Carreira-Perpinan, Alberto E. Cerpa2026-02-19下载Emerging IoT applications are transitioning from battery-powered to grid-powered nodes. DRP, a contention-based data dissemination protocol, was developed for these applications.
HAP Networks for the Future: Applications in Sensing, Computing, and CommunicationSultan Çoğay, T. Tolga Sari, Muhammad Nadeem Ali, Byung-Seo Kim, Gökhan Seçinti2026-02-19下载High Altitude Platforms (HAPs) are a major advancement in non-terrestrial networks, offering broad coverage and unique capabilities. They form a vital link between satellite systems and terrestrial ne...
ACOS: Arrays of Cheap Optical SwitchesDaniel Amir, Ori Cohen, Jakob Krebs, Mark Silberstein2026-02-19下载Machine learning training places immense demands on cluster networks, motivating specialized architectures and co-design with parallelization strategies.
Voice-Driven Semantic Perception for UAV-Assisted Emergency NetworksNuno Saavedra, Pedro Ribeiro, André Coelho, Rui Campos2026-02-19下载Unmanned Aerial Vehicle (UAV)-assisted networks are increasingly foreseen as a promising approach for emergency response, providing rapid, flexible, and resilient communications in environments where ...
End-to-End Latency Measurement Methodology for Connected and Autonomous Vehicle TeleoperationFrançois Provost, Faisal Hawlader, Mehdi Testouri, Raphaël Frank2026-02-19下载Connected and Autonomous Vehicles (CAVs) continue to evolve rapidly, and system latency remains one of their most critical performance parameters, particularly when vehicles are operated remotely.
Trivance: Latency-Optimal AllReduce by Shortcutting Multiport NetworksAnton Juerss, Vamsi Addanki, Stefan Schmid2026-02-19下载AllReduce is a fundamental collective operation in distributed computing and a key performance bottleneck for large-scale training and inference.
On the Value of Base Station Motion Knowledge for Goal-Oriented Remote Monitoring with Energy-Harvesting SensorsSehani Siriwardana, Jean Michel de Souza Sant'Ana, Richard Demo Souza, Abolfazl Zakeri, Onel Luis Alcaraz López2026-02-19下载This paper investigates goal-oriented remote monitoring of an unobservable Markov source using energy-harvesting sensors that communicate with a mobile receiver, such as a Low Earth Orbit (LEO) satell...
Hierarchical Edge-Cloud Task Offloading in NTN for Remote HealthcareAlejandro Flores, Danial Shafaie, Konstantinos Ntontin, Elli Kartsakli, Symeon Chatzinotas2026-02-19下载In this work, we study a hierarchical non-terrestrial network as an edge-cloud platform for remote computing of tasks generated by remote ad-hoc healthcare facility deployments, or internet of medical...
RIS Control through the Lens of Stochastic Network Calculus: An O-RAN Framework for Delay-Sensitive 6G ApplicationsOscar Adamuz-Hinojosa, Lanfranco Zanzi, Vincenzo Sciancalepore, Marco Di Renzo, Xavier Costa-Pérez2026-02-19下载Reconfigurable Intelligent Surfaces (RIS) enable dynamic electromagnetic control for 6G networks, but existing control schemes lack responsiveness to fast-varying network conditions, limiting their ap...
Robust and Extensible Measurement of Broadband Plans with BQT+Laasya Koduru, Sylee Beltiukov, Alexander Nguyen, Eugene Vuong, Jaber Daneshamooz, Tejas Narechania, Elizabeth Belding, Arpit Gupta2026-02-19下载Independent, street address-level broadband data is essential for evaluating Internet infrastructure investments, such as the $42B Broadband Equity, Access, and Deployment (BEAD) program.

cs.PF - Performance

标题作者发布日期PDF摘要
Collaborative Processing for Multi-Tenant Inference on Memory-Constrained Edge TPUsNathan Ng, Walid A. Hanafy, Prashanthi Kadambi, Balachandra Sunil, Ayush Gupta, David Irwin, Yogesh Simmhan, Prashant Shenoy2026-02-19下载IoT applications are increasingly relying on on-device AI accelerators to ensure high performance, especially in limited connectivity and safety-critical scenarios.
ARKV: Adaptive and Resource-Efficient KV Cache Management under Limited Memory Budget for Long-Context Inference in LLMsJianlong Lei, Shashikant Ilager2026-02-19下载Large Language Models (LLMs) are increasingly deployed in scenarios demanding ultra-long context reasoning, such as agentic workflows and deep research understanding.
Visual Insights into Agentic Optimization of Pervasive Stream Processing ServicesBoris Sedlak, Víctor Casamayor Pujol, Schahram Dustdar2026-02-19下载Processing sensory data close to the data source, often involving Edge devices, promises low latency for pervasive applications, like smart cities.

基于 VitePress 构建