2025-12-13

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
V-Rex: Real-Time Streaming Video LLM Acceleration via Dynamic KV Cache Retrieval	Donghyuk Kim, Sejeong Yang, Wonjin Shin, Joo-Young Kim	2025-12-13	下载	Streaming video large language models (LLMs) are increasingly used for real-time multimodal tasks such as video captioning, question answering, conversational agents, and augmented reality.
A Cache-Aware Hybrid Sieve Combining Segmentation and Bit-Packing for Fast Prime Generation	Kathi Lakshmi Mani Thirdhana	2025-12-13	下载	Prime generation is a fundamental task in cryptography, number theory, and randomized algorithms. While the classical Sieve of Eratosthenes is simple and efficient in theory, its practical performance...
DreamRAM: A Fine-Grained Configurable Design Space Modeling Tool for Custom 3D Die-Stacked DRAM	Victor Cai, Jennifer Zhou, Haebin Do, David Brooks, Gu-Yeon Wei	2025-12-13	下载	3D die-stacked DRAM has emerged as a key technology for delivering high bandwidth and high density for applications such as high-performance computing, graphics, and machine learning.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
HetRL: Efficient Reinforcement Learning for LLMs in Heterogeneous Environments	Yongjun He, Shuai Zhang, Jiading Gai, Xiyuan Zhang, Boran Han, Bernie Wang, Huzefa Rangwala, George Karypis	2025-12-13	下载	As large language models (LLMs) continue to scale and new GPUs are released even more frequently, there is an increasing demand for LLM post-training in heterogeneous environments to fully leverage un...
On Harnessing Idle Compute at the Edge for Foundation Model Training	Leyang Xue, Meghana Madhyastha, Myungjin Lee, Amos Storkey, Randal Burns, Mahesh K. Marina	2025-12-13	下载	The ecosystem behind foundation model development today is highly centralized and limited to large-scale cloud data center operators: training foundation models is costly, needing immense compute reso...
Reputation-Based Leader Election under Partial Synchrony: Towards a Protocol-Independent Abstraction with Enhanced Guarantees	Xuyang Liu, Zijian Zhang, Zhen Li, Jiahang Sun, Jiamou Liu, Peng Jiang	2025-12-13	下载	Leader election serves a well-defined role in leader-based Byzantine Fault Tolerant (BFT) protocols. Existing reputation-based leader election frameworks for partially synchronous BFTs suffer from eit...
Evaluating Asynchronous Semantics in Trace-Discovered Resilience Models: A Case Study on the OpenTelemetry Demo	Anatoly A. Krasnovsky	2025-12-13	下载	While distributed tracing and chaos engineering are becoming standard for microservices, resilience models remain largely manual and bespoke. We revisit a trace-discovered connectivity model that deri...
A Conflict-Aware Resource Management Framework for the Computing Continuum	Vlad Popescu-Vifor, Ilir Murturi, Praveen Kumar Donta, Schahram Dustdar	2025-12-13	下载	The increasing device heterogeneity and decentralization requirements in the computing continuum (i.e., spanning edge, fog, and cloud) introduce new challenges in resource orchestration.
Near-Zero-Overhead Freshness for Recommendation Systems via Inference-Side Model Updates	Wenjun Yu, Sitian Chen, Cheng Chen, Amelie Chi Zhou	2025-12-13	下载	Deep Learning Recommendation Models (DLRMs) underpin personalized services but face a critical freshness-accuracy tradeoff due to massive parameter synchronization overheads.
Fast Online Digital Twinning on FPGA for Mission Critical Applications	Bin Xu, Ayan Banerjee, Sandeep K. S. Gupta	2025-12-13	下载	Digital twinning enables real-time simulation and predictive modeling by maintaining a continuously updated virtual representation of a physical system.
Accelerated Digital Twin Learning for Edge AI: A Comparison of FPGA and Mobile GPU	Bin Xu, Ayan Banerjee, Midhat Urooj, Sandeep K. S. Gupta	2025-12-13	下载	Digital twins (DTs) can enable precision healthcare by continually learning a mathematical representation of patient-specific dynamics. However, mission critical healthcare applications require fast, ...
BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models	Zhengyang Wang, Ziyue Liu, Ruijie Zhang, Avinash Maurya, Paul Hovland, Bogdan Nicolae, Franck Cappello, Zheng Zhang	2025-12-13	下载	The scale of transformer model pre-training is constrained by the increasing computation and communication cost. Low-rank bottleneck architectures offer a promising solution to significantly reduce th...
Beyond right or wrong: towards redefining adaptive learning indicators in virtual learning environments	Andreia dos Santos Sachete, Alba Valeria de SantAnna de Freitas Loiola, Fabio Diniz Rossi, Jose Valdeni de Lima, Raquel Salcedo Gomes	2025-12-13	下载	Student learning development must involve more than just correcting or incorrect questions. However, most adaptive learning methods in Virtual Learning Environments are based on whether the student's ...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Agentic AI for 6G: A New Paradigm for Autonomous RAN Security Compliance	Sotiris Chatzimiltis, Mahdi Boloursaz Mashhadi, Mohammad Shojafar, Merouane Debbah, Rahim Tafazolli	2025-12-13	下载	Agentic AI systems are emerging as powerful tools for automating complex, multi-step tasks across various industries. One such industry is telecommunications, where the growing complexity of next-gene...
Joint Power and Mobility Control	Yun Hou, Yening Zhang	2025-12-13	下载	This study addressed the challenge of improving network connectivity in autonomous V2X networks by jointly optimizing transmission power and vehicle mobility.
Dynamic SLA-aware Network Slice Monitoring	Niloy Saha, Mina Tahmasbi Arashloo, Nashid Shahriar, Raouf Boutaba	2025-12-13	下载	Next-generation networks increasingly rely on network slices - logical networks tailored to specific application requirements, each with distinct Service-Level Agreements (SLAs).

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Evaluating Asynchronous Semantics in Trace-Discovered Resilience Models: A Case Study on the OpenTelemetry Demo	Anatoly A. Krasnovsky	2025-12-13	下载	While distributed tracing and chaos engineering are becoming standard for microservices, resilience models remain largely manual and bespoke. We revisit a trace-discovered connectivity model that deri...