2025-12-13

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| V-Rex: Real-Time Streaming Video LLM Acceleration via Dynamic KV Cache Retrieval | Donghyuk Kim, Sejeong Yang, Wonjin Shin, Joo-Young Kim | 2025-12-13 | Download | Streaming video large language models (LLMs) are increasingly used for real-time multimodal tasks such as video captioning, question answering, conversational agents, and augmented reality. |
| A Cache-Aware Hybrid Sieve Combining Segmentation and Bit-Packing for Fast Prime Generation | Kathi Lakshmi Mani Thirdhana | 2025-12-13 | Download | Prime generation is a fundamental task in cryptography, number theory, and randomized algorithms. While the classical Sieve of Eratosthenes is simple and efficient in theory, its practical performance... |
| DreamRAM: A Fine-Grained Configurable Design Space Modeling Tool for Custom 3D Die-Stacked DRAM | Victor Cai, Jennifer Zhou, Haebin Do, David Brooks, Gu-Yeon Wei | 2025-12-13 | Download | 3D die-stacked DRAM has emerged as a key technology for delivering high bandwidth and high density for applications such as high-performance computing, graphics, and machine learning. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| HetRL: Efficient Reinforcement Learning for LLMs in Heterogeneous Environments | Yongjun He, Shuai Zhang, Jiading Gai, Xiyuan Zhang, Boran Han, Bernie Wang, Huzefa Rangwala, George Karypis | 2025-12-13 | Download | As large language models (LLMs) continue to scale and new GPUs are released even more frequently, there is an increasing demand for LLM post-training in heterogeneous environments to fully leverage un... |
| On Harnessing Idle Compute at the Edge for Foundation Model Training | Leyang Xue, Meghana Madhyastha, Myungjin Lee, Amos Storkey, Randal Burns, Mahesh K. Marina | 2025-12-13 | Download | The ecosystem behind foundation model development today is highly centralized and limited to large-scale cloud data center operators: training foundation models is costly, needing immense compute reso... |
| Reputation-Based Leader Election under Partial Synchrony: Towards a Protocol-Independent Abstraction with Enhanced Guarantees | Xuyang Liu, Zijian Zhang, Zhen Li, Jiahang Sun, Jiamou Liu, Peng Jiang | 2025-12-13 | Download | Leader election serves a well-defined role in leader-based Byzantine Fault Tolerant (BFT) protocols. Existing reputation-based leader election frameworks for partially synchronous BFTs suffer from eit... |
| Evaluating Asynchronous Semantics in Trace-Discovered Resilience Models: A Case Study on the OpenTelemetry Demo | Anatoly A. Krasnovsky | 2025-12-13 | Download | While distributed tracing and chaos engineering are becoming standard for microservices, resilience models remain largely manual and bespoke. We revisit a trace-discovered connectivity model that deri... |
| A Conflict-Aware Resource Management Framework for the Computing Continuum | Vlad Popescu-Vifor, Ilir Murturi, Praveen Kumar Donta, Schahram Dustdar | 2025-12-13 | Download | The increasing device heterogeneity and decentralization requirements in the computing continuum (i.e., spanning edge, fog, and cloud) introduce new challenges in resource orchestration. |
| Near-Zero-Overhead Freshness for Recommendation Systems via Inference-Side Model Updates | Wenjun Yu, Sitian Chen, Cheng Chen, Amelie Chi Zhou | 2025-12-13 | Download | Deep Learning Recommendation Models (DLRMs) underpin personalized services but face a critical freshness-accuracy tradeoff due to massive parameter synchronization overheads. |
| Fast Online Digital Twinning on FPGA for Mission Critical Applications | Bin Xu, Ayan Banerjee, Sandeep K. S. Gupta | 2025-12-13 | Download | Digital twinning enables real-time simulation and predictive modeling by maintaining a continuously updated virtual representation of a physical system. |
| Accelerated Digital Twin Learning for Edge AI: A Comparison of FPGA and Mobile GPU | Bin Xu, Ayan Banerjee, Midhat Urooj, Sandeep K. S. Gupta | 2025-12-13 | Download | Digital twins (DTs) can enable precision healthcare by continually learning a mathematical representation of patient-specific dynamics. However, mission critical healthcare applications require fast, ... |
| BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models | Zhengyang Wang, Ziyue Liu, Ruijie Zhang, Avinash Maurya, Paul Hovland, Bogdan Nicolae, Franck Cappello, Zheng Zhang | 2025-12-13 | Download | The scale of transformer model pre-training is constrained by the increasing computation and communication cost. Low-rank bottleneck architectures offer a promising solution to significantly reduce th... |
| Beyond right or wrong: towards redefining adaptive learning indicators in virtual learning environments | Andreia dos Santos Sachete, Alba Valeria de SantAnna de Freitas Loiola, Fabio Diniz Rossi, Jose Valdeni de Lima, Raquel Salcedo Gomes | 2025-12-13 | Download | Student learning development must involve more than just correcting or incorrect questions. However, most adaptive learning methods in Virtual Learning Environments are based on whether the student's ... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Agentic AI for 6G: A New Paradigm for Autonomous RAN Security Compliance | Sotiris Chatzimiltis, Mahdi Boloursaz Mashhadi, Mohammad Shojafar, Merouane Debbah, Rahim Tafazolli | 2025-12-13 | Download | Agentic AI systems are emerging as powerful tools for automating complex, multi-step tasks across various industries. One such industry is telecommunications, where the growing complexity of next-gene... |
| Joint Power and Mobility Control | Yun Hou, Yening Zhang | 2025-12-13 | Download | This study addressed the challenge of improving network connectivity in autonomous V2X networks by jointly optimizing transmission power and vehicle mobility. |
| Dynamic SLA-aware Network Slice Monitoring | Niloy Saha, Mina Tahmasbi Arashloo, Nashid Shahriar, Raouf Boutaba | 2025-12-13 | Download | Next-generation networks increasingly rely on network slices - logical networks tailored to specific application requirements, each with distinct Service-Level Agreements (SLAs). |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Evaluating Asynchronous Semantics in Trace-Discovered Resilience Models: A Case Study on the OpenTelemetry Demo | Anatoly A. Krasnovsky | 2025-12-13 | Download | While distributed tracing and chaos engineering are becoming standard for microservices, resilience models remain largely manual and bespoke. We revisit a trace-discovered connectivity model that deri... |
