Appearance
2025-12-13
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| V-Rex: Real-Time Streaming Video LLM Acceleration via Dynamic KV Cache Retrieval | Donghyuk Kim, Sejeong Yang, Wonjin Shin, Joo-Young Kim | 2025-12-13 | 下载 | Streaming video large language models (LLMs) are increasingly used for real-time multimodal tasks such as video captioning, question answering, conversational agents, and augmented reality. |
| A Cache-Aware Hybrid Sieve Combining Segmentation and Bit-Packing for Fast Prime Generation | Kathi Lakshmi Mani Thirdhana | 2025-12-13 | 下载 | Prime generation is a fundamental task in cryptography, number theory, and randomized algorithms. While the classical Sieve of Eratosthenes is simple and efficient in theory, its practical performance... |
| DreamRAM: A Fine-Grained Configurable Design Space Modeling Tool for Custom 3D Die-Stacked DRAM | Victor Cai, Jennifer Zhou, Haebin Do, David Brooks, Gu-Yeon Wei | 2025-12-13 | 下载 | 3D die-stacked DRAM has emerged as a key technology for delivering high bandwidth and high density for applications such as high-performance computing, graphics, and machine learning. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| HetRL: Efficient Reinforcement Learning for LLMs in Heterogeneous Environments | Yongjun He, Shuai Zhang, Jiading Gai, Xiyuan Zhang, Boran Han, Bernie Wang, Huzefa Rangwala, George Karypis | 2025-12-13 | 下载 | As large language models (LLMs) continue to scale and new GPUs are released even more frequently, there is an increasing demand for LLM post-training in heterogeneous environments to fully leverage un... |
| On Harnessing Idle Compute at the Edge for Foundation Model Training | Leyang Xue, Meghana Madhyastha, Myungjin Lee, Amos Storkey, Randal Burns, Mahesh K. Marina | 2025-12-13 | 下载 | The ecosystem behind foundation model development today is highly centralized and limited to large-scale cloud data center operators: training foundation models is costly, needing immense compute reso... |
| Reputation-Based Leader Election under Partial Synchrony: Towards a Protocol-Independent Abstraction with Enhanced Guarantees | Xuyang Liu, Zijian Zhang, Zhen Li, Jiahang Sun, Jiamou Liu, Peng Jiang | 2025-12-13 | 下载 | Leader election serves a well-defined role in leader-based Byzantine Fault Tolerant (BFT) protocols. Existing reputation-based leader election frameworks for partially synchronous BFTs suffer from eit... |
| Evaluating Asynchronous Semantics in Trace-Discovered Resilience Models: A Case Study on the OpenTelemetry Demo | Anatoly A. Krasnovsky | 2025-12-13 | 下载 | While distributed tracing and chaos engineering are becoming standard for microservices, resilience models remain largely manual and bespoke. We revisit a trace-discovered connectivity model that deri... |
| A Conflict-Aware Resource Management Framework for the Computing Continuum | Vlad Popescu-Vifor, Ilir Murturi, Praveen Kumar Donta, Schahram Dustdar | 2025-12-13 | 下载 | The increasing device heterogeneity and decentralization requirements in the computing continuum (i.e., spanning edge, fog, and cloud) introduce new challenges in resource orchestration. |
| Near-Zero-Overhead Freshness for Recommendation Systems via Inference-Side Model Updates | Wenjun Yu, Sitian Chen, Cheng Chen, Amelie Chi Zhou | 2025-12-13 | 下载 | Deep Learning Recommendation Models (DLRMs) underpin personalized services but face a critical freshness-accuracy tradeoff due to massive parameter synchronization overheads. |
| Fast Online Digital Twinning on FPGA for Mission Critical Applications | Bin Xu, Ayan Banerjee, Sandeep K. S. Gupta | 2025-12-13 | 下载 | Digital twinning enables real-time simulation and predictive modeling by maintaining a continuously updated virtual representation of a physical system. |
| Accelerated Digital Twin Learning for Edge AI: A Comparison of FPGA and Mobile GPU | Bin Xu, Ayan Banerjee, Midhat Urooj, Sandeep K. S. Gupta | 2025-12-13 | 下载 | Digital twins (DTs) can enable precision healthcare by continually learning a mathematical representation of patient-specific dynamics. However, mission critical healthcare applications require fast, ... |
| BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models | Zhengyang Wang, Ziyue Liu, Ruijie Zhang, Avinash Maurya, Paul Hovland, Bogdan Nicolae, Franck Cappello, Zheng Zhang | 2025-12-13 | 下载 | The scale of transformer model pre-training is constrained by the increasing computation and communication cost. Low-rank bottleneck architectures offer a promising solution to significantly reduce th... |
| Beyond right or wrong: towards redefining adaptive learning indicators in virtual learning environments | Andreia dos Santos Sachete, Alba Valeria de SantAnna de Freitas Loiola, Fabio Diniz Rossi, Jose Valdeni de Lima, Raquel Salcedo Gomes | 2025-12-13 | 下载 | Student learning development must involve more than just correcting or incorrect questions. However, most adaptive learning methods in Virtual Learning Environments are based on whether the student's ... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Agentic AI for 6G: A New Paradigm for Autonomous RAN Security Compliance | Sotiris Chatzimiltis, Mahdi Boloursaz Mashhadi, Mohammad Shojafar, Merouane Debbah, Rahim Tafazolli | 2025-12-13 | 下载 | Agentic AI systems are emerging as powerful tools for automating complex, multi-step tasks across various industries. One such industry is telecommunications, where the growing complexity of next-gene... |
| Joint Power and Mobility Control | Yun Hou, Yening Zhang | 2025-12-13 | 下载 | This study addressed the challenge of improving network connectivity in autonomous V2X networks by jointly optimizing transmission power and vehicle mobility. |
| Dynamic SLA-aware Network Slice Monitoring | Niloy Saha, Mina Tahmasbi Arashloo, Nashid Shahriar, Raouf Boutaba | 2025-12-13 | 下载 | Next-generation networks increasingly rely on network slices - logical networks tailored to specific application requirements, each with distinct Service-Level Agreements (SLAs). |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Evaluating Asynchronous Semantics in Trace-Discovered Resilience Models: A Case Study on the OpenTelemetry Demo | Anatoly A. Krasnovsky | 2025-12-13 | 下载 | While distributed tracing and chaos engineering are becoming standard for microservices, resilience models remain largely manual and bespoke. We revisit a trace-discovered connectivity model that deri... |