Appearance
2026-01-18
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| CD-PIM: A High-Bandwidth and Compute-Efficient LPDDR5-Based PIM for Low-Batch LLM Acceleration on Edge-Device | Ye Lin, Chao Fang, Xiaoyong Song, Qi Wu, Anying Jiang, Yichuan Bai, Li Du | 2026-01-18 | 下载 | Edge deployment of low-batch large language models (LLMs) faces critical memory bandwidth bottlenecks when executing memory-intensive general matrix-vector multiplications (GEMV) operations. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Asynchronous MultiAgent Reinforcement Learning for 5G Routing under Side Constraints | Sebastian Racedo, Brigitte Jaumard, Oscar Delgado, Meysam Masoudi | 2026-01-18 | 下载 | Networks in the current 5G and beyond systems increasingly carry heterogeneous traffic with diverse quality-of-service constraints, making real-time routing decisions both complex and time-critical. |
| SGCP: A Self-Organized Game-Theoretic Framework For Collaborative Perception | Zechuan Gong, Hui Zhang, Yuquan Yang, Wenyu Lu | 2026-01-18 | 下载 | Collaborative perception holds great promise for improving safety in autonomous driving, particularly in dense traffic where vehicles can share sensory information to overcome individual blind spots a... |
| ASAS-BridgeAMM: Trust-Minimized Cross-Chain Bridge AMM with Failure Containment | Shengwei You, Aditya Joshi, Andrey Kuehlkamp, Jarek Nabrzyski | 2026-01-18 | 下载 | Cross-chain bridges constitute the single largest vector of systemic risk in Decentralized Finance (DeFi), accounting for over $2.8 billion in losses since 2021. |
| RIPPLE++: An Incremental Framework for Efficient GNN Inference on Evolving Graphs | Pranjal Naman, Parv Agarwal, Hrishikesh Haritas, Yogesh Simmhan | 2026-01-18 | 下载 | Real-world graphs are dynamic, with frequent updates to their structure and features due to evolving vertex and edge properties. These continual changes pose significant challenges for efficient infer... |
| Opportunistic Scheduling for Optimal Spot Instance Savings in the Cloud | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-18 | 下载 | We study the problem of scheduling delay-sensitive jobs over spot and on-demand cloud instances to minimize average cost while meeting an average delay constraint. |
| Spark-LLM-Eval: A Distributed Framework for Statistically Rigorous Large Language Model Evaluation | Subhadip Mitra | 2026-01-18 | 下载 | Evaluating large language models at scale remains a practical bottleneck for many organizations. While existing evaluation frameworks work well for thousands of examples, they struggle when datasets g... |
| Power Aware Dynamic Reallocation For Inference | Yiwei Jiang, Sangeeta Chowdhary, Nathaniel Morris, Rutwik Jain, Srilatha Manne, Sam Bayliss | 2026-01-18 | 下载 | Disaggregation has emerged as a powerful strategy for optimizing large language model (LLM) inference by separating compute-intensive prefill and memory-bound decode phases across specialized GPUs. |
| Canonicalization of Batched Einstein Summations for Tuning Retrieval | Kaushik Kulkarni, Andreas Klöckner | 2026-01-18 | 下载 | We present an algorithm for normalizing \emph{Batched Einstein Summation} expressions by mapping mathematically equivalent formulations to a unique normal form. |
| DaggerFFT: A Distributed FFT Framework Using Task Scheduling in Julia | Sana Taghipour Anvari, Julian Samaroo, Matin Raayai Ardakani, David Kaeli | 2026-01-18 | 下载 | The Fast Fourier Transform (FFT) is a fundamental numerical technique with widespread application in a range of scientific problems. As scientific simulations attempt to exploit exascale systems, ther... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Asynchronous MultiAgent Reinforcement Learning for 5G Routing under Side Constraints | Sebastian Racedo, Brigitte Jaumard, Oscar Delgado, Meysam Masoudi | 2026-01-18 | 下载 | Networks in the current 5G and beyond systems increasingly carry heterogeneous traffic with diverse quality-of-service constraints, making real-time routing decisions both complex and time-critical. |
| LiQSS: Post-Transformer Linear Quantum-Inspired State-Space Tensor Networks for Real-Time 6G | Farhad Rezazadeh, Hatim Chergui, Mehdi Bennis, Houbing Song, Lingjia Liu, Dusit Niyato, Merouane Debbah | 2026-01-18 | 下载 | Proactive and agentic control in Sixth-Generation (6G) Open Radio Access Networks (O-RAN) requires control-grade prediction under stringent Near-Real-Time (Near-RT) latency and computational constrain... |
| Cross-reality Location Privacy Protection in 6G-enabled Vehicular Metaverses: An LLM-enhanced Hybrid Generative Diffusion Model-based Approach | Xiaofeng Luo, Jiayi He, Jiawen Kang, Ruichen Zhang, Zhaoshui He, Ekram Hossain, Dong In Kim | 2026-01-18 | 下载 | The emergence of 6G-enabled vehicular metaverses enables Autonomous Vehicles (AVs) to operate across physical and virtual spaces through space-air-ground-sea integrated networks. |
| Opportunistic Scheduling for Optimal Spot Instance Savings in the Cloud | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-18 | 下载 | We study the problem of scheduling delay-sensitive jobs over spot and on-demand cloud instances to minimize average cost while meeting an average delay constraint. |
| Optimal Power Allocation and Sub-Optimal Channel Assignment for Downlink NOMA Systems Using Deep Reinforcement Learning | WooSeok Kim, Jeonghoon Lee, Sangho Kim, Taesun An, WonMin Lee, Dowon Kim, Kyungseop Shin | 2026-01-18 | 下载 | In recent years, Non-Orthogonal Multiple Access (NOMA) system has emerged as a promising candidate for multiple access frameworks due to the evolution of deep machine learning, trying to incorporate d... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Opportunistic Scheduling for Optimal Spot Instance Savings in the Cloud | Neelkamal Bhuyan, Randeep Bhatia, Murali Kodialam, TV Lakshman | 2026-01-18 | 下载 | We study the problem of scheduling delay-sensitive jobs over spot and on-demand cloud instances to minimize average cost while meeting an average delay constraint. |
| DDSA: Dual-Domain Strategic Attack for Spatial-Temporal Efficiency in Adversarial Robustness Testing | Jinwei Hu, Shiyuan Meng, Yi Dong, Xiaowei Huang | 2026-01-18 | 下载 | Image transmission and processing systems in resource-critical applications face significant challenges from adversarial perturbations that compromise mission-specific object classification. |