2026-01-08
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| PiC-BNN: A 128-kbit 65 nm Processing-in-CAM-Based End-to-End Binary Neural Network Accelerator | Yuval Harary, Almog Sharoni, Esteban Garzón, Marco Lanuzza, Adam Teman, Leonid Yavits | 2026-01-08 | Download | Binary Neural Networks (BNNs), where weights and activations are constrained to binary values (+1, -1), are a highly efficient alternative to traditional neural networks. |
| Supporting Secured Integration of Microarchitectural Defenses | Kartik Ramkrishnan, Stephen McCamant, Antonia Zhai, Pen-Chung Yew | 2026-01-08 | Download | There has been a plethora of microarchitectural-level attacks leading to many proposed countermeasures. This has created an unexpected and unaddressed security issue where naive integration of those d... |
| Challenges and Research Directions for Large Language Model Inference Hardware | Xiaoyu Ma, David Patterson | 2026-01-08 | Download | Large Language Model (LLM) inference is hard. The autoregressive Decode phase of the underlying Transformer model makes LLM inference fundamentally different from training. |
| MPM-LLM4DSE: Reaching the Pareto Frontier in HLS with Multimodal Learning and LLM-Driven Exploration | Lei Xu, Shanshan Wang, Chenglong Xiao | 2026-01-08 | Download | High-Level Synthesis (HLS) design space exploration (DSE) seeks Pareto-optimal designs within expansive pragma configuration spaces. To accelerate HLS DSE, graph neural networks (GNNs) are commonly em... |
| Memory-Guided Unified Hardware Accelerator for Mixed-Precision Scientific Computing | Chuanzhen Wang, Leo Zhang, Eric Liu | 2026-01-08 | Download | Recent hardware acceleration advances have enabled powerful specialized accelerators for finite element computations, spiking neural network inference, and sparse tensor operations. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Nalar: An agent serving framework | Marco Laju, Donghyun Son, Saurabh Agarwal, Nitin Kedia, Myungjin Lee, Jayanth Srinivasa, Aditya Akella | 2026-01-08 | Download | LLM-driven agentic applications increasingly automate complex, multi-step tasks, but serving them efficiently remains challenging due to heterogeneous components, dynamic and model-driven control flow... |
| Asynchronous Secure Federated Learning with Byzantine aggregators | Antonella Del Pozzo, Achille Desreumaux, Mathieu Gestin, Alexandre Rapetti, Sara Tucci-Piergiovanni | 2026-01-08 | Download | Privacy-preserving federated averaging is a central approach for protecting client privacy in federated learning. In this paper, we study this problem in an asynchronous communications setting with ma... |
| Parallel Quadratic Selected Inversion in Quantum Transport Simulation | Vincent Maillou, Matthias Bollhofer, Olaf Schenk, Alexandros Nikolaos Ziogas, Mathieu Luisier | 2026-01-08 | Download | Driven by Moore's Law, the dimensions of transistors have been pushed down to the nanometer scale. Advanced quantum transport (QT) solvers are required to accurately simulate such nano-devices. |
| Proof of Commitment: A Human-Centric Resource for Permissionless Consensus | Homayoun Maleki, Nekane Sainz, Jon Legarda | 2026-01-08 | Download | Permissionless consensus protocols require a scarce resource to regulate leader election and provide Sybil resistance. Existing paradigms such as Proof of Work and Proof of Stake instantiate this scar... |
| Cognitive Infrastructure: A Unified DCIM Framework for AI Data Centers | Krishna Chaitanya Sunkara | 2026-01-08 | Download | This work presents DCIM 3.0, a unified framework integrating semantic reasoning, predictive analytics, autonomous orchestration, and unified connectivity for next-generation AI data center management. |
| MoEBlaze: Breaking the Memory Wall for Efficient MoE Training on Modern GPUs | Jiyuan Zhang, Yining Liu, Siqi Yan, Lisen Deng, Jennifer Cao, Shuqi Yang, Min Ni, Bi Xue, Shen Li | 2026-01-08 | Download | The pervasive "memory wall" bottleneck is significantly amplified in modern large-scale Mixture-of-Experts (MoE) architectures. MoE's inherent architectural sparsity leads to sparse arithmetic compute... |
| MQ-GNN: A Multi-Queue Pipelined Architecture for Scalable and Efficient GNN Training | Irfan Ullah, Young-Koo Lee | 2026-01-08 | Download | Graph Neural Networks (GNNs) are powerful tools for learning graph-structured data, but their scalability is hindered by inefficient mini-batch generation, data transfer bottlenecks, and costly inter-... |
| Quantifying Autoscaler Vulnerabilities: An Empirical Study of Resource Misallocation Induced by Cloud Infrastructure Faults | Gijun Park | 2026-01-08 | Download | Resource autoscaling mechanisms in cloud environments depend on accurate performance metrics to make optimal provisioning decisions. When infrastructure faults including hardware malfunctions, network... |
| Mechanism Design for Federated Learning with Non-Monotonic Network Effects | Xiang Li, Bing Luo, Jianwei Huang, Yuan Luo | 2026-01-08 | Download | Mechanism design is pivotal to federated learning (FL) for maximizing social welfare by coordinating self-interested clients. Existing mechanisms, however, often overlook the network effects of client... |
| Timeliness-Oriented Scheduling and Resource Allocation in Multi-Region Collaborative Perception | Mengmeng Zhu, Yuxuan Sun, Yukuan Jia, Wei Chen, Bo Ai, Sheng Zhou | 2026-01-08 | Download | Collaborative perception (CP) is a critical technology in applications like autonomous driving and smart cities. It involves the sharing and fusion of information among sensors to overcome the limitat... |
| Sharded Elimination and Combining for Highly-Efficient Concurrent Stacks | Ajay Singh, Nikos Metaxakis, Panagiota Fatourou | 2026-01-08 | Download | We present a new blocking linearizable stack implementation which utilizes sharding and fetch&increment to achieve significantly better performance than all existing concurrent stacks. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| A DQN-based model for intelligent network selection in heterogeneous wireless systems | Fayssal Bendaoud, Asma Amraoui, Karim Sehimi | 2026-01-08 | Download | Wireless communications have been at the center of the revolution in technology for the last few years. The 5G communication system is the pinnacle of these technologies; however 4G LTE, WiFi, and eve... |
| 5G NR Non-Terrestrial Networks: From Early Results to the Road Ahead | Mattia Figaro, Francesco Rossato, Marco Giordani, Alessandro Traspadini, Takayuki Shimizu, Chinmay Mahabal, Sanjeewa Herath, Chunghan Lee, Dogan Kutay Pekcan, Michele Zorzi | 2026-01-08 | Download | This paper overviews the 3GPP 5G NR-NTN standard, detailing the evolution from Rel. 18 to 19 and innovations for Rel. 20. Using realistic ns-3 simulations validated against 3GPP calibration data, we e... |
| Intelligent resource allocation in wireless networks via deep reinforcement learning | Marie Diane Iradukunda, Chabi F. Elégbédé, Yaé Ulrich Gaba | 2026-01-08 | Download | This study addresses the challenge of optimal power allocation in stochastic wireless networks by employing a Deep Reinforcement Learning (DRL) framework. |
| A Mathematical Theory of Payment Channel Networks | Rene Pickhardt | 2026-01-08 | Download | We introduce a geometric theory of payment channel networks that centers the polytope of feasible wealth distributions; liquidity states project onto it via strict circulations. |
| Cognitive Infrastructure: A Unified DCIM Framework for AI Data Centers | Krishna Chaitanya Sunkara | 2026-01-08 | Download | This work presents DCIM 3.0, a unified framework integrating semantic reasoning, predictive analytics, autonomous orchestration, and unified connectivity for next-generation AI data center management. |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| DAVOS: An Autonomous Vehicle Operating System in the Vehicle Computing Era | Yuxin Wang, Yuankai He, Boyang Tian, Lichen Xian, Weisong Shi | 2026-01-08 | Download | Vehicle computing represents a fundamental shift in how autonomous vehicles are designed and deployed, transforming them from isolated transportation systems into mobile computing platforms that suppo... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| EARL: Energy-Aware Optimization of Liquid State Machines for Pervasive AI | Zain Iqbal, Lorenzo Valerio | 2026-01-08 | Download | Pervasive AI increasingly depends on on-device learning systems that deliver low-latency and energy-efficient computation under strict resource constraints. |
| Parallel Quadratic Selected Inversion in Quantum Transport Simulation | Vincent Maillou, Matthias Bollhofer, Olaf Schenk, Alexandros Nikolaos Ziogas, Mathieu Luisier | 2026-01-08 | Download | Driven by Moore's Law, the dimensions of transistors have been pushed down to the nanometer scale. Advanced quantum transport (QT) solvers are required to accurately simulate such nano-devices. |
| The Dark Side of Dark Mode -- User behaviour rebound effects and consequences for digital energy consumption | Zak Datson | 2026-01-08 | Download | User devices are the largest contributor to media related global emissions. For web content, dark mode has been widely recommended as an energy-saving measure for certain display types. |
| GPU-Accelerated INT8 Quantization for KV Cache Compression in Large Language Models | Maanas Taneja, Purab Shingvi | 2026-01-08 | Download | The key-value (KV) cache in large language models presents a significant memory bottleneck during inference, growing linearly with sequence length and often exceeding the memory footprint of model wei... |
| MQ-GNN: A Multi-Queue Pipelined Architecture for Scalable and Efficient GNN Training | Irfan Ullah, Young-Koo Lee | 2026-01-08 | Download | Graph Neural Networks (GNNs) are powerful tools for learning graph-structured data, but their scalability is hindered by inefficient mini-batch generation, data transfer bottlenecks, and costly inter-... |
| Personalized Model-Based Design of Human Centric AI enabled CPS for Long term usage | Bernard Ngabonziza, Ayan Banerjee, Sandeep K. S. Gupta | 2026-01-08 | Download | Human centric critical systems are increasingly involving artificial intelligence to enable knowledge extraction from sensor collected data. Examples include medical monitoring and control systems, ge... |