
2026-01-08

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| PiC-BNN: A 128-kbit 65 nm Processing-in-CAM-Based End-to-End Binary Neural Network Accelerator | Yuval Harary, Almog Sharoni, Esteban Garzón, Marco Lanuzza, Adam Teman, Leonid Yavits | 2026-01-08 | Download | Binary Neural Networks (BNNs), where weights and activations are constrained to binary values (+1, -1), are a highly efficient alternative to traditional neural networks. |
| Supporting Secured Integration of Microarchitectural Defenses | Kartik Ramkrishnan, Stephen McCamant, Antonia Zhai, Pen-Chung Yew | 2026-01-08 | Download | There has been a plethora of microarchitectural-level attacks leading to many proposed countermeasures. This has created an unexpected and unaddressed security issue where naive integration of those d... |
| Challenges and Research Directions for Large Language Model Inference Hardware | Xiaoyu Ma, David Patterson | 2026-01-08 | Download | Large Language Model (LLM) inference is hard. The autoregressive Decode phase of the underlying Transformer model makes LLM inference fundamentally different from training. |
| MPM-LLM4DSE: Reaching the Pareto Frontier in HLS with Multimodal Learning and LLM-Driven Exploration | Lei Xu, Shanshan Wang, Chenglong Xiao | 2026-01-08 | Download | High-Level Synthesis (HLS) design space exploration (DSE) seeks Pareto-optimal designs within expansive pragma configuration spaces. To accelerate HLS DSE, graph neural networks (GNNs) are commonly em... |
| Memory-Guided Unified Hardware Accelerator for Mixed-Precision Scientific Computing | Chuanzhen Wang, Leo Zhang, Eric Liu | 2026-01-08 | Download | Recent hardware acceleration advances have enabled powerful specialized accelerators for finite element computations, spiking neural network inference, and sparse tensor operations. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Nalar: An agent serving framework | Marco Laju, Donghyun Son, Saurabh Agarwal, Nitin Kedia, Myungjin Lee, Jayanth Srinivasa, Aditya Akella | 2026-01-08 | Download | LLM-driven agentic applications increasingly automate complex, multi-step tasks, but serving them efficiently remains challenging due to heterogeneous components, dynamic and model-driven control flow... |
| Asynchronous Secure Federated Learning with Byzantine aggregators | Antonella Del Pozzo, Achille Desreumaux, Mathieu Gestin, Alexandre Rapetti, Sara Tucci-Piergiovanni | 2026-01-08 | Download | Privacy-preserving federated averaging is a central approach for protecting client privacy in federated learning. In this paper, we study this problem in an asynchronous communications setting with ma... |
| Parallel Quadratic Selected Inversion in Quantum Transport Simulation | Vincent Maillou, Matthias Bollhofer, Olaf Schenk, Alexandros Nikolaos Ziogas, Mathieu Luisier | 2026-01-08 | Download | Driven by Moore's Law, the dimensions of transistors have been pushed down to the nanometer scale. Advanced quantum transport (QT) solvers are required to accurately simulate such nano-devices. |
| Proof of Commitment: A Human-Centric Resource for Permissionless Consensus | Homayoun Maleki, Nekane Sainz, Jon Legarda | 2026-01-08 | Download | Permissionless consensus protocols require a scarce resource to regulate leader election and provide Sybil resistance. Existing paradigms such as Proof of Work and Proof of Stake instantiate this scar... |
| Cognitive Infrastructure: A Unified DCIM Framework for AI Data Centers | Krishna Chaitanya Sunkara | 2026-01-08 | Download | This work presents DCIM 3.0, a unified framework integrating semantic reasoning, predictive analytics, autonomous orchestration, and unified connectivity for next-generation AI data center management. |
| MoEBlaze: Breaking the Memory Wall for Efficient MoE Training on Modern GPUs | Jiyuan Zhang, Yining Liu, Siqi Yan, Lisen Deng, Jennifer Cao, Shuqi Yang, Min Ni, Bi Xue, Shen Li | 2026-01-08 | Download | The pervasive "memory wall" bottleneck is significantly amplified in modern large-scale Mixture-of-Experts (MoE) architectures. MoE's inherent architectural sparsity leads to sparse arithmetic compute... |
| MQ-GNN: A Multi-Queue Pipelined Architecture for Scalable and Efficient GNN Training | Irfan Ullah, Young-Koo Lee | 2026-01-08 | Download | Graph Neural Networks (GNNs) are powerful tools for learning graph-structured data, but their scalability is hindered by inefficient mini-batch generation, data transfer bottlenecks, and costly inter-... |
| Quantifying Autoscaler Vulnerabilities: An Empirical Study of Resource Misallocation Induced by Cloud Infrastructure Faults | Gijun Park | 2026-01-08 | Download | Resource autoscaling mechanisms in cloud environments depend on accurate performance metrics to make optimal provisioning decisions. When infrastructure faults including hardware malfunctions, network... |
| Mechanism Design for Federated Learning with Non-Monotonic Network Effects | Xiang Li, Bing Luo, Jianwei Huang, Yuan Luo | 2026-01-08 | Download | Mechanism design is pivotal to federated learning (FL) for maximizing social welfare by coordinating self-interested clients. Existing mechanisms, however, often overlook the network effects of client... |
| Timeliness-Oriented Scheduling and Resource Allocation in Multi-Region Collaborative Perception | Mengmeng Zhu, Yuxuan Sun, Yukuan Jia, Wei Chen, Bo Ai, Sheng Zhou | 2026-01-08 | Download | Collaborative perception (CP) is a critical technology in applications like autonomous driving and smart cities. It involves the sharing and fusion of information among sensors to overcome the limitat... |
| Sharded Elimination and Combining for Highly-Efficient Concurrent Stacks | Ajay Singh, Nikos Metaxakis, Panagiota Fatourou | 2026-01-08 | Download | We present a new blocking linearizable stack implementation which utilizes sharding and fetch&increment to achieve significantly better performance than all existing concurrent stacks. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A DQN-based model for intelligent network selection in heterogeneous wireless systems | Fayssal Bendaoud, Asma Amraoui, Karim Sehimi | 2026-01-08 | Download | Wireless communications have been at the center of the revolution in technology for the last few years. The 5G communication system is the pinnacle of these technologies; however 4G LTE, WiFi, and eve... |
| 5G NR Non-Terrestrial Networks: From Early Results to the Road Ahead | Mattia Figaro, Francesco Rossato, Marco Giordani, Alessandro Traspadini, Takayuki Shimizu, Chinmay Mahabal, Sanjeewa Herath, Chunghan Lee, Dogan Kutay Pekcan, Michele Zorzi | 2026-01-08 | Download | This paper overviews the 3GPP 5G NR-NTN standard, detailing the evolution from Rel. 18 to 19 and innovations for Rel. 20. Using realistic ns-3 simulations validated against 3GPP calibration data, we e... |
| Intelligent resource allocation in wireless networks via deep reinforcement learning | Marie Diane Iradukunda, Chabi F. Elégbédé, Yaé Ulrich Gaba | 2026-01-08 | Download | This study addresses the challenge of optimal power allocation in stochastic wireless networks by employing a Deep Reinforcement Learning (DRL) framework. |
| A Mathematical Theory of Payment Channel Networks | Rene Pickhardt | 2026-01-08 | Download | We introduce a geometric theory of payment channel networks that centers the polytope $W_G$ of feasible wealth distributions; liquidity states $L_G$ project onto $W_G$ via strict circulations. |
| Cognitive Infrastructure: A Unified DCIM Framework for AI Data Centers | Krishna Chaitanya Sunkara | 2026-01-08 | Download | This work presents DCIM 3.0, a unified framework integrating semantic reasoning, predictive analytics, autonomous orchestration, and unified connectivity for next-generation AI data center management. |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| DAVOS: An Autonomous Vehicle Operating System in the Vehicle Computing Era | Yuxin Wang, Yuankai He, Boyang Tian, Lichen Xian, Weisong Shi | 2026-01-08 | Download | Vehicle computing represents a fundamental shift in how autonomous vehicles are designed and deployed, transforming them from isolated transportation systems into mobile computing platforms that suppo... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| EARL: Energy-Aware Optimization of Liquid State Machines for Pervasive AI | Zain Iqbal, Lorenzo Valerio | 2026-01-08 | Download | Pervasive AI increasingly depends on on-device learning systems that deliver low-latency and energy-efficient computation under strict resource constraints. |
| Parallel Quadratic Selected Inversion in Quantum Transport Simulation | Vincent Maillou, Matthias Bollhofer, Olaf Schenk, Alexandros Nikolaos Ziogas, Mathieu Luisier | 2026-01-08 | Download | Driven by Moore's Law, the dimensions of transistors have been pushed down to the nanometer scale. Advanced quantum transport (QT) solvers are required to accurately simulate such nano-devices. |
| The Dark Side of Dark Mode -- User behaviour rebound effects and consequences for digital energy consumption | Zak Datson | 2026-01-08 | Download | User devices are the largest contributor to media related global emissions. For web content, dark mode has been widely recommended as an energy-saving measure for certain display types. |
| GPU-Accelerated INT8 Quantization for KV Cache Compression in Large Language Models | Maanas Taneja, Purab Shingvi | 2026-01-08 | Download | The key-value (KV) cache in large language models presents a significant memory bottleneck during inference, growing linearly with sequence length and often exceeding the memory footprint of model wei... |
| MQ-GNN: A Multi-Queue Pipelined Architecture for Scalable and Efficient GNN Training | Irfan Ullah, Young-Koo Lee | 2026-01-08 | Download | Graph Neural Networks (GNNs) are powerful tools for learning graph-structured data, but their scalability is hindered by inefficient mini-batch generation, data transfer bottlenecks, and costly inter-... |
| Personalized Model-Based Design of Human Centric AI enabled CPS for Long term usage | Bernard Ngabonziza, Ayan Banerjee, Sandeep K. S. Gupta | 2026-01-08 | Download | Human centric critical systems are increasingly involving artificial intelligence to enable knowledge extraction from sensor collected data. Examples include medical monitoring and control systems, ge... |
