
2026-01-12

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| GRPO with State Mutations: Improving LLM-Based Hardware Test Plan Generation | Dimple Vijay Kochar, Nathaniel Pinckney, Guan-Ting Liu, Chia-Tung Ho, Chenhui Deng, Haoxing Ren, Brucek Khailany | 2026-01-12 | Download | RTL design often relies heavily on ad-hoc testbench creation early in the design cycle. While large language models (LLMs) show promise for RTL code generation, their ability to reason about hardware ... |
| VLM-CAD: VLM-Optimized Collaborative Agent Design Workflow for Analog Circuit Sizing | Guanyuan Pan, Shuai Wang, Yugui Lin, Tiansheng Zhou, Pietro Liò, Zhenxin Zhao, Yaqi Wang | 2026-01-12 | Download | Vision Language Models (VLMs) have demonstrated remarkable potential in multimodal reasoning, yet they inherently suffer from spatial blindness and logical hallucinations when interpreting densely str... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Bridging the Gap: Empowering Small Models in Reliable OpenACC-based Parallelization via GEPA-Optimized Prompting | Samyak Jhaveri, Cristina V. Lopes | 2026-01-12 | Download | OpenACC lowers the barrier to GPU offloading, but writing high-performing pragma remains complex, requiring deep domain expertise in memory hierarchies, data movement, and parallelization strategies. |
| Hierarchical Precision and Recursion for Accelerating Symmetric Linear Solves on MXUs | Vicki Carrica, Rabab Alomairy, Evelyne Ringoot, Alan Edelman | 2026-01-12 | Download | Symmetric linear solves are fundamental to a wide range of scientific and engineering applications, from climate modeling and structural analysis to machine learning and optimization. |
| Where to Split? A Pareto-Front Analysis of DNN Partitioning for Edge Inference | Adiba Masud, Nicholas Foley, Pragathi Durga Rajarajan, Palden Lama | 2026-01-12 | Download | The deployment of deep neural networks (DNNs) on resource-constrained edge devices is frequently hindered by their significant computational and memory requirements. |
| CRAFT: Cost-aware Expert Replica Allocation with Fine-Grained Layerwise Estimations | Adrian Zhao, Zhenkun Cai, Zhenyu Song, Lingfan Yu, Haozheng Fan, Jun Wu, Yida Wang, Nandita Vijaykumar | 2026-01-12 | Download | Mixture-of-Experts (MoE) has recently emerged as the mainstream architecture for efficiently scaling large language models while maintaining near-constant computational cost. |
| D-PDLP: Scaling PDLP to Distributed Multi-GPU Systems | Hongpei Li, Yicheng Huang, Huikang Liu, Dongdong Ge, Yinyu Ye | 2026-01-12 | Download | We present a distributed framework of the Primal-Dual Hybrid Gradient (PDHG) algorithm for solving massive-scale linear programming (LP) problems. |
| Peformance Isolation for Inference Processes in Edge GPU Systems | Juan José Martín, José Flich, Carles Hernández | 2026-01-12 | Download | This work analyzes the main isolation mechanisms available in modern NVIDIA GPUs: MPS, MIG, and the recent Green Contexts, to ensure predictable inference time in safety-critical applications using de... |
| Radio Labeling of Strong Prismatic Network With Star | Liming Wang, Feng Li, Linlin Cui | 2026-01-12 | Download | The rapid development of wireless communication has made efficient spectrum assignment a crucial factor in enhancing network performance. As a combinatorial optimization model for channel assignment, ... |
| MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era | Lei Zhang, Mouxiang Chen, Ruisheng Cao, Jiawei Chen, Fan Zhou, Yiheng Xu, Jiaxi Yang, Zeyao Ma, Liang Chen, Changwei Luo, Kai Zhang, Fan Yan, KaShun Shum, Jiajun Zhang, Zeyu Cui, Feng Hu, Junyang Lin, Binyuan Hui, Min Yang | 2026-01-12 | Download | The rapid development of interactive and autonomous AI systems signals our entry into the agentic era. Training and evaluating agents on complex agentic tasks such as software engineering and computer... |
| Advanced computing for reproducibility of astronomy Big Data Science, with a showcase of AMIGA and the SKA Science prototype | Julián Garrido, Susana Sánchez, Edgar Ribeiro João, Roger Ianjamasimanana, Manuel Parra, Lourdes Verdes-Montenegro | 2026-01-12 | Download | The Square Kilometre Array Observatory (SKAO) faces unprecedented technological challenges due to the vast scale and complexity of its data. This paper provides an overview of research by the AMIGA gr... |
| OpenTinker: Separating Concerns in Agentic Reinforcement Learning | Siqi Zhu, Jiaxuan You | 2026-01-12 | Download | We introduce OpenTinker, an infrastructure for reinforcement learning (RL) of large language model (LLM) agents built around a separation of concerns across algorithm design, execution, and agent-envi... |
| Bringing Computation to the data: Interoperable serverless function execution for astrophysical data analysis in the SRCNet | Manuel Parra-Royón, Julián Garrido-Sánchez, Susana Sánchez-Expósito, María Ángeles Mendoza, Rob Barnsley, Anthony Moraghan, Jesús Sánchez, Laura Darriba, Carlos Ruíz-Monje, Edgar Joao, Javier Moldón, Jesús Salgado, Lourdes Verdes-Montenegro | 2026-01-12 | Download | Serverless computing is a paradigm in which the underlying infrastructure is fully managed by the provider, enabling applications and services to be executed with elastic resource provisioning and min... |
| SC-MII: Infrastructure LiDAR-based 3D Object Detection on Edge Devices for Split Computing with Multiple Intermediate Outputs Integration | Taisuke Noguchi, Takayuki Nishio, Takuya Azumi | 2026-01-12 | Download | 3D object detection using LiDAR-based point cloud data and deep neural networks is essential in autonomous driving technology. However, deploying state-of-the-art models on edge devices present challe... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| TeeMAF: A TEE-Based Mutual Attestation Framework for On-Chain and Off-Chain Functions in Blockchain DApps | Xiangyu Liu, Brian Lee, Yuansong Qiao | 2026-01-12 | Download | The rapid development of Internet of Things (IoT) technology has led to growing concerns about data security and user privacy in the interactions within distributed systems. |
| A Protocol-Aware P4 Pipeline for MQTT Security and Anomaly Mitigation in Edge IoT Systems | Bui Ngoc Thanh Binh, Pham Hoai Luan, Le Vu Trung Duong, Vu Tuan Hai, Yasuhiko Nakashima | 2026-01-12 | Download | MQTT is the dominant lightweight publish-subscribe protocol for IoT deployments, yet edge security remains inadequate. Cloud-based intrusion detection systems add latency that is unsuitable for real-... |
| A Scalable Solution for Node Mobility Problems in NDN-Based Massive LEO Constellations | Miguel Rodríguez-Pérez, Sergio Herrería-Alonso, J. Carlos Lopez-Ardao, Andrés Suárez-González | 2026-01-12 | Download | In recent years, there has been increasing investment in the deployment of massive commercial Low Earth Orbit (LEO) constellations to provide global Internet connectivity. |
| Low-Altitude Satellite-AAV Collaborative Joint Mobile Edge Computing and Data Collection via Diffusion-based Deep Reinforcement Learning | Boxiong Wang, Hui Kang, Jiahui Li, Geng Sun, Zemin Sun, Jiacheng Wang, Dusit Niyato, Shiwen Mao | 2026-01-12 | Download | The integration of satellite and autonomous aerial vehicle (AAV) communications has become essential for the scenarios requiring both wide coverage and rapid deployment, particularly in remote or disa... |
| A Safety-Constrained Reinforcement Learning Framework for Reliable Wireless Autonomy | Abdikarim Mohamed Ibrahim, Rosdiadee Nordin | 2026-01-12 | Download | Artificial intelligence (AI) and reinforcement learning (RL) have shown significant promise in wireless systems, enabling dynamic spectrum allocation, traffic management, and large-scale Internet of T... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Peformance Isolation for Inference Processes in Edge GPU Systems | Juan José Martín, José Flich, Carles Hernández | 2026-01-12 | Download | This work analyzes the main isolation mechanisms available in modern NVIDIA GPUs: MPS, MIG, and the recent Green Contexts, to ensure predictable inference time in safety-critical applications using de... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Hierarchical Precision and Recursion for Accelerating Symmetric Linear Solves on MXUs | Vicki Carrica, Rabab Alomairy, Evelyne Ringoot, Alan Edelman | 2026-01-12 | Download | Symmetric linear solves are fundamental to a wide range of scientific and engineering applications, from climate modeling and structural analysis to machine learning and optimization. |
