Skip to content

2025-12-23

cs.AR - Architecture

标题作者发布日期PDF摘要
NotSoTiny: A Large, Living Benchmark for RTL Code GenerationRazine Moundir Ghorab, Emanuele Parisi, Cristian Gutierrez, Miquel Alberti-Binimelis, Miquel Moreto, Dario Garcia-Gasulla, Gokcen Kestor2025-12-23下载LLMs have shown early promise in generating RTL code, yet evaluating their capabilities in realistic setups remains a challenge. So far, RTL benchmarks have been limited in scale, skewed toward trivia...
Composing Mini Oscilloscope on Embedded SystemsBrennan Romero, D. G. Perera2025-12-23下载In this paper, our goal is to reproduce the basic functionalities of a regular oscilloscope, using the Nuvoton NUC-140 embedded systems development platform as the front-end and display method.
Nebula: Enable City-Scale 3D Gaussian Splatting in Virtual Reality via Collaborative Rendering and Accelerated Stereo RasterizationHe Zhu, Zheng Liu, Xingyang Li, Anbang Wu, Jieru Zhao, Fangxin Liu, Yiming Gan, Jingwen Leng, Yu Feng2025-12-23下载3D Gaussian splatting (3DGS) has drawn significant attention in the architectural community recently. However, current architectural designs often overlook the 3DGS scalability, making them fragile fo...
Power Side-Channel Analysis of the CVA6 RISC-V Core at the RTL Level Using VeriSideBehnam Farnaghinejad, Antonio Porsia, Annachiara Ruospo, Alessandro Savino, Stefano Di Carlo, Ernesto Sanchez2025-12-23下载Security in modern RISC-V processors demands more than functional correctness: It requires resilience to side-channel attacks. This paper evaluates the vulnerability of the side channel of the CVA6 RI...
Designing Spatial Architectures for Sparse Attention: STAR Accelerator via Cross-Stage TilingHuizheng Wang, Taiquan Wei, Hongbin Wang, Zichuan Wang, Xinru Tang, Zhiheng Yue, Shaojun Wei, Yang Hu, Shouyi Yin2025-12-23下载Large language models (LLMs) rely on self-attention for contextual understanding, demanding high-throughput inference and large-scale token parallelism (LTPP).
3D Stack In-Sensor-Computing (3DS-ISC): Accelerating Time-Surface Construction for Neuromorphic Event CamerasHongyang Shang, Shuai Dong, Ye Ke, Arindam Basu2025-12-23下载This work proposes a 3D Stack In-Sensor-Computing (3DS-ISC) architecture for efficient event-based vision processing. A real-time normalization method using an exponential decay function is introduced...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
RHAPSODY: Execution of Hybrid AI-HPC Workflows at ScaleAymen Alsaadi, Mason Hooten, Mariya Goliyad, Andre Merzky, Andrew Shao, Mikhail Titov, Tianle Wang, Yian Chen, Maria Kalantzi, Kent Lee, Andrew Park, Indira Pimpalkhare, Nick Radcliffe, Colin Wahl, Pete Mendygral, Matteo Turilli, Shantenu Jha2025-12-23下载Hybrid AI-HPC workflows combine large-scale simulation, training, high-throughput inference, and tightly coupled, agent-driven control within a single execution campaign.
SoK: Speedy Secure FinalityYash Saraswat, Abhimanyu Nag2025-12-23下载While Ethereum has successfully achieved dynamic availability together with safety, a fundamental delay remains between transaction execution and immutable finality.
Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMsRui Pan, Zhuofu Chen, Hongyi Liu, Arvind Krishnamurthy, Ravi Netravali2025-12-23下载Diffusion Large Language Models (dLLMs) offer fast, parallel token generation, but their standalone use is plagued by an inherent efficiency-quality tradeoff.
WOC: Dual-Path Weighted Object Consensus Made EfficientTanisha Fonseca, Gengrui Zhang2025-12-23下载Modern distributed systems face a critical challenge: existing consensus protocols optimize for either node heterogeneity or workload independence, but not both.
Resilient Packet Forwarding: A Reinforcement Learning Approach to Routing in Gaussian Interconnected Networks with Clustered FaultsMohammad Walid Charrwi, Zaid Hussain2025-12-23下载As Network-on-Chip (NoC) and Wireless Sensor Network architectures continue to scale, the topology of the underlying network becomes a critical factor in performance.
Clust-PSI-PFL: A Population Stability Index Approach for Clustered Non-IID Personalized Federated LearningDaniel M. Jimenez-Gutierrez, Mehrdad Hassanzadeh, David Solans, Mohammed Elbamby, Nicolas Kourtellis, Aris Anagnostopoulos, Ioannis Chatzigiannakis, Andrea Vitaletti2025-12-23下载Federated learning (FL) supports privacy-preserving, decentralized machine learning (ML) model training by keeping data on client devices. However, non-independent and identically distributed (non-IID...
Predictive-LoRA: A Proactive and Fragmentation-Aware Serverless Inference System for LLMsYinan Ni, Xiao Yang, Yuqi Tang, Zhimin Qiu, Chen Wang, Tingzhou Yuan2025-12-23下载The serverless computing paradigm offers compelling advantages for deploying Large Language Model (LLM) inference services, including elastic scaling and pay-per-use billing.
Reaching Agreement Among Reasoning LLM AgentsChaoyi Ruan, Yiliang Wang, Ziji Shi, Jialin Li2025-12-23下载Multi-agent systems have extended the capability of agentic AI. Instead of single inference passes, multiple agents perform collective reasoning to derive high quality answers.
SHIRO: Near-Optimal Communication Strategies for Distributed Sparse Matrix MultiplicationChen Zhuang, Lingqi Zhang, Benjamin Brock, Du Wu, Peng Chen, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib2025-12-23下载Distributed Sparse Matrix-Matrix Multiplication (SpMM) is a fundamental operation in numerous high-performance computing and deep learning applications.
Population Protocols Revisited: Parity and BeyondLeszek Gąsieniec, Tytus Grodzicki, Tomasz Jurdziński, Jakub Kowalski, Grzegorz Stachowiak2025-12-23下载For nearly two decades, population protocols have been extensively studied, yielding efficient solutions for central problems in distributed computing, including leader election, and majority computat...
Scalable Cloud-Native Architectures for Intelligent PMU Data ProcessingNachiappan Chockalingam, Akshay Deshpande, Lokesh Butra, Ram Sekhar Bodala, Nitin Saksena, Adithya Parthasarathy, Balakrishna Pothineni, Akash Kumar Agarwal2025-12-23下载Phasor Measurement Units (PMUs) generate high-frequency, time-synchronized data essential for real-time power grid monitoring, yet the growing scale of PMU deployments creates significant challenges i...
FastMPS: Revisit Data Parallel in Large-scale Matrix Product State SamplingYaojian Chen, Si-Qiu Gong, Lin Gan, Yanfei Liu, An Yang, Yinuo Wang, Chao-yang Lu, Guangwen Yang2025-12-23下载Matrix Product State (MPS) is a versatile tensor network representation widely applied in quantum physics, quantum chemistry, and machine learning, etc.
Scaling Point-based Differentiable Rendering for Large-scale ReconstructionHexu Zhao, Xiaoteng Liu, Xiwen Min, Jianhao Huang, Youming Deng, Yanfei Li, Ang Li, Jinyang Li, Aurojit Panda2025-12-23下载Point-based Differentiable Rendering (PBDR) enables high-fidelity 3D scene reconstruction, but scaling PBDR to high-resolution and large scenes requires efficient distributed training systems.
Rethinking Knowledge Distillation in Collaborative Machine Learning: Memory, Knowledge, and Their InteractionsPengchao Han, Xi Huang, Yi Fang, Guojun Han2025-12-23下载Collaborative learning has emerged as a key paradigm in large-scale intelligent systems, enabling distributed agents to cooperatively train their models while addressing their privacy concerns.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
AI-Driven Green Cognitive Radio Networks for Sustainable 6G CommunicationAnshul Sharma, Shujaatali Badami, Biky Chouhan, Pushpanjali Pandey, Brijeena Rana, Navneet Kaur2025-12-23下载The 6G wireless aims at the Tb/s peak data rates are expected, a sub-millisecond latency, massive Internet of Things/vehicle connectivity, which requires sustainable access to audio over the air and e...
Towards a Security Plane for 6G EcosystemsXavi Masip-Bruin, Eva Rodríguez, Admela Jukan, Panos Trakadas2025-12-23下载6G networks promise to be the proper technology to support a wide deployment of highly demanding services, satisfying key users-related aspects such as extremely high quality, and persistent communica...
Evasion-Resilient Detection of DNS-over-HTTPS Data Exfiltration: A Practical Evaluation and ToolkitAdam Elaoumari2025-12-23下载The purpose of this project is to assess how well defenders can detect DNS-over-HTTPS (DoH) file exfiltration, and which evasion strategies can be used by attackers.
Base Station Deployment under EMF constrain by Deep Reinforcement learningMohammed Mallik, Guillaume Villemaud2025-12-23下载As 5G networks rapidly expand and 6G technologies emerge, characterized by dense deployments, millimeter-wave communications, and dynamic beamforming, the need for scalable simulation tools becomes in...
Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous NetworksDivya Vijay, Vignesh Ethiraj2025-12-23下载As networks evolve toward 5G Standalone and 6G, operators face orchestration challenges that exceed the limits of static automation and Deep Reinforcement Learning.
Post-Quantum Cryptography in the 5G CoreThomas Attema, Bor de Kock, Sandesh Manganahalli Jayaprakash, Dimitrios Schoinianakis, Thom Sijpesteijn, Rintse van de Vlasakker2025-12-23下载In this work, the conventional cryptographic algorithms used in the 5G Core are replaced with post-quantum alternatives and the practical impact of this transition is evaluated.
Edge-Served Congestion Control for Wireless Multipath Transmission with a Transformer AgentLiang Wang2025-12-23下载Multipath TCP is widely adopted to enhance connection quality-of-service by leveraging multiple network pathways on modern devices. However, the evolution of its core congestion control is hindered by...
CBA: Communication-Bound-Aware Cross-Domain Resource Assignment for Pipeline-Parallel Distributed LLM Training in Dynamic Multi-DC Optical NetworksDianxuan Fu, Xiaomin Liu, Yihao Zhang, Shikui Shen, Weisheng Hu, Qunbi Zhuge2025-12-23下载We propose a communication-bound-aware cross-domain resource assignment framework for pipeline-parallel distributed training over multi-datacenter optical networks, which lowers iteration time by 31.
VNF-Cache: An In-Network Key-Value Store Cache Based on Network Function VirtualizationBruno E. Farias, José Flauzino, Elias P. Duarte2025-12-23下载With the exponential growth of the amount of data available on the Internet, optimizing the response time and resource usage for data access becomes essential.
ReGAIN: Retrieval-Grounded AI Framework for Network Traffic AnalysisShaghayegh Shajarian, Kennedy Marsh, James Benson, Sajad Khorsandroo, Mahmoud Abdelsalam2025-12-23下载Modern networks generate vast, heterogeneous traffic that must be continuously analyzed for security and performance. Traditional network traffic analysis systems, whether rule-based or machine learni...

cs.PF - Performance

标题作者发布日期PDF摘要
Post-Quantum Cryptography in the 5G CoreThomas Attema, Bor de Kock, Sandesh Manganahalli Jayaprakash, Dimitrios Schoinianakis, Thom Sijpesteijn, Rintse van de Vlasakker2025-12-23下载In this work, the conventional cryptographic algorithms used in the 5G Core are replaced with post-quantum alternatives and the practical impact of this transition is evaluated.
SHIRO: Near-Optimal Communication Strategies for Distributed Sparse Matrix MultiplicationChen Zhuang, Lingqi Zhang, Benjamin Brock, Du Wu, Peng Chen, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib2025-12-23下载Distributed Sparse Matrix-Matrix Multiplication (SpMM) is a fundamental operation in numerous high-performance computing and deep learning applications.

基于 VitePress 构建