2025-08-25

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
OmniSim: Simulating Hardware with C Speed and RTL Accuracy for High-Level Synthesis Designs	Rishov Sarkar, Cong Hao	2025-08-25	Download	High-Level Synthesis (HLS) is increasingly popular for hardware design using C/C++ instead of Register-Transfer Level (RTL). To express concurrent hardware behavior in a sequential language like C/C++...
Views: a hardware-friendly graph database model for storing semantic information	Yanjun Yang, Adrian Wheeldon, Yihan Pan, Themis Prodromakis, Alex Serb	2025-08-25	Download	The graph database (GDB) is an increasingly common storage model for data involving relationships between entries. Beyond its widespread usage in database industries, the advantages of GDBs indicate a...
Anatomy of the gem5 Simulator: AtomicSimpleCPU, TimingSimpleCPU, O3CPU, and Their Interaction with the Ruby Memory System	Johan Söderström, Yuan Yao	2025-08-25	Download	gem5 is a popular modular-based computer system simulator, widely used in computer architecture research and known for its long simulation time and steep learning curve.
Opportunities and Challenges for 3D Systems and Their Design	Philip Emma, Eren Kurshan	2025-08-25	Download	Although it is not a new concept, 3D integration increasingly receives widespread interest and focus as lithographic scaling becomes more challenging, and as the ability to make miniature vias greatly...
LLMulator: Generalizable Cost Modeling for Dataflow Accelerators with Input-Adaptive Control Flow	Kaiyan Chang, Wenlong Zhu, Shengwen Liang, Huawei Li, Ying Wang	2025-08-25	Download	Accurate and fast performance prediction for dataflow-based accelerators is vital for efficient hardware design and design space exploration, yet existing methods struggle to generalize across archite...
In-Memory Computing Enabled Deep MIMO Detection to Support Ultra-Low-Latency Communications	Tingyu Ding, Qunsong Zeng, Kaibin Huang	2025-08-25	Download	The development of sixth-generation (6G) mobile networks imposes unprecedented latency and reliability demands on multiple-input multiple-output (MIMO) communication systems, a key enabler of high-spe...
TLGLock: A New Approach in Logic Locking Using Key-Driven Charge Recycling in Threshold Logic Gates	Abdullah Sahruri, Martin Margala	2025-08-25	Download	Logic locking remains one of the most promising defenses against hardware piracy, yet current approaches often face challenges in scalability and design overhead.
Structural Mutation Based Differential Testing for FPGA Logic Synthesis Compilers	Zhihao Xu, Shikai Guo, Guilin Zhao, Siwen Wang, Qian Ma, Hui Li, Furui Zhan	2025-08-25	Download	Field Programmable Gate Arrays (FPGAs) play a crucial role in Electronic Design Automation (EDA) applications, which have been widely used in safety-critical environments, including aerospace, chip ma...
Characterizing the Behavior of Training Mamba-based State Space Models on GPUs	Trinayan Baruah, Kaustubh Shivdikar, Sara Prescott, David Kaeli	2025-08-25	Download	Mamba-based State Space Models (SSM) have emerged as a promising alternative to the ubiquitous transformers. Despite the expressive power of transformers, the quadratic complexity of computing attenti...
A 28nm 1.80Mb/mm2 Digital/Analog Hybrid SRAM-CIM Macro Using 2D-Weighted Capacitor Array for Complex Number Mac Operations	Shota Konno, Che-Kai Liu, Sigang Ryu, Samuel Spetalnick, Arijit Raychowdhury	2025-08-25	Download	A 28nm dense 6T-SRAM Digital(D)/Analog(A) Hybrid compute-in-memory (CIM) macro supporting complex number MAC operation is presented. By introducing a 2D-weighted Capacitor Array, a hybrid configurati...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Managing Multi Instance GPUs for High Throughput and Energy Savings	Abhijeet Saraha, Yuanbo Li, Chris Porter, Santosh Pande	2025-08-25	Download	Modern GPUs such as the Ampere series (A30, A100) as well as the Hopper series (H100, H200) offer performance as well as security isolation features.
Experiences with Model Context Protocol Servers for Science and High Performance Computing	Haochen Pan, Ryan Chard, Reid Mello, Christopher Grams, Tanjin He, Alexander Brace, Owen Price Skelly, Will Engler, Hayden Holbrook, Song Young Oh, Maxime Gonthier, Michael Papka, Ben Blaiszik, Kyle Chard, Ian Foster	2025-08-25	Download	Large language model (LLM)-powered agents are increasingly used to plan and execute scientific workflows, yet most research cyberinfrastructure (CI) exposes heterogeneous APIs and implements security ...
DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction	Weilin Cai, Le Qin, Shwai He, Junwei Cui, Ang Li, Jiayi Huang	2025-08-25	Download	Mixture of Experts (MoE) has become a mainstream architecture for building Large Language Models (LLMs) by reducing per-token computation while enabling model scaling.
FSA: An Alternative Efficient Implementation of Native Sparse Attention Kernel	Ran Yan, Youhe Jiang, Zhuoming Chen, Haohui Mai, Beidi Chen, Binhang Yuan	2025-08-25	Download	Recent advance in sparse attention mechanisms has demonstrated strong potential for reducing the computational cost of long-context training and inference in large language models (LLMs).
Practical GPU Choices for Earth Observation: ResNet-50 Training Throughput on Integrated, Laptop, and Cloud Accelerators	Ritvik Chaturvedi	2025-08-25	Download	This project implements a ResNet-based pipeline for land use and land cover (LULC) classification on Sentinel-2 imagery, benchmarked across three heterogeneous GPUs.
Wait-free Replicated Data Types and Fair Reconciliation	Petr Kuznetsov, Maxence Perion, Sara Tucci-Piergiovanni	2025-08-25	Download	Replication ensures data availability in fault-prone distributed systems. The celebrated CAP theorem stipulates that replicas cannot guarantee both strong consistency and availability under network pa...
Views: a hardware-friendly graph database model for storing semantic information	Yanjun Yang, Adrian Wheeldon, Yihan Pan, Themis Prodromakis, Alex Serb	2025-08-25	Download	The graph database (GDB) is an increasingly common storage model for data involving relationships between entries. Beyond its widespread usage in database industries, the advantages of GDBs indicate a...
Scalable Engine and the Performance of Different LLM Models in a SLURM based HPC architecture	Anderson de Lima Luiz, Shubham Vijay Kurlekar, Munir Georges	2025-08-25	Download	This work elaborates on a High performance computing (HPC) architecture based on Simple Linux Utility for Resource Management (SLURM) [1] for deploying heterogeneous Large Language Models (LLMs) into ...
ExpertWeave: Efficiently Serving Expert-Specialized Fine-Tuned Adapters at Scale	Ge Shi, Hanieh Sadri, Qian Wang, Yu Zhang, Ying Xiong, Yong Zhang, Zhenan Fan	2025-08-25	Download	Expert-Specialized Fine-Tuning (ESFT) adapts Mixture-of-Experts (MoE) large language models to enhance their task-specific performance by selectively tuning the top-activated experts for the task.
Zen-Attention: A Compiler Framework for Dynamic Attention Folding on AMD NPUs	Aadesh Deshmukh, Venkata Yaswanth Raparti, Samuel Hsu	2025-08-25	Download	Transformer-based deep learning models are increasingly deployed on energy, and DRAM bandwidth constrained devices such as laptops and gaming consoles, which presents significant challenges in meeting...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Digital Twin-Guided Energy Management over Real-Time Pub/Sub Protocol in 6G Smart Cities	Kubra Duran, Lal Verda Cakir, Sana Ullah Jan, Kerem Gursu, Berk Canberk	2025-08-25	Download	Although the emergence of 6G IoT networks has accelerated the deployment of enhanced smart city services, the resource limitations of IoT devices remain as a significant problem.
Quantum Paths: a Quantum Walk approach	Claudio Pellitteri, Marcello Caleffi, Angela Sara Cacciapuoti	2025-08-25	Download	The quantum switch, a process enabling a coherent superposition of different orders of quantum channels, has garnered significant attention due to its ability to enable noiseless communications throug...
Automating Conflict-Aware ACL Configurations with Natural Language Intents	Wenlong Ding, Jianqiang Li, Zhixiong Niu, Huangxun Chen, Yongqiang Xiong, Hong Xu	2025-08-25	Download	ACL configuration is essential for managing network flow reachability, yet its complexity grows significantly with topologies and pre-existing rules.
Digital Twin Assisted Proactive Management in Zero Touch Networks	Tamizhelakkiya K, Dibakar Das, Komal Sharma, Jyotsna Bapat, Debabrata Das	2025-08-25	Download	The rapid expansion of cellular networks and rising demand for high-quality services require efficient and autonomous network management solutions.
PRZK-Bind: A Physically Rooted Zero-Knowledge Authentication Protocol for Secure Digital Twin Binding in Smart Cities	Yagmur Yigit, Mehmet Ali Erturk, Kerem Gursu, Berk Canberk	2025-08-25	Download	Digital twin (DT) technology is rapidly becoming essential for smart city ecosystems, enabling real-time synchronisation and autonomous decision-making across physical and digital domains.
Real World Assets on-Chain Assistance Low-Altitude Computility Networks: Architecture, Methodology, and Challenges	Haoxiang Luo, Ruichen Zhang, Yinqiu Liu, Gang Sun, Hongfang Yu, Zhu Han	2025-08-25	Download	Low-altitude airspace is becoming a new frontier for smart city services and commerce. Networks of drones, electric Vertical Takeoff and Landing (eVTOL) vehicles, and other aircraft, termed Low-Altitu...
AgentRAN: An Agentic AI Architecture for Autonomous Control of Open 6G Networks	Maxime Elkael, Salvatore D'Oro, Leonardo Bonati, Michele Polese, Yunseong Lee, Koichiro Furueda, Tommaso Melodia	2025-08-25	Download	Despite the programmable architecture of Open RAN, today's deployments still rely heavily on static control and manual operations. To move beyond this limitation, we introduce AgentRAN, an AI-native, ...
Sustainability or Survivability? Eliminating the Need to Choose in LEO Satellite Constellations	Chris Misa, Ramakrishnan Durairajan	2025-08-25	Download	LEO Satellite Networks (LSNs) are revolutionizing global connectivity, but their reliance on tens of thousands of satellites raises pressing concerns over sustainability and survivability.
Optimizing Anonymity and Efficiency: A Critical Review of Path Selection Strategies in Tor	Siddique Abubakr Muntaka, Jacques Bou Abdo	2025-08-25	Download	The Onion Router (Tor) relies on path selection algorithms to balance performance and anonymity by determining how traffic flows through its relay network.
Not All Visitors are Bilingual: A Measurement Study of the Multilingual Web from an Accessibility Perspective	Masudul Hasan Masud Bhuiyan, Matteo Varvello, Yasir Zaki, Cristian-Alexandru Staicu	2025-08-25	Download	English is the predominant language on the web, powering nearly half of the world's top ten million websites. Support for multilingual content is nevertheless growing, with many websites increasingly ...

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Puzzle: Scheduling Multiple Deep Learning Models on Mobile Device with Heterogeneous Processors	Duseok Kang, Yunseong Lee, Junghoon Kim	2025-08-25	Download	As deep learning models are increasingly deployed on mobile devices, modern mobile devices incorporate deep learning-specific accelerators to handle the growing computational demands, thus increasing ...

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Secure Password Generator Based on Secure Pseudo-Random Number Generator	Abel C. H. Chen	2025-08-25	Download	In recent years, numerous incidents involving the leakage of website accounts and text passwords (referred to as passwords) have raised significant concerns regarding the potential exposure of persona...
OmniSim: Simulating Hardware with C Speed and RTL Accuracy for High-Level Synthesis Designs	Rishov Sarkar, Cong Hao	2025-08-25	Download	High-Level Synthesis (HLS) is increasingly popular for hardware design using C/C++ instead of Register-Transfer Level (RTL). To express concurrent hardware behavior in a sequential language like C/C++...