Skip to content

2025-08-25

cs.AR - Architecture

标题作者发布日期PDF摘要
OmniSim: Simulating Hardware with C Speed and RTL Accuracy for High-Level Synthesis DesignsRishov Sarkar, Cong Hao2025-08-25下载High-Level Synthesis (HLS) is increasingly popular for hardware design using C/C++ instead of Register-Transfer Level (RTL). To express concurrent hardware behavior in a sequential language like C/C++...
Views: a hardware-friendly graph database model for storing semantic informationYanjun Yang, Adrian Wheeldon, Yihan Pan, Themis Prodromakis, Alex Serb2025-08-25下载The graph database (GDB) is an increasingly common storage model for data involving relationships between entries. Beyond its widespread usage in database industries, the advantages of GDBs indicate a...
Anatomy of the gem5 Simulator: AtomicSimpleCPU, TimingSimpleCPU, O3CPU, and Their Interaction with the Ruby Memory SystemJohan Söderström, Yuan Yao2025-08-25下载gem5 is a popular modular-based computer system simulator, widely used in computer architecture research and known for its long simulation time and steep learning curve.
Opportunities and Challenges for 3D Systems and Their DesignPhilip Emma, Eren Kurshan2025-08-25下载Although it is not a new concept, 3D integration increasingly receives widespread interest and focus as lithographic scaling becomes more challenging, and as the ability to make miniature vias greatly...
LLMulator: Generalizable Cost Modeling for Dataflow Accelerators with Input-Adaptive Control FlowKaiyan Chang, Wenlong Zhu, Shengwen Liang, Huawei Li, Ying Wang2025-08-25下载Accurate and fast performance prediction for dataflow-based accelerators is vital for efficient hardware design and design space exploration, yet existing methods struggle to generalize across archite...
In-Memory Computing Enabled Deep MIMO Detection to Support Ultra-Low-Latency CommunicationsTingyu Ding, Qunsong Zeng, Kaibin Huang2025-08-25下载The development of sixth-generation (6G) mobile networks imposes unprecedented latency and reliability demands on multiple-input multiple-output (MIMO) communication systems, a key enabler of high-spe...
TLGLock: A New Approach in Logic Locking Using Key-Driven Charge Recycling in Threshold Logic GatesAbdullah Sahruri, Martin Margala2025-08-25下载Logic locking remains one of the most promising defenses against hardware piracy, yet current approaches often face challenges in scalability and design overhead.
Structural Mutation Based Differential Testing for FPGA Logic Synthesis CompilersZhihao Xu, Shikai Guo, Guilin Zhao, Siwen Wang, Qian Ma, Hui Li, Furui Zhan2025-08-25下载Field Programmable Gate Arrays (FPGAs) play a crucial role in Electronic Design Automation (EDA) applications, which have been widely used in safety-critical environments, including aerospace, chip ma...
Characterizing the Behavior of Training Mamba-based State Space Models on GPUsTrinayan Baruah, Kaustubh Shivdikar, Sara Prescott, David Kaeli2025-08-25下载Mamba-based State Space Models (SSM) have emerged as a promising alternative to the ubiquitous transformers. Despite the expressive power of transformers, the quadratic complexity of computing attenti...
A 28nm 1.80Mb/mm2 Digital/Analog Hybrid SRAM-CIM Macro Using 2D-Weighted Capacitor Array for Complex Number Mac OperationsShota Konno, Che-Kai Liu, Sigang Ryu, Samuel Spetalnick, Arijit Raychowdhury2025-08-25下载A 28nm dense 6T-SRAM Digital(D)/Analog(A) Hybrid compute-in-memory (CIM) macro supporting complex num-ber MAC operation is presented. By introducing a 2D-weighted Capacitor Array, a hybrid configurati...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Managing Multi Instance GPUs for High Throughput and Energy SavingsAbhijeet Saraha, Yuanbo Li, Chris Porter, Santosh Pande2025-08-25下载Modern GPUs such as the Ampere series (A30, A100) as well as the Hopper series (H100, H200) offer performance as well as security isolation features.
Experiences with Model Context Protocol Servers for Science and High Performance ComputingHaochen Pan, Ryan Chard, Reid Mello, Christopher Grams, Tanjin He, Alexander Brace, Owen Price Skelly, Will Engler, Hayden Holbrook, Song Young Oh, Maxime Gonthier, Michael Papka, Ben Blaiszik, Kyle Chard, Ian Foster2025-08-25下载Large language model (LLM)-powered agents are increasingly used to plan and execute scientific workflows, yet most research cyberinfrastructure (CI) exposes heterogeneous APIs and implements security ...
DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and ReconstructionWeilin Cai, Le Qin, Shwai He, Junwei Cui, Ang Li, Jiayi Huang2025-08-25下载Mixture of Experts (MoE) has become a mainstream architecture for building Large Language Models (LLMs) by reducing per-token computation while enabling model scaling.
FSA: An Alternative Efficient Implementation of Native Sparse Attention KernelRan Yan, Youhe Jiang, Zhuoming Chen, Haohui Mai, Beidi Chen, Binhang Yuan2025-08-25下载Recent advance in sparse attention mechanisms has demonstrated strong potential for reducing the computational cost of long-context training and inference in large language models (LLMs).
Practical GPU Choices for Earth Observation: ResNet-50 Training Throughput on Integrated, Laptop, and Cloud AcceleratorsRitvik Chaturvedi2025-08-25下载This project implements a ResNet-based pipeline for land use and land cover (LULC) classification on Sentinel-2 imagery, benchmarked across three heterogeneous GPUs.
Wait-free Replicated Data Types and Fair ReconciliationPetr Kuznetsov, Maxence Perion, Sara Tucci-Piergiovanni2025-08-25下载Replication ensures data availability in fault-prone distributed systems. The celebrated CAP theorem stipulates that replicas cannot guarantee both strong consistency and availability under network pa...
Views: a hardware-friendly graph database model for storing semantic informationYanjun Yang, Adrian Wheeldon, Yihan Pan, Themis Prodromakis, Alex Serb2025-08-25下载The graph database (GDB) is an increasingly common storage model for data involving relationships between entries. Beyond its widespread usage in database industries, the advantages of GDBs indicate a...
Scalable Engine and the Performance of Different LLM Models in a SLURM based HPC architectureAnderson de Lima Luiz, Shubham Vijay Kurlekar, Munir Georges2025-08-25下载This work elaborates on a High performance computing (HPC) architecture based on Simple Linux Utility for Resource Management (SLURM) [1] for deploying heterogeneous Large Language Models (LLMs) into ...
ExpertWeave: Efficiently Serving Expert-Specialized Fine-Tuned Adapters at ScaleGe Shi, Hanieh Sadri, Qian Wang, Yu Zhang, Ying Xiong, Yong Zhang, Zhenan Fan2025-08-25下载Expert-Specialized Fine-Tuning (ESFT) adapts Mixture-of-Experts (MoE) large language models to enhance their task-specific performance by selectively tuning the top-activated experts for the task.
Zen-Attention: A Compiler Framework for Dynamic Attention Folding on AMD NPUsAadesh Deshmukh, Venkata Yaswanth Raparti, Samuel Hsu2025-08-25下载Transformer-based deep learning models are increasingly deployed on energy, and DRAM bandwidth constrained devices such as laptops and gaming consoles, which presents significant challenges in meeting...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Digital Twin-Guided Energy Management over Real-Time Pub/Sub Protocol in 6G Smart CitiesKubra Duran, Lal Verda Cakir, Sana Ullah Jan, Kerem Gursu, Berk Canberk2025-08-25下载Although the emergence of 6G IoT networks has accelerated the deployment of enhanced smart city services, the resource limitations of IoT devices remain as a significant problem.
Quantum Paths: a Quantum Walk approachClaudio Pellitteri, Marcello Caleffi, Angela Sara Cacciapuoti2025-08-25下载The quantum switch, a process enabling a coherent superposition of different orders of quantum channels, has garnered significant attention due to its ability to enable noiseless communications throug...
Automating Conflict-Aware ACL Configurations with Natural Language IntentsWenlong Ding, Jianqiang Li, Zhixiong Niu, Huangxun Chen, Yongqiang Xiong, Hong Xu2025-08-25下载ACL configuration is essential for managing network flow reachability, yet its complexity grows significantly with topologies and pre-existing rules.
Digital Twin Assisted Proactive Management in Zero Touch NetworksTamizhelakkiya K, Dibakar Das, Komal Sharma, Jyotsna Bapat, Debabrata Das2025-08-25下载The rapid expansion of cellular networks and rising demand for high-quality services require efficient and autonomous network management solutions.
PRZK-Bind: A Physically Rooted Zero-Knowledge Authentication Protocol for Secure Digital Twin Binding in Smart CitiesYagmur Yigit, Mehmet Ali Erturk, Kerem Gursu, Berk Canberk2025-08-25下载Digital twin (DT) technology is rapidly becoming essential for smart city ecosystems, enabling real-time synchronisation and autonomous decision-making across physical and digital domains.
Real World Assets on-Chain Assistance Low-Altitude Computility Networks: Architecture, Methodology, and ChallengesHaoxiang Luo, Ruichen Zhang, Yinqiu Liu, Gang Sun, Hongfang Yu, Zhu Han2025-08-25下载Low-altitude airspace is becoming a new frontier for smart city services and commerce. Networks of drones, electric Vertical Takeoff and Landing (eVTOL) vehicles, and other aircraft, termed Low-Altitu...
AgentRAN: An Agentic AI Architecture for Autonomous Control of Open 6G NetworksMaxime Elkael, Salvatore D'Oro, Leonardo Bonati, Michele Polese, Yunseong Lee, Koichiro Furueda, Tommaso Melodia2025-08-25下载Despite the programmable architecture of Open RAN, today's deployments still rely heavily on static control and manual operations. To move beyond this limitation, we introduce AgentRAN, an AI-native, ...
Sustainability or Survivability? Eliminating the Need to Choose in LEO Satellite ConstellationsChris Misa, Ramakrishnan Durairajan2025-08-25下载LEO Satellite Networks (LSNs) are revolutionizing global connectivity, but their reliance on tens of thousands of satellites raises pressing concerns over sustainability and survivability.
Optimizing Anonymity and Efficiency: A Critical Review of Path Selection Strategies in TorSiddique Abubakr Muntaka, Jacques Bou Abdo2025-08-25下载The Onion Router (Tor) relies on path selection algorithms to balance performance and anonymity by determining how traffic flows through its relay network.
Not All Visitors are Bilingual: A Measurement Study of the Multilingual Web from an Accessibility PerspectiveMasudul Hasan Masud Bhuiyan, Matteo Varvello, Yasir Zaki, Cristian-Alexandru Staicu2025-08-25下载English is the predominant language on the web, powering nearly half of the world's top ten million websites. Support for multilingual content is nevertheless growing, with many websites increasingly ...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Puzzle: Scheduling Multiple Deep Learning Models on Mobile Device with Heterogeneous ProcessorsDuseok Kang, Yunseong Lee, Junghoon Kim2025-08-25下载As deep learning models are increasingly deployed on mobile devices, modern mobile devices incorporate deep learning-specific accelerators to handle the growing computational demands, thus increasing ...

cs.PF - Performance

标题作者发布日期PDF摘要
Secure Password Generator Based on Secure Pseudo-Random Number GeneratorAbel C. H. Chen2025-08-25下载In recent years, numerous incidents involving the leakage of website accounts and text passwords (referred to as passwords) have raised significant concerns regarding the potential exposure of persona...
OmniSim: Simulating Hardware with C Speed and RTL Accuracy for High-Level Synthesis DesignsRishov Sarkar, Cong Hao2025-08-25下载High-Level Synthesis (HLS) is increasingly popular for hardware design using C/C++ instead of Register-Transfer Level (RTL). To express concurrent hardware behavior in a sequential language like C/C++...

基于 VitePress 构建