Appearance
2025-08-25
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| OmniSim: Simulating Hardware with C Speed and RTL Accuracy for High-Level Synthesis Designs | Rishov Sarkar, Cong Hao | 2025-08-25 | 下载 | High-Level Synthesis (HLS) is increasingly popular for hardware design using C/C++ instead of Register-Transfer Level (RTL). To express concurrent hardware behavior in a sequential language like C/C++... |
| Views: a hardware-friendly graph database model for storing semantic information | Yanjun Yang, Adrian Wheeldon, Yihan Pan, Themis Prodromakis, Alex Serb | 2025-08-25 | 下载 | The graph database (GDB) is an increasingly common storage model for data involving relationships between entries. Beyond its widespread usage in database industries, the advantages of GDBs indicate a... |
| Anatomy of the gem5 Simulator: AtomicSimpleCPU, TimingSimpleCPU, O3CPU, and Their Interaction with the Ruby Memory System | Johan Söderström, Yuan Yao | 2025-08-25 | 下载 | gem5 is a popular modular-based computer system simulator, widely used in computer architecture research and known for its long simulation time and steep learning curve. |
| Opportunities and Challenges for 3D Systems and Their Design | Philip Emma, Eren Kurshan | 2025-08-25 | 下载 | Although it is not a new concept, 3D integration increasingly receives widespread interest and focus as lithographic scaling becomes more challenging, and as the ability to make miniature vias greatly... |
| LLMulator: Generalizable Cost Modeling for Dataflow Accelerators with Input-Adaptive Control Flow | Kaiyan Chang, Wenlong Zhu, Shengwen Liang, Huawei Li, Ying Wang | 2025-08-25 | 下载 | Accurate and fast performance prediction for dataflow-based accelerators is vital for efficient hardware design and design space exploration, yet existing methods struggle to generalize across archite... |
| In-Memory Computing Enabled Deep MIMO Detection to Support Ultra-Low-Latency Communications | Tingyu Ding, Qunsong Zeng, Kaibin Huang | 2025-08-25 | 下载 | The development of sixth-generation (6G) mobile networks imposes unprecedented latency and reliability demands on multiple-input multiple-output (MIMO) communication systems, a key enabler of high-spe... |
| TLGLock: A New Approach in Logic Locking Using Key-Driven Charge Recycling in Threshold Logic Gates | Abdullah Sahruri, Martin Margala | 2025-08-25 | 下载 | Logic locking remains one of the most promising defenses against hardware piracy, yet current approaches often face challenges in scalability and design overhead. |
| Structural Mutation Based Differential Testing for FPGA Logic Synthesis Compilers | Zhihao Xu, Shikai Guo, Guilin Zhao, Siwen Wang, Qian Ma, Hui Li, Furui Zhan | 2025-08-25 | 下载 | Field Programmable Gate Arrays (FPGAs) play a crucial role in Electronic Design Automation (EDA) applications, which have been widely used in safety-critical environments, including aerospace, chip ma... |
| Characterizing the Behavior of Training Mamba-based State Space Models on GPUs | Trinayan Baruah, Kaustubh Shivdikar, Sara Prescott, David Kaeli | 2025-08-25 | 下载 | Mamba-based State Space Models (SSM) have emerged as a promising alternative to the ubiquitous transformers. Despite the expressive power of transformers, the quadratic complexity of computing attenti... |
| A 28nm 1.80Mb/mm2 Digital/Analog Hybrid SRAM-CIM Macro Using 2D-Weighted Capacitor Array for Complex Number Mac Operations | Shota Konno, Che-Kai Liu, Sigang Ryu, Samuel Spetalnick, Arijit Raychowdhury | 2025-08-25 | 下载 | A 28nm dense 6T-SRAM Digital(D)/Analog(A) Hybrid compute-in-memory (CIM) macro supporting complex num-ber MAC operation is presented. By introducing a 2D-weighted Capacitor Array, a hybrid configurati... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Managing Multi Instance GPUs for High Throughput and Energy Savings | Abhijeet Saraha, Yuanbo Li, Chris Porter, Santosh Pande | 2025-08-25 | 下载 | Modern GPUs such as the Ampere series (A30, A100) as well as the Hopper series (H100, H200) offer performance as well as security isolation features. |
| Experiences with Model Context Protocol Servers for Science and High Performance Computing | Haochen Pan, Ryan Chard, Reid Mello, Christopher Grams, Tanjin He, Alexander Brace, Owen Price Skelly, Will Engler, Hayden Holbrook, Song Young Oh, Maxime Gonthier, Michael Papka, Ben Blaiszik, Kyle Chard, Ian Foster | 2025-08-25 | 下载 | Large language model (LLM)-powered agents are increasingly used to plan and execute scientific workflows, yet most research cyberinfrastructure (CI) exposes heterogeneous APIs and implements security ... |
| DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction | Weilin Cai, Le Qin, Shwai He, Junwei Cui, Ang Li, Jiayi Huang | 2025-08-25 | 下载 | Mixture of Experts (MoE) has become a mainstream architecture for building Large Language Models (LLMs) by reducing per-token computation while enabling model scaling. |
| FSA: An Alternative Efficient Implementation of Native Sparse Attention Kernel | Ran Yan, Youhe Jiang, Zhuoming Chen, Haohui Mai, Beidi Chen, Binhang Yuan | 2025-08-25 | 下载 | Recent advance in sparse attention mechanisms has demonstrated strong potential for reducing the computational cost of long-context training and inference in large language models (LLMs). |
| Practical GPU Choices for Earth Observation: ResNet-50 Training Throughput on Integrated, Laptop, and Cloud Accelerators | Ritvik Chaturvedi | 2025-08-25 | 下载 | This project implements a ResNet-based pipeline for land use and land cover (LULC) classification on Sentinel-2 imagery, benchmarked across three heterogeneous GPUs. |
| Wait-free Replicated Data Types and Fair Reconciliation | Petr Kuznetsov, Maxence Perion, Sara Tucci-Piergiovanni | 2025-08-25 | 下载 | Replication ensures data availability in fault-prone distributed systems. The celebrated CAP theorem stipulates that replicas cannot guarantee both strong consistency and availability under network pa... |
| Views: a hardware-friendly graph database model for storing semantic information | Yanjun Yang, Adrian Wheeldon, Yihan Pan, Themis Prodromakis, Alex Serb | 2025-08-25 | 下载 | The graph database (GDB) is an increasingly common storage model for data involving relationships between entries. Beyond its widespread usage in database industries, the advantages of GDBs indicate a... |
| Scalable Engine and the Performance of Different LLM Models in a SLURM based HPC architecture | Anderson de Lima Luiz, Shubham Vijay Kurlekar, Munir Georges | 2025-08-25 | 下载 | This work elaborates on a High performance computing (HPC) architecture based on Simple Linux Utility for Resource Management (SLURM) [1] for deploying heterogeneous Large Language Models (LLMs) into ... |
| ExpertWeave: Efficiently Serving Expert-Specialized Fine-Tuned Adapters at Scale | Ge Shi, Hanieh Sadri, Qian Wang, Yu Zhang, Ying Xiong, Yong Zhang, Zhenan Fan | 2025-08-25 | 下载 | Expert-Specialized Fine-Tuning (ESFT) adapts Mixture-of-Experts (MoE) large language models to enhance their task-specific performance by selectively tuning the top-activated experts for the task. |
| Zen-Attention: A Compiler Framework for Dynamic Attention Folding on AMD NPUs | Aadesh Deshmukh, Venkata Yaswanth Raparti, Samuel Hsu | 2025-08-25 | 下载 | Transformer-based deep learning models are increasingly deployed on energy, and DRAM bandwidth constrained devices such as laptops and gaming consoles, which presents significant challenges in meeting... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Digital Twin-Guided Energy Management over Real-Time Pub/Sub Protocol in 6G Smart Cities | Kubra Duran, Lal Verda Cakir, Sana Ullah Jan, Kerem Gursu, Berk Canberk | 2025-08-25 | 下载 | Although the emergence of 6G IoT networks has accelerated the deployment of enhanced smart city services, the resource limitations of IoT devices remain as a significant problem. |
| Quantum Paths: a Quantum Walk approach | Claudio Pellitteri, Marcello Caleffi, Angela Sara Cacciapuoti | 2025-08-25 | 下载 | The quantum switch, a process enabling a coherent superposition of different orders of quantum channels, has garnered significant attention due to its ability to enable noiseless communications throug... |
| Automating Conflict-Aware ACL Configurations with Natural Language Intents | Wenlong Ding, Jianqiang Li, Zhixiong Niu, Huangxun Chen, Yongqiang Xiong, Hong Xu | 2025-08-25 | 下载 | ACL configuration is essential for managing network flow reachability, yet its complexity grows significantly with topologies and pre-existing rules. |
| Digital Twin Assisted Proactive Management in Zero Touch Networks | Tamizhelakkiya K, Dibakar Das, Komal Sharma, Jyotsna Bapat, Debabrata Das | 2025-08-25 | 下载 | The rapid expansion of cellular networks and rising demand for high-quality services require efficient and autonomous network management solutions. |
| PRZK-Bind: A Physically Rooted Zero-Knowledge Authentication Protocol for Secure Digital Twin Binding in Smart Cities | Yagmur Yigit, Mehmet Ali Erturk, Kerem Gursu, Berk Canberk | 2025-08-25 | 下载 | Digital twin (DT) technology is rapidly becoming essential for smart city ecosystems, enabling real-time synchronisation and autonomous decision-making across physical and digital domains. |
| Real World Assets on-Chain Assistance Low-Altitude Computility Networks: Architecture, Methodology, and Challenges | Haoxiang Luo, Ruichen Zhang, Yinqiu Liu, Gang Sun, Hongfang Yu, Zhu Han | 2025-08-25 | 下载 | Low-altitude airspace is becoming a new frontier for smart city services and commerce. Networks of drones, electric Vertical Takeoff and Landing (eVTOL) vehicles, and other aircraft, termed Low-Altitu... |
| AgentRAN: An Agentic AI Architecture for Autonomous Control of Open 6G Networks | Maxime Elkael, Salvatore D'Oro, Leonardo Bonati, Michele Polese, Yunseong Lee, Koichiro Furueda, Tommaso Melodia | 2025-08-25 | 下载 | Despite the programmable architecture of Open RAN, today's deployments still rely heavily on static control and manual operations. To move beyond this limitation, we introduce AgentRAN, an AI-native, ... |
| Sustainability or Survivability? Eliminating the Need to Choose in LEO Satellite Constellations | Chris Misa, Ramakrishnan Durairajan | 2025-08-25 | 下载 | LEO Satellite Networks (LSNs) are revolutionizing global connectivity, but their reliance on tens of thousands of satellites raises pressing concerns over sustainability and survivability. |
| Optimizing Anonymity and Efficiency: A Critical Review of Path Selection Strategies in Tor | Siddique Abubakr Muntaka, Jacques Bou Abdo | 2025-08-25 | 下载 | The Onion Router (Tor) relies on path selection algorithms to balance performance and anonymity by determining how traffic flows through its relay network. |
| Not All Visitors are Bilingual: A Measurement Study of the Multilingual Web from an Accessibility Perspective | Masudul Hasan Masud Bhuiyan, Matteo Varvello, Yasir Zaki, Cristian-Alexandru Staicu | 2025-08-25 | 下载 | English is the predominant language on the web, powering nearly half of the world's top ten million websites. Support for multilingual content is nevertheless growing, with many websites increasingly ... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Puzzle: Scheduling Multiple Deep Learning Models on Mobile Device with Heterogeneous Processors | Duseok Kang, Yunseong Lee, Junghoon Kim | 2025-08-25 | 下载 | As deep learning models are increasingly deployed on mobile devices, modern mobile devices incorporate deep learning-specific accelerators to handle the growing computational demands, thus increasing ... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Secure Password Generator Based on Secure Pseudo-Random Number Generator | Abel C. H. Chen | 2025-08-25 | 下载 | In recent years, numerous incidents involving the leakage of website accounts and text passwords (referred to as passwords) have raised significant concerns regarding the potential exposure of persona... |
| OmniSim: Simulating Hardware with C Speed and RTL Accuracy for High-Level Synthesis Designs | Rishov Sarkar, Cong Hao | 2025-08-25 | 下载 | High-Level Synthesis (HLS) is increasingly popular for hardware design using C/C++ instead of Register-Transfer Level (RTL). To express concurrent hardware behavior in a sequential language like C/C++... |