Appearance
2025-08-18
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction | Huynh Q. N. Vo, Md Tawsif Rahman Chowdhury, Paritosh Ramanan, Murat Yildirim, Gozde Tutuncuoglu | 2025-08-18 | 下载 | Exponential growth in global computing demand is exacerbated due to the higher-energy requirements of conventional architectures, primarily due to energy-intensive data movement. |
| AI Agents for Photonic Integrated Circuit Design Automation | Ankita Sharma, YuQi Fu, Vahid Ansari, Rishabh Iyer, Fiona Kuang, Kashish Mistry, Raisa Islam Aishy, Sara Ahmad, Joaquin Matres, Dirk R. Englund, Joyce K. S. Poon | 2025-08-18 | 下载 | We present Photonics Intelligent Design and Optimization (PhIDO), a multi-agent framework that converts natural-language photonic integrated circuit (PIC) design requests into layout mask files. |
| ViTAD: Timing Violation-Aware Debugging of RTL Code using Large Language Models | Wenhao Lv, Yingjie Xia, Xiyuan Chen, Li Kuang | 2025-08-18 | 下载 | In modern Very Large Scale Integrated (VLSI) circuit design flow, the Register-Transfer Level (RTL) stage presents a critical opportunity for timing optimization. |
| XR-NPE: High-Throughput Mixed-precision SIMD Neural Processing Engine for Extended Reality Perception Workloads | Tejas Chaudhari, Akarsh J., Tanushree Dewangan, Mukul Lokhande, Santosh Kumar Vishvakarma | 2025-08-18 | 下载 | This work proposes XR-NPE, a high-throughput Mixed-precision SIMD Neural Processing Engine, designed for extended reality (XR) perception workloads like visual inertial odometry (VIO), object classifi... |
| e-boost: Boosted E-Graph Extraction with Adaptive Heuristics and Exact Solving | Jiaqi Yin, Zhan Song, Chen Chen, Yaohui Cai, Zhiru Zhang, Cunxi Yu | 2025-08-18 | 下载 | E-graphs have attracted growing interest in many fields, particularly in logic synthesis and formal verification. E-graph extraction is a challenging NP-hard combinatorial optimization problem. |
| SecFSM: Knowledge Graph-Guided Verilog Code Generation for Secure Finite State Machines in Systems-on-Chip | Ziteng Hu, Yingjie Xia, Xiyuan Chen, Li Kuang | 2025-08-18 | 下载 | Finite State Machines (FSMs) play a critical role in implementing control logic for Systems-on-Chip (SoC). Traditionally, FSMs are implemented by hardware engineers through Verilog coding, which is of... |
| Multi-Metric Algorithmic Complexity: Beyond Asymptotic Analysis | Sergii Kavun | 2025-08-18 | 下载 | Traditional algorithm analysis treats all basic operations as equally costly, which hides significant differences in time, energy consumption, and cost between different types of computations on moder... |
| IzhiRISC-V -- a RISC-V-based Processor with Custom ISA Extension for Spiking Neuron Networks Processing with Izhikevich Neurons | Wiktor J. Szczerek, Artur Podobas | 2025-08-18 | 下载 | Spiking Neural Network processing promises to provide high energy efficiency due to the sparsity of the spiking events. However, when realized on general-purpose hardware -- such as a RISC-V processor... |
| Sub-Millisecond Event-Based Eye Tracking on a Resource-Constrained Microcontroller | Marco Giordano, Pietro Bonazzi, Luca Benini, Michele Magno | 2025-08-18 | 下载 | This paper presents a novel event-based eye-tracking system deployed on a resource-constrained microcontroller, addressing the challenges of real-time, low-latency, and low-power performance in embedd... |
| HOMI: Ultra-Fast EdgeAI platform for Event Cameras | Shankaranarayanan H, Satyapreet Singh Yadav, Adithya Krishna, Ajay Vikram P, Mahesh Mehendale, Chetan Singh Thakur | 2025-08-18 | 下载 | Event cameras offer significant advantages for edge robotics applications due to their asynchronous operation and sparse, event-driven output, making them well-suited for tasks requiring fast and effi... |
| MemorySim: An RTL-level, timing accurate simulator model for the Chisel ecosystem | Ansh Chaurasia | 2025-08-18 | 下载 | The rapid growth of AI applications has driven increased demand for specialized AI hardware, highlighting critical opportunities within the memory subsystem, which often serves as a performance bottle... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Towards Integrated Energy-Communication-Transportation Hub: A Base-Station-Centric Design in 5G and Beyond | Linfeng Shen, Guanzhen Wu, Cong Zhang, Xiaoyi Fan, Jiangchuan Liu | 2025-08-18 | 下载 | The rise of 5G communication has transformed the telecom industry for critical applications. With the widespread deployment of 5G base stations comes a significant concern about energy consumption. |
| Optimizing Allreduce Operations for Modern Heterogeneous Architectures with Multiple Processes per GPU | Michael Adams, Amanda Bienz | 2025-08-18 | 下载 | Large inter-GPU all-reduce operations, prevalent throughout deep learning, are bottlenecked by communication costs. Emerging heterogeneous architectures are comprised of complex nodes, often containin... |
| OrbitChain: Orchestrating In-orbit Real-time Analytics of Earth Observation Data | Zhouyu Li, Zhijin Yang, Huayue Gu, Xiaojian Wang, Yuchen Liu, Ruozhou Yu | 2025-08-18 | 下载 | Earth observation analytics have the potential to transform many sectors. However, due to limited ground connections, it currently takes hours to days to download and analyze Earth observation data, d... |
| Persistent and Partitioned MPI for Stencil Communication | Gerald Collom, Jason Burmark, Olga Pearce, Amanda Bienz | 2025-08-18 | 下载 | Many parallel applications rely on iterative stencil operations, whose performance are dominated by communication costs at large scales. Several MPI optimizations, such as persistent and partitioned c... |
| X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms | Yueming Yuan, Ahan Gupta, Jianping Li, Sajal Dash, Feiyi Wang, Minjia Zhang | 2025-08-18 | 下载 | Emerging expert-specialized Mixture-of-Experts (MoE) architectures, such as DeepSeek-MoE, deliver strong model quality through fine-grained expert segmentation and large top-k routing. |
| Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction | Huynh Q. N. Vo, Md Tawsif Rahman Chowdhury, Paritosh Ramanan, Murat Yildirim, Gozde Tutuncuoglu | 2025-08-18 | 下载 | Exponential growth in global computing demand is exacerbated due to the higher-energy requirements of conventional architectures, primarily due to energy-intensive data movement. |
| Team Formation and Applications | Yuval Emek, Shay Kutten, Ido Rafael, Gadi Taubenfeld | 2025-08-18 | 下载 | A novel long-lived distributed problem, called Team Formation (TF), is introduced together with a message- and time-efficient randomized algorithm. |
| Congested Clique Counting for Local Gibbs Distributions | Joshua Z. Sobel | 2025-08-18 | 下载 | There are well established reductions between combinatorial sampling and counting problems (Jerrum, Valiant, Vazirani TCS 1986). Building off of a very recent parallel algorithm utilizing this connect... |
| Beyond Trade-offs: A Unified Framework for Privacy, Robustness, and Communication Efficiency in Federated Learning | Yue Xia, Tayyebeh Jahani-Nezhad, Rawad Bitar | 2025-08-18 | 下载 | We propose Fed-DPRoC, a novel federated learning framework designed to jointly provide differential privacy (DP), Byzantine robustness, and communication efficiency. |
| WANify: Gauging and Balancing Runtime WAN Bandwidth for Geo-distributed Data Analytics | Anshuman Das Mohapatra, Kwangsung Oh | 2025-08-18 | 下载 | Accurate wide area network (WAN) bandwidth (BW) is essential for geo-distributed data analytics (GDA) systems to make optimal decisions such as data and task placement to improve performance. |
| Accelerating Edge Inference for Distributed MoE Models with Latency-Optimized Expert Placement | Tian Wu, Liming Wang, Zijian Wen, Xiaoxi Zhang, Jingpu Duan, Xianwei Zhang, Jinhang Zuo | 2025-08-18 | 下载 | The emergence of Mixture-of-Experts (MoE) has transformed the scaling of large language models by enabling vast model capacity through sparse activation. |
| Federated Action Recognition for Smart Worker Assistance Using FastPose | Vinit Hegiste, Vidit Goyal, Tatjana Legler, Martin Ruskowski | 2025-08-18 | 下载 | In smart manufacturing environments, accurate and real-time recognition of worker actions is essential for productivity, safety, and human-machine collaboration. |
| Dissecting CPU-GPU Unified Physical Memory on AMD MI300A APUs | Jacob Wahlgren, Gabin Schieffer, Ruimin Shi, Edgar A. León, Roger Pearce, Maya Gokhale, Ivy Peng | 2025-08-18 | 下载 | Discrete GPUs are a cornerstone of HPC and data center systems, requiring management of separate CPU and GPU memory spaces. Unified Virtual Memory (UVM) has been proposed to ease the burden of memory ... |
| DIT: Dimension Reduction View on Optimal NFT Rarity Meters | Dmitry Belousov, Yury Yanovich | 2025-08-18 | 下载 | Non-fungible tokens (NFTs) have become a significant digital asset class, each uniquely representing virtual entities such as artworks. These tokens are stored in collections within smart contracts an... |
| Data-driven Trust Bootstrapping for Mobile Edge Computing-based Industrial IoT Services | Prabath Abeysekara, Hai Dong | 2025-08-18 | 下载 | We propose a data-driven and context-aware approach to bootstrap trustworthiness of homogeneous Internet of Things (IoT) services in Mobile Edge Computing (MEC) based industrial IoT (IIoT) systems. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Towards Integrated Energy-Communication-Transportation Hub: A Base-Station-Centric Design in 5G and Beyond | Linfeng Shen, Guanzhen Wu, Cong Zhang, Xiaoyi Fan, Jiangchuan Liu | 2025-08-18 | 下载 | The rise of 5G communication has transformed the telecom industry for critical applications. With the widespread deployment of 5G base stations comes a significant concern about energy consumption. |
| OrbitChain: Orchestrating In-orbit Real-time Analytics of Earth Observation Data | Zhouyu Li, Zhijin Yang, Huayue Gu, Xiaojian Wang, Yuchen Liu, Ruozhou Yu | 2025-08-18 | 下载 | Earth observation analytics have the potential to transform many sectors. However, due to limited ground connections, it currently takes hours to days to download and analyze Earth observation data, d... |
| SL-ACC: A Communication-Efficient Split Learning Framework with Adaptive Channel-wise Compression | Zehang Lin, Zheng Lin, Miao Yang, Jianhao Huang, Yuxin Zhang, Zihan Fang, Xia Du, Zhe Chen, Shunzhi Zhu, Wei Ni | 2025-08-18 | 下载 | The increasing complexity of neural networks poses a significant barrier to the deployment of distributed machine learning (ML) on resource-constrained devices, such as federated learning (FL). |
| REACH: Reinforcement Learning for Efficient Allocation in Community and Heterogeneous Networks | Zhiwei Yu, Chengze Du, Heng Xu, Ying Zhou, Bo Liu, Jialong Li | 2025-08-18 | 下载 | Community GPU platforms are emerging as a cost-effective and democratized alternative to centralized GPU clusters for AI workloads, aggregating idle consumer GPUs from globally distributed and heterog... |
| RoTO: Robust Topology Obfuscation Against Tomography Inference Attacks | Chengze Du, Heng Xu, Zhiwei Yu, Ying Zhou, Zili Meng, Jialong Li | 2025-08-18 | 下载 | Tomography inference attacks aim to reconstruct network topology by analyzing end-to-end probe delays. Existing defenses mitigate these attacks by manipulating probe delays to mislead inference, but r... |
| SDAP-based QoS Flow Multiplexing Support in Simu5G for 5G NR Simulation | Mohamed Seliem, Utz Roedig, Cormac Sreenan, Dirk Pesch | 2025-08-18 | 下载 | The Service Data Adaptation Protocol (SDAP) plays a central role in 5G New Radio (NR), acting as a bridge between the core and radio networks, by enabling QoS Flow multiplexing over shared Data Radio ... |
| Some optimization possibilities in data plane programming | Altangerel Gereltsetseg, Tejfel Máté | 2025-08-18 | 下载 | Software-defined networking (SDN) technology aims to create a highly flexible network by decoupling control plane and the data plane and programming them independently. |
| Cooperative Sensing-Assisted Predictive Beam Tracking for MIMO-OFDM Networked ISAC Systems | Xiaoyu Yang, Zhiqing Wei, Jie Xu, Huici Wu, Zhiyong Feng | 2025-08-18 | 下载 | This paper studies a multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) networked integrated sensing and communication (ISAC) system, in which multiple base statio... |
| Towards Nomadic 6G Communication Networks: Implications on Architecture, Standardization, and Regulatory Aspects | Daniel Lindenschmitt, Marcos Rates Crippa, Hans D. Schotten | 2025-08-18 | 下载 | The emergence of nomadic mobile communication networks for sixth-generation (6G) introduces a paradigm shift in how network infrastructure is conceptualized, deployed, and operated. |
| Game-Theoretic and Reinforcement Learning-Based Cluster Head Selection for Energy-Efficient Wireless Sensor Network | Mehrshad Eskandarpour, Saba Pirahmadian, Parham Soltani, Hossein Soleimani | 2025-08-18 | 下载 | Energy in Wireless Sensor Networks (WSNs) is critical to network lifetime and data delivery. However, the primary impediment to the durability and dependability of these sensor nodes is their short ba... |
| An Efficient and Adaptive Framework for Achieving Underwater High-performance Maintenance Networks | Yu Gou, Tong Zhang, Jun Liu, Zhongyang Qi, Dezhi Zheng | 2025-08-18 | 下载 | With the development of space-air-ground-aqua integrated networks (SAGAIN), high-speed and reliable network services are accessible at any time and any location. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| OS-R1: Agentic Operating System Kernel Tuning with Reinforcement Learning | Hongyu Lin, Yuchen Li, Haoran Luo, Kaichun Yao, Libo Zhang, Mingjie Xing, Yanjun Wu | 2025-08-18 | 下载 | Linux kernel tuning is essential for optimizing operating system (OS) performance. However, existing methods often face challenges in terms of efficiency, scalability, and generalization. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction | Huynh Q. N. Vo, Md Tawsif Rahman Chowdhury, Paritosh Ramanan, Murat Yildirim, Gozde Tutuncuoglu | 2025-08-18 | 下载 | Exponential growth in global computing demand is exacerbated due to the higher-energy requirements of conventional architectures, primarily due to energy-intensive data movement. |
| Hierarchical Evaluation Function: A Multi-Metric Approach for Optimizing Demand Forecasting Models | Adolfo González, Víctor Parada | 2025-08-18 | 下载 | Demand forecasting in competitive, uncertain business environments requires models that can integrate multiple evaluation perspectives rather than being restricted to hyperparameter optimization based... |
| Multi-Metric Algorithmic Complexity: Beyond Asymptotic Analysis | Sergii Kavun | 2025-08-18 | 下载 | Traditional algorithm analysis treats all basic operations as equally costly, which hides significant differences in time, energy consumption, and cost between different types of computations on moder... |
| SYCL for Energy-Efficient Numerical Astrophysics: the case of DPEcho | Salvatore Cielo, Alexander Pöppl, Ivan Pribec | 2025-08-18 | 下载 | Energy awareness and efficiency policies are gaining more attention, over pure performance (time-to-solution) Key Performance Indicators (KPIs) when comparing the possibilities offered by accelerated ... |
| Dissecting CPU-GPU Unified Physical Memory on AMD MI300A APUs | Jacob Wahlgren, Gabin Schieffer, Ruimin Shi, Edgar A. León, Roger Pearce, Maya Gokhale, Ivy Peng | 2025-08-18 | 下载 | Discrete GPUs are a cornerstone of HPC and data center systems, requiring management of separate CPU and GPU memory spaces. Unified Virtual Memory (UVM) has been proposed to ease the burden of memory ... |