2025-08-18

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction | Huynh Q. N. Vo, Md Tawsif Rahman Chowdhury, Paritosh Ramanan, Murat Yildirim, Gozde Tutuncuoglu | 2025-08-18 | Download | Exponential growth in global computing demand is exacerbated due to the higher-energy requirements of conventional architectures, primarily due to energy-intensive data movement. |
| AI Agents for Photonic Integrated Circuit Design Automation | Ankita Sharma, YuQi Fu, Vahid Ansari, Rishabh Iyer, Fiona Kuang, Kashish Mistry, Raisa Islam Aishy, Sara Ahmad, Joaquin Matres, Dirk R. Englund, Joyce K. S. Poon | 2025-08-18 | Download | We present Photonics Intelligent Design and Optimization (PhIDO), a multi-agent framework that converts natural-language photonic integrated circuit (PIC) design requests into layout mask files. |
| ViTAD: Timing Violation-Aware Debugging of RTL Code using Large Language Models | Wenhao Lv, Yingjie Xia, Xiyuan Chen, Li Kuang | 2025-08-18 | Download | In modern Very Large Scale Integrated (VLSI) circuit design flow, the Register-Transfer Level (RTL) stage presents a critical opportunity for timing optimization. |
| XR-NPE: High-Throughput Mixed-precision SIMD Neural Processing Engine for Extended Reality Perception Workloads | Tejas Chaudhari, Akarsh J., Tanushree Dewangan, Mukul Lokhande, Santosh Kumar Vishvakarma | 2025-08-18 | Download | This work proposes XR-NPE, a high-throughput Mixed-precision SIMD Neural Processing Engine, designed for extended reality (XR) perception workloads like visual inertial odometry (VIO), object classifi... |
| e-boost: Boosted E-Graph Extraction with Adaptive Heuristics and Exact Solving | Jiaqi Yin, Zhan Song, Chen Chen, Yaohui Cai, Zhiru Zhang, Cunxi Yu | 2025-08-18 | Download | E-graphs have attracted growing interest in many fields, particularly in logic synthesis and formal verification. E-graph extraction is a challenging NP-hard combinatorial optimization problem. |
| SecFSM: Knowledge Graph-Guided Verilog Code Generation for Secure Finite State Machines in Systems-on-Chip | Ziteng Hu, Yingjie Xia, Xiyuan Chen, Li Kuang | 2025-08-18 | Download | Finite State Machines (FSMs) play a critical role in implementing control logic for Systems-on-Chip (SoC). Traditionally, FSMs are implemented by hardware engineers through Verilog coding, which is of... |
| Multi-Metric Algorithmic Complexity: Beyond Asymptotic Analysis | Sergii Kavun | 2025-08-18 | Download | Traditional algorithm analysis treats all basic operations as equally costly, which hides significant differences in time, energy consumption, and cost between different types of computations on moder... |
| IzhiRISC-V -- a RISC-V-based Processor with Custom ISA Extension for Spiking Neuron Networks Processing with Izhikevich Neurons | Wiktor J. Szczerek, Artur Podobas | 2025-08-18 | Download | Spiking Neural Network processing promises to provide high energy efficiency due to the sparsity of the spiking events. However, when realized on general-purpose hardware -- such as a RISC-V processor... |
| Sub-Millisecond Event-Based Eye Tracking on a Resource-Constrained Microcontroller | Marco Giordano, Pietro Bonazzi, Luca Benini, Michele Magno | 2025-08-18 | Download | This paper presents a novel event-based eye-tracking system deployed on a resource-constrained microcontroller, addressing the challenges of real-time, low-latency, and low-power performance in embedd... |
| HOMI: Ultra-Fast EdgeAI platform for Event Cameras | Shankaranarayanan H, Satyapreet Singh Yadav, Adithya Krishna, Ajay Vikram P, Mahesh Mehendale, Chetan Singh Thakur | 2025-08-18 | Download | Event cameras offer significant advantages for edge robotics applications due to their asynchronous operation and sparse, event-driven output, making them well-suited for tasks requiring fast and effi... |
| MemorySim: An RTL-level, timing accurate simulator model for the Chisel ecosystem | Ansh Chaurasia | 2025-08-18 | Download | The rapid growth of AI applications has driven increased demand for specialized AI hardware, highlighting critical opportunities within the memory subsystem, which often serves as a performance bottle... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Towards Integrated Energy-Communication-Transportation Hub: A Base-Station-Centric Design in 5G and Beyond | Linfeng Shen, Guanzhen Wu, Cong Zhang, Xiaoyi Fan, Jiangchuan Liu | 2025-08-18 | Download | The rise of 5G communication has transformed the telecom industry for critical applications. With the widespread deployment of 5G base stations comes a significant concern about energy consumption. |
| Optimizing Allreduce Operations for Modern Heterogeneous Architectures with Multiple Processes per GPU | Michael Adams, Amanda Bienz | 2025-08-18 | Download | Large inter-GPU all-reduce operations, prevalent throughout deep learning, are bottlenecked by communication costs. Emerging heterogeneous architectures are comprised of complex nodes, often containin... |
| OrbitChain: Orchestrating In-orbit Real-time Analytics of Earth Observation Data | Zhouyu Li, Zhijin Yang, Huayue Gu, Xiaojian Wang, Yuchen Liu, Ruozhou Yu | 2025-08-18 | Download | Earth observation analytics have the potential to transform many sectors. However, due to limited ground connections, it currently takes hours to days to download and analyze Earth observation data, d... |
| Persistent and Partitioned MPI for Stencil Communication | Gerald Collom, Jason Burmark, Olga Pearce, Amanda Bienz | 2025-08-18 | Download | Many parallel applications rely on iterative stencil operations, whose performance are dominated by communication costs at large scales. Several MPI optimizations, such as persistent and partitioned c... |
| X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms | Yueming Yuan, Ahan Gupta, Jianping Li, Sajal Dash, Feiyi Wang, Minjia Zhang | 2025-08-18 | Download | Emerging expert-specialized Mixture-of-Experts (MoE) architectures, such as DeepSeek-MoE, deliver strong model quality through fine-grained expert segmentation and large top-k routing. |
| Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction | Huynh Q. N. Vo, Md Tawsif Rahman Chowdhury, Paritosh Ramanan, Murat Yildirim, Gozde Tutuncuoglu | 2025-08-18 | Download | Exponential growth in global computing demand is exacerbated due to the higher-energy requirements of conventional architectures, primarily due to energy-intensive data movement. |
| Team Formation and Applications | Yuval Emek, Shay Kutten, Ido Rafael, Gadi Taubenfeld | 2025-08-18 | Download | A novel long-lived distributed problem, called Team Formation (TF), is introduced together with a message- and time-efficient randomized algorithm. |
| Congested Clique Counting for Local Gibbs Distributions | Joshua Z. Sobel | 2025-08-18 | Download | There are well established reductions between combinatorial sampling and counting problems (Jerrum, Valiant, Vazirani TCS 1986). Building off of a very recent parallel algorithm utilizing this connect... |
| Beyond Trade-offs: A Unified Framework for Privacy, Robustness, and Communication Efficiency in Federated Learning | Yue Xia, Tayyebeh Jahani-Nezhad, Rawad Bitar | 2025-08-18 | Download | We propose Fed-DPRoC, a novel federated learning framework designed to jointly provide differential privacy (DP), Byzantine robustness, and communication efficiency. |
| WANify: Gauging and Balancing Runtime WAN Bandwidth for Geo-distributed Data Analytics | Anshuman Das Mohapatra, Kwangsung Oh | 2025-08-18 | Download | Accurate wide area network (WAN) bandwidth (BW) is essential for geo-distributed data analytics (GDA) systems to make optimal decisions such as data and task placement to improve performance. |
| Accelerating Edge Inference for Distributed MoE Models with Latency-Optimized Expert Placement | Tian Wu, Liming Wang, Zijian Wen, Xiaoxi Zhang, Jingpu Duan, Xianwei Zhang, Jinhang Zuo | 2025-08-18 | Download | The emergence of Mixture-of-Experts (MoE) has transformed the scaling of large language models by enabling vast model capacity through sparse activation. |
| Federated Action Recognition for Smart Worker Assistance Using FastPose | Vinit Hegiste, Vidit Goyal, Tatjana Legler, Martin Ruskowski | 2025-08-18 | Download | In smart manufacturing environments, accurate and real-time recognition of worker actions is essential for productivity, safety, and human-machine collaboration. |
| Dissecting CPU-GPU Unified Physical Memory on AMD MI300A APUs | Jacob Wahlgren, Gabin Schieffer, Ruimin Shi, Edgar A. León, Roger Pearce, Maya Gokhale, Ivy Peng | 2025-08-18 | Download | Discrete GPUs are a cornerstone of HPC and data center systems, requiring management of separate CPU and GPU memory spaces. Unified Virtual Memory (UVM) has been proposed to ease the burden of memory ... |
| DIT: Dimension Reduction View on Optimal NFT Rarity Meters | Dmitry Belousov, Yury Yanovich | 2025-08-18 | Download | Non-fungible tokens (NFTs) have become a significant digital asset class, each uniquely representing virtual entities such as artworks. These tokens are stored in collections within smart contracts an... |
| Data-driven Trust Bootstrapping for Mobile Edge Computing-based Industrial IoT Services | Prabath Abeysekara, Hai Dong | 2025-08-18 | Download | We propose a data-driven and context-aware approach to bootstrap trustworthiness of homogeneous Internet of Things (IoT) services in Mobile Edge Computing (MEC) based industrial IoT (IIoT) systems. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Towards Integrated Energy-Communication-Transportation Hub: A Base-Station-Centric Design in 5G and Beyond | Linfeng Shen, Guanzhen Wu, Cong Zhang, Xiaoyi Fan, Jiangchuan Liu | 2025-08-18 | Download | The rise of 5G communication has transformed the telecom industry for critical applications. With the widespread deployment of 5G base stations comes a significant concern about energy consumption. |
| OrbitChain: Orchestrating In-orbit Real-time Analytics of Earth Observation Data | Zhouyu Li, Zhijin Yang, Huayue Gu, Xiaojian Wang, Yuchen Liu, Ruozhou Yu | 2025-08-18 | Download | Earth observation analytics have the potential to transform many sectors. However, due to limited ground connections, it currently takes hours to days to download and analyze Earth observation data, d... |
| SL-ACC: A Communication-Efficient Split Learning Framework with Adaptive Channel-wise Compression | Zehang Lin, Zheng Lin, Miao Yang, Jianhao Huang, Yuxin Zhang, Zihan Fang, Xia Du, Zhe Chen, Shunzhi Zhu, Wei Ni | 2025-08-18 | Download | The increasing complexity of neural networks poses a significant barrier to the deployment of distributed machine learning (ML) on resource-constrained devices, such as federated learning (FL). |
| REACH: Reinforcement Learning for Efficient Allocation in Community and Heterogeneous Networks | Zhiwei Yu, Chengze Du, Heng Xu, Ying Zhou, Bo Liu, Jialong Li | 2025-08-18 | Download | Community GPU platforms are emerging as a cost-effective and democratized alternative to centralized GPU clusters for AI workloads, aggregating idle consumer GPUs from globally distributed and heterog... |
| RoTO: Robust Topology Obfuscation Against Tomography Inference Attacks | Chengze Du, Heng Xu, Zhiwei Yu, Ying Zhou, Zili Meng, Jialong Li | 2025-08-18 | Download | Tomography inference attacks aim to reconstruct network topology by analyzing end-to-end probe delays. Existing defenses mitigate these attacks by manipulating probe delays to mislead inference, but r... |
| SDAP-based QoS Flow Multiplexing Support in Simu5G for 5G NR Simulation | Mohamed Seliem, Utz Roedig, Cormac Sreenan, Dirk Pesch | 2025-08-18 | Download | The Service Data Adaptation Protocol (SDAP) plays a central role in 5G New Radio (NR), acting as a bridge between the core and radio networks, by enabling QoS Flow multiplexing over shared Data Radio ... |
| Some optimization possibilities in data plane programming | Altangerel Gereltsetseg, Tejfel Máté | 2025-08-18 | Download | Software-defined networking (SDN) technology aims to create a highly flexible network by decoupling control plane and the data plane and programming them independently. |
| Cooperative Sensing-Assisted Predictive Beam Tracking for MIMO-OFDM Networked ISAC Systems | Xiaoyu Yang, Zhiqing Wei, Jie Xu, Huici Wu, Zhiyong Feng | 2025-08-18 | Download | This paper studies a multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) networked integrated sensing and communication (ISAC) system, in which multiple base statio... |
| Towards Nomadic 6G Communication Networks: Implications on Architecture, Standardization, and Regulatory Aspects | Daniel Lindenschmitt, Marcos Rates Crippa, Hans D. Schotten | 2025-08-18 | Download | The emergence of nomadic mobile communication networks for sixth-generation (6G) introduces a paradigm shift in how network infrastructure is conceptualized, deployed, and operated. |
| Game-Theoretic and Reinforcement Learning-Based Cluster Head Selection for Energy-Efficient Wireless Sensor Network | Mehrshad Eskandarpour, Saba Pirahmadian, Parham Soltani, Hossein Soleimani | 2025-08-18 | Download | Energy in Wireless Sensor Networks (WSNs) is critical to network lifetime and data delivery. However, the primary impediment to the durability and dependability of these sensor nodes is their short ba... |
| An Efficient and Adaptive Framework for Achieving Underwater High-performance Maintenance Networks | Yu Gou, Tong Zhang, Jun Liu, Zhongyang Qi, Dezhi Zheng | 2025-08-18 | Download | With the development of space-air-ground-aqua integrated networks (SAGAIN), high-speed and reliable network services are accessible at any time and any location. |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| OS-R1: Agentic Operating System Kernel Tuning with Reinforcement Learning | Hongyu Lin, Yuchen Li, Haoran Luo, Kaichun Yao, Libo Zhang, Mingjie Xing, Yanjun Wu | 2025-08-18 | Download | Linux kernel tuning is essential for optimizing operating system (OS) performance. However, existing methods often face challenges in terms of efficiency, scalability, and generalization. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Harnessing the Full Potential of RRAMs through Scalable and Distributed In-Memory Computing with Integrated Error Correction | Huynh Q. N. Vo, Md Tawsif Rahman Chowdhury, Paritosh Ramanan, Murat Yildirim, Gozde Tutuncuoglu | 2025-08-18 | Download | Exponential growth in global computing demand is exacerbated due to the higher-energy requirements of conventional architectures, primarily due to energy-intensive data movement. |
| Hierarchical Evaluation Function: A Multi-Metric Approach for Optimizing Demand Forecasting Models | Adolfo González, Víctor Parada | 2025-08-18 | Download | Demand forecasting in competitive, uncertain business environments requires models that can integrate multiple evaluation perspectives rather than being restricted to hyperparameter optimization based... |
| Multi-Metric Algorithmic Complexity: Beyond Asymptotic Analysis | Sergii Kavun | 2025-08-18 | Download | Traditional algorithm analysis treats all basic operations as equally costly, which hides significant differences in time, energy consumption, and cost between different types of computations on moder... |
| SYCL for Energy-Efficient Numerical Astrophysics: the case of DPEcho | Salvatore Cielo, Alexander Pöppl, Ivan Pribec | 2025-08-18 | Download | Energy awareness and efficiency policies are gaining more attention, over pure performance (time-to-solution) Key Performance Indicators (KPIs) when comparing the possibilities offered by accelerated ... |
| Dissecting CPU-GPU Unified Physical Memory on AMD MI300A APUs | Jacob Wahlgren, Gabin Schieffer, Ruimin Shi, Edgar A. León, Roger Pearce, Maya Gokhale, Ivy Peng | 2025-08-18 | Download | Discrete GPUs are a cornerstone of HPC and data center systems, requiring management of separate CPU and GPU memory spaces. Unified Virtual Memory (UVM) has been proposed to ease the burden of memory ... |
