2025-11-16
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Physics-Constrained Adaptive Neural Networks Enable Real-Time Semiconductor Manufacturing Optimization with Minimal Training Data | Rubén Darío Guerrero | 2025-11-16 | Download | The semiconductor industry faces a computational crisis in extreme ultraviolet (EUV) lithography optimization, where traditional methods consume billions of CPU hours while failing to achieve sub-nano... |
| On the Excitability of Ultra-Low-Power CMOS Analog Spiking Neurons | Léopold Van Brandt, Grégoire Brandsteert, Denis Flandre | 2025-11-16 | Download | The excitability property of spiking neurons describes their capability to output an action potential as a real-time response to an input synaptic excitation current and is central to the event-based ... |
| SynapticCore-X: A Modular Neural Processing Architecture for Low-Cost FPGA Acceleration | Arya Parameshwara | 2025-11-16 | Download | This paper presents SynapticCore-X, a modular and resource-efficient neural processing architecture optimized for deployment on low-cost FPGA platforms. |
| SetupKit: Efficient Multi-Corner Setup/Hold Time Characterization Using Bias-Enhanced Interpolation and Active Learning | Junzhuo Zhou, Ziwen Wang, Haoxuan Xia, Yuxin Yan, Chengyu Zhu, Ting-Jung Lin, Wei Xing, Lei He | 2025-11-16 | Download | Accurate setup/hold time characterization is crucial for modern chip timing closure, but its reliance on potentially millions of SPICE simulations across diverse process-voltage-temperature (PVT) corne... |
| FERMI-ML: A Flexible and Resource-Efficient Memory-In-Situ SRAM Macro for TinyML acceleration | Mukul Lokhande, Akash Sankhe, S. V. Jaya Chand, Santosh Kumar Vishvakarma | 2025-11-16 | Download | The growing demand for low-power and area-efficient TinyML inference on AIoT devices necessitates memory architectures that minimise data movement while sustaining high computational efficiency. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| A Closer Look at Personalized Fine-Tuning in Heterogeneous Federated Learning | Minghui Chen, Hrad Ghoukasian, Ruinan Jin, Zehua Wang, Sai Praneeth Karimireddy, Xiaoxiao Li | 2025-11-16 | Download | Federated Learning (FL) enables decentralized, privacy-preserving model training but struggles to balance global generalization and local personalization due to non-identical data distributions across... |
| The Time to Consensus in a Blockchain: Insights into Bitcoin's "6 Blocks Rule" | Partha S. Dey, Aditya S. Gopalan, Vijay G. Subramanian | 2025-11-16 | Download | We investigate the time to consensus in Nakamoto blockchains. Specifically, we consider two competing growth processes, labeled *honest* and *adversarial*, and determine the time after which... |
| Artifact for A Non-Intrusive Framework for Deferred Integration of Cloud Patterns in Energy-Efficient Data-Sharing Pipelines | Sepideh Masoudi, Mark Edward Michael Daly, Jannis Kiesel | 2025-11-16 | Download | As data mesh architectures grow, organizations increasingly build consumer-specific data-sharing pipelines from modular, cloud-based transformation services. |
| QPU Micro-Kernels for Stencil Computation | Stefano Markidis, Luca Pennati, Marco Pasquale, Gilbert Netzer, Ivy Peng | 2025-11-16 | Download | We introduce QPU micro-kernels: shallow quantum circuits that perform a stencil node update and return a Monte Carlo estimate from repeated measurements. |
| Asynchronous Cooperative Optimization of a Capacitated Vehicle Routing Problem Solution | Luca Accorsi, Demetrio Laganà, Federico Michelotto, Roberto Musmanno, Daniele Vigo | 2025-11-16 | Download | We propose a parallel shared-memory schema to cooperatively optimize the solution of a Capacitated Vehicle Routing Problem instance with minimal synchronization effort and without the need for an expl... |
| Iris: First-Class Multi-GPU Programming Experience in Triton | Muhammad Awad, Muhammad Osama, Brandon Potter | 2025-11-16 | Download | Multi-GPU programming traditionally requires developers to navigate complex trade-offs between performance and programmability. High-performance implementations typically rely on low-level HIP/CUDA co... |
| A Decentralized Root Cause Localization Approach for Edge Computing Environments | Duneesha Fernando, Maria A. Rodriguez, Rajkumar Buyya | 2025-11-16 | Download | Edge computing environments host increasingly complex microservice-based IoT applications, which are prone to performance anomalies that can propagate across dependent services. |
| Design of A Low-Latency and Parallelizable SVD Dataflow Architecture on FPGA | Fangqiang Du, Sixuan Chong, Zixuan Huang, Rui Qin, Fengnan Mi, Caibao Hu, Jiangang Chen | 2025-11-16 | Download | Singular value decomposition (SVD) is widely used for dimensionality reduction and noise suppression, and it plays a pivotal role in numerous scientific and engineering applications. |
| SEE++: Evolving Snowpark Execution Environment for Modern Workloads | Gaurav Jain, Brandon Baker, Joe Yin, Chenwei Xie, Zihao Ye, Sidh Kulkarni, Sara Abdelrahman, Nova Qi, Urjeet Shrestha, Mike Halcrow, Dave Bailey, Yuxiong He | 2025-11-16 | Download | Snowpark enables Data Engineering and AI/ML workloads to run directly within Snowflake by deploying a secure sandbox on virtual warehouse nodes. |
| Semantic Multiplexing | Mohammad Abdi, Francesca Meneghello, Francesco Restuccia | 2025-11-16 | Download | Mobile devices increasingly require the parallel execution of several computing tasks offloaded at the wireless edge. Existing communication systems only support parallel transmissions at the bit leve... |
| Guaranteed DGEMM Accuracy While Using Reduced Precision Tensor Cores Through Extensions of the Ozaki Scheme | Angelika Schwarz, Anton Anders, Cole Brower, Harun Bayraktar, John Gunnels, Kate Clark, RuQing G. Xu, Samuel Rodriguez, Sebastien Cayrols, Paweł Tabaszewski, Victor Podlozhnyuk | 2025-11-16 | Download | The rapid growth of artificial intelligence (AI) has made low-precision formats such as FP16, FP8, and, most recently, block-scaled FP4 the primary focus of modern GPUs, where Tensor Cores now deliver... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Distributed Pulse-Wave Simulator for DDoS Dataset Generation | Karim Khamaisi, Pascal Kiechl, Katharina Müller, Burkhard Stiller, Bruno Rodrigues | 2025-11-16 | Download | Pulse-wave Distributed Denial-of-Service (DDoS) attacks generate short, synchronized bursts of traffic that circumvent pattern-based detection and quickly exhaust traditional defense systems. |
| CareNet: Linking Home-router Network Traffic to DSM-5 Depressive Behavior Indicators | Stephan Nef, Bruno Rodrigues | 2025-11-16 | Download | Digital mental-health sensing increasingly depends on mobile or wearable devices that require intrusive permissions and continuous user compliance. |
| Cybersecurity of High-Altitude Platform Stations: Threat Taxonomy, Attacks and Defenses with Standards Mapping - DDoS Attack Use Case | Chaouki Hjaiji, Bassem Ouni, Mohamed-Slim Alouini | 2025-11-16 | Download | High-Altitude Platform Stations (HAPS) are emerging stratospheric nodes within non-terrestrial networks. We provide a structured overview of HAPS subsystems and principal communication links, map cybe... |
| Adaptive Dual-Layer Web Application Firewall (ADL-WAF) Leveraging Machine Learning for Enhanced Anomaly and Threat Detection | Ahmed Sameh, Sahar Selim | 2025-11-16 | Download | Web Application Firewalls are crucial for protecting web applications against a wide range of cyber threats. Traditional Web Application Firewalls often struggle to effectively distinguish between mal... |
| Collaborative Charging Optimization for Wireless Rechargeable Sensor Networks via Heterogeneous Mobile Chargers | Jianhang Yao, Hui Kang, Geng Sun, Jiahui Li, Hongjuan Li, Jiacheng Wang, Yinqiu Liu, Dusit Niyato | 2025-11-16 | Download | Despite the rapid proliferation of Internet of Things applications driving widespread wireless sensor network (WSN) deployment, traditional WSNs remain fundamentally constrained by persistent energy l... |
| Semantic Multiplexing | Mohammad Abdi, Francesca Meneghello, Francesco Restuccia | 2025-11-16 | Download | Mobile devices increasingly require the parallel execution of several computing tasks offloaded at the wireless edge. Existing communication systems only support parallel transmissions at the bit leve... |