2025-04-15

cs.AR - Hardware Architecture

Title	Authors	Published	PDF	Abstract
E-morphic: Scalable Equality Saturation for Structural Exploration in Logic Synthesis	Chen Chen, Guangyu HU, Cunxi Yu, Yuzhe Ma, Hongce Zhang	2025-04-15	Download	In technology mapping, the quality of the final implementation heavily relies on the circuit structure after technology-independent optimization.
HeatSense: Intelligent Thermal Anomaly Detection for Securing NoC-Enabled MPSoCs	Mahdi Hasanzadeh, Kasem Khalil, Cynthia Sturton, Ahmad Patooghy	2025-04-15	Download	Multi-Processor System-on-Chips (MPSoCs) are highly vulnerable to thermal attacks that manipulate dynamic thermal management systems. To counter this, we propose an adaptive real-time monitoring mecha...
A Multi-Stage Potts Machine based on Coupled CMOS Ring Oscillators	Yilmaz Ege Gonul, Baris Taskin	2025-04-15	Download	This work presents a multi-stage coupled ring oscillator based Potts machine, designed with phase-shifted Sub Harmonic-Injection-Locking (SHIL) to represent multi valued Potts spins at different sol...
VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers	Run Wang, Gamze Islamoglu, Andrea Belano, Viviane Potocnik, Francesco Conti, Angelo Garofalo, Luca Benini	2025-04-15	Download	While Transformers are dominated by Floating-Point (FP) Matrix-Multiplications, their aggressive acceleration through dedicated hardware or many-core programmable systems has shifted the performance b...
Unlimited Vector Processing for Wireless Baseband Based on RISC-V Extension	Limin Jiang, Yi Shi, Yihao Shen, Shan Cao, Zhiyuan Jiang, Sheng Zhou	2025-04-15	Download	Wireless baseband processing (WBP) serves as an ideal scenario for utilizing vector processing, which excels in managing data-parallel operations due to its parallel structure.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Engineering MultiQueues: Fast Relaxed Concurrent Priority Queues	Marvin Williams, Peter Sanders	2025-04-15	Download	Priority queues are used in a wide range of applications, including prioritized online scheduling, discrete event simulation, and greedy algorithms.
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11)	Tianyi Zhang, Mohsen Hariri, Shaochen Zhong, Vipin Chaudhary, Yang Sui, Xia Hu, Anshumali Shrivastava	2025-04-15	Download	Large-scale AI models, such as Large Language Models (LLMs) and Diffusion Models (DMs), have grown rapidly in size, creating significant challenges for efficient deployment on resource-constrained har...
FlowUnits: Extending Dataflow for the Edge-to-Cloud Computing Continuum	Fabio Chini, Luca De Martini, Alessandro Margara, Gianpaolo Cugola	2025-04-15	Download	This paper introduces FlowUnits, a novel programming and deployment model that extends the traditional dataflow paradigm to address the unique challenges of edge-to-cloud computing environments.
Transformer-Based Model for Cold Start Mitigation in FaaS Architecture	Alexandre Savi Fayam Mbala Mouen, Jerry Lacmou Zeutouo, Vianney Kengne Tchendji	2025-04-15	Download	Serverless architectures, particularly the Function as a Service (FaaS) model, have become a cornerstone of modern cloud computing due to their ability to simplify resource management and enhance appl...
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints	Ruicheng Ao, Gan Luo, David Simchi-Levi, Xinshang Wang	2025-04-15	Download	Large Language Models (LLMs) power many modern applications, but their inference procedure poses unique scheduling challenges: the Key-Value (KV) cache grows dynamically during response generation, an...
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance	Shangyu Liu, Zhenzhe Zheng, Xiaoyao Huang, Fan Wu, Guihai Chen, Jie Wu	2025-04-15	Download	Small language models (SLMs) support efficient deployments on resource-constrained edge devices, but their limited capacity compromises inference performance.
Uma extensão de Raft com propagação epidémica	André Gonçalves, Ana Nunes Alonso, José Pereira, Rui Oliveira	2025-04-15	Download	The Raft agreement algorithm is recognized for its ease of understanding and practical implementation, and is currently adopted in systems such as Kubernetes.
Morphing-based Compression for Data-centric ML Pipelines	Sebastian Baunsgaard, Matthias Boehm	2025-04-15	Download	Data-centric ML pipelines extend traditional machine learning (ML) pipelines -- of feature transformations and ML model training -- by outer loops for data cleaning, augmentation, and feature engineer...
Kubernetes in the Cloud vs. Bare Metal: A Comparative Study of Network Costs	Rodrigo Mompo Redoli, Amjad Ullah	2025-04-15	Download	Modern cloud-native applications increasingly utilise managed cloud services and containerisation technologies, such as Kubernetes, to achieve rapid time-to-market and scalable deployments.
Denoising Application Performance Models with Noise-Resilient Priors	Gustavo de Morais, Alexander Geiß, Alexandru Calotoiu, Gregor Corbin, Ahmad Tarraf, Torsten Hoefler, Bernd Mohr, Felix Wolf	2025-04-15	Download	As parallel codes are scaled to larger computing systems, performance models play a crucial role in identifying potential bottlenecks. However, constructing these models analytically is often challeng...
High-Efficiency Split Computing for Cooperative Edge Systems: A Novel Compressed Sensing Bottleneck	Hailin Zhong, Donglong Chen	2025-04-15	Download	The advent of big data and AI has precipitated a demand for computational frameworks that ensure real-time performance, accuracy, and privacy.
Mosaic: Client-driven Account Allocation Framework in Sharded Blockchains	Yuanzhe Zhang, Shirui Pan, Jiangshan Yu	2025-04-15	Download	Recent account allocation studies in sharded blockchains are typically miner-driven, requiring miners to perform global optimizations for all accounts to enhance system-wide performance.
Matrix representation and GPU-optimized parallel B-spline computing	Jiayu Wu, Qiang Zou	2025-04-15	Download	B-spline modeling is fundamental to CAD systems, and its evaluation and manipulation algorithms currently in use were developed decades ago, specifically for CPU architectures.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Beam Misalignment in 3GPP mmWave NR	Noe Bernadas i Busquets, Xavier Gelabert, Bleron Klaiqi, Ki Won Sung, Slimane Ben Slimane	2025-04-15	Download	This paper presents an analytical framework for evaluating beam misalignment in 3GPP mmWave NR systems implementing analog beamforming. Our approach captures the interaction between user mobility, bea...
Fuzzy Based Secure Clustering Schemes for Wireless Sensor Networks	Mohd Adnan	2025-04-15	Download	This dissertation presents three independent novel approaches for distinct scenarios to solve one or more open challenges. The first concern explains the focus on the lifetime of the networks: this di...
A Mathematical Framework of Semantic Communication based on Category Theory	Shuheng Hua, Yao Sun, Kairong Ma, Dusit Niyato, Muhammad Ali Imran	2025-04-15	Download	While semantic communication (SemCom) has recently demonstrated great potential to enhance transmission efficiency and reliability by leveraging machine learning (ML) and knowledge base (KB), there is...
Reconstructing Fine-Grained Network Data using Autoencoder Architectures with Domain Knowledge Penalties	Mark Cheung, Sridhar Venkatesan	2025-04-15	Download	The ability to reconstruct fine-grained network session data, including individual packets, from coarse-grained feature vectors is crucial for improving network security models.
AutoRAN: Automated and Zero-Touch Open RAN Systems	Stefano Maxenti, Ravis Shirkhani, Maxime Elkael, Leonardo Bonati, Salvatore D'Oro, Tommaso Melodia, Michele Polese	2025-04-15	Download	[...] This paper presents AutoRAN, an automated, intent-driven framework for zero-touch provisioning of open, programmable cellular networks. Leveraging cloud-native principles, AutoRAN employs virtua...
A Quantum Speedup in Localizing Transmission Loss Change in Optical Networks	Yufei Zheng, Yu-Zhen Janice Chen, Prithwish Basu, Don Towsley	2025-04-15	Download	The ability to localize transmission loss change to a subset of links in optical networks is crucial for maintaining network reliability, performance and security.

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Engineering MultiQueues: Fast Relaxed Concurrent Priority Queues	Marvin Williams, Peter Sanders	2025-04-15	Download	Priority queues are used in a wide range of applications, including prioritized online scheduling, discrete event simulation, and greedy algorithms.
Denoising Application Performance Models with Noise-Resilient Priors	Gustavo de Morais, Alexander Geiß, Alexandru Calotoiu, Gregor Corbin, Ahmad Tarraf, Torsten Hoefler, Bernd Mohr, Felix Wolf	2025-04-15	Download	As parallel codes are scaled to larger computing systems, performance models play a crucial role in identifying potential bottlenecks. However, constructing these models analytically is often challeng...