Skip to content

2025-04-15

cs.AR - Architecture

标题作者发布日期PDF摘要
E-morphic: Scalable Equality Saturation for Structural Exploration in Logic SynthesisChen Chen, Guangyu HU, Cunxi Yu, Yuzhe Ma, Hongce Zhang2025-04-15下载In technology mapping, the quality of the final implementation heavily relies on the circuit structure after technology-independent optimization.
HeatSense: Intelligent Thermal Anomaly Detection for Securing NoC-Enabled MPSoCsMahdi Hasanzadeh, Kasem Khalil, Cynthia Sturton, Ahmad Patooghy2025-04-15下载Multi-Processor System-on-Chips (MPSoCs) are highly vulnerable to thermal attacks that manipulate dynamic thermal management systems. To counter this, we propose an adaptive real-time monitoring mecha...
A Multi-Stage Potts Machine based on Coupled CMOS Ring OscillatorsYilmaz Ege Gonul, Baris Taskin2025-04-15下载This work presents a multi-stage coupled ring oscillator based Potts machine, designed with phase-shifted Sub Harmonic-Injection-Locking (SHIL) to represent multi valued Potts spins at different sol...
VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in TransformersRun Wang, Gamze Islamoglu, Andrea Belano, Viviane Potocnik, Francesco Conti, Angelo Garofalo, Luca Benini2025-04-15下载While Transformers are dominated by Floating-Point (FP) Matrix-Multiplications, their aggressive acceleration through dedicated hardware or many-core programmable systems has shifted the performance b...
Unlimited Vector Processing for Wireless Baseband Based on RISC-V ExtensionLimin Jiang, Yi Shi, Yihao Shen, Shan Cao, Zhiyuan Jiang, Sheng Zhou2025-04-15下载Wireless baseband processing (WBP) serves as an ideal scenario for utilizing vector processing, which excels in managing data-parallel operations due to its parallel structure.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Engineering MultiQueues: Fast Relaxed Concurrent Priority QueuesMarvin Williams, Peter Sanders2025-04-15下载Priority queues are used in a wide range of applications, including prioritized online scheduling, discrete event simulation, and greedy algorithms.
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11)Tianyi Zhang, Mohsen Hariri, Shaochen Zhong, Vipin Chaudhary, Yang Sui, Xia Hu, Anshumali Shrivastava2025-04-15下载Large-scale AI models, such as Large Language Models (LLMs) and Diffusion Models (DMs), have grown rapidly in size, creating significant challenges for efficient deployment on resource-constrained har...
FlowUnits: Extending Dataflow for the Edge-to-Cloud Computing ContinuumFabio Chini, Luca De Martini, Alessandro Margara, Gianpaolo Cugola2025-04-15下载This paper introduces FlowUnits, a novel programming and deployment model that extends the traditional dataflow paradigm to address the unique challenges of edge-to-cloud computing environments.
Transformer-Based Model for Cold Start Mitigation in FaaS ArchitectureAlexandre Savi Fayam Mbala Mouen, Jerry Lacmou Zeutouo, Vianney Kengne Tchendji2025-04-15下载Serverless architectures, particularly the Function as a Service (FaaS) model, have become a cornerstone of modern cloud computing due to their ability to simplify resource management and enhance appl...
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory ConstraintsRuicheng Ao, Gan Luo, David Simchi-Levi, Xinshang Wang2025-04-15下载Large Language Models (LLMs) power many modern applications, but their inference procedure poses unique scheduling challenges: the Key-Value (KV) cache grows dynamically during response generation, an...
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model PerformanceShangyu Liu, Zhenzhe Zheng, Xiaoyao Huang, Fan Wu, Guihai Chen, Jie Wu2025-04-15下载Small language models (SLMs) support efficient deployments on resource-constrained edge devices, but their limited capacity compromises inference performance.
Uma extensão de Raft com propagação epidémicaAndré Gonçalves, Ana Nunes Alonso, José Pereira, Rui Oliveira2025-04-15下载The Raft agreement algorithm is recognized for its ease of understanding and practical implementation, and is currently adopted in systems such as Kubernetes.
Morphing-based Compression for Data-centric ML PipelinesSebastian Baunsgaard, Matthias Boehm2025-04-15下载Data-centric ML pipelines extend traditional machine learning (ML) pipelines -- of feature transformations and ML model training -- by outer loops for data cleaning, augmentation, and feature engineer...
Kubernetes in the Cloud vs. Bare Metal: A Comparative Study of Network CostsRodrigo Mompo Redoli, Amjad Ullah2025-04-15下载Modern cloud-native applications increasingly utilise managed cloud services and containerisation technologies, such as Kubernetes, to achieve rapid time-to-market and scalable deployments.
Denoising Application Performance Models with Noise-Resilient PriorsGustavo de Morais, Alexander Geiß, Alexandru Calotoiu, Gregor Corbin, Ahmad Tarraf, Torsten Hoefler, Bernd Mohr, Felix Wolf2025-04-15下载As parallel codes are scaled to larger computing systems, performance models play a crucial role in identifying potential bottlenecks. However, constructing these models analytically is often challeng...
High-Efficiency Split Computing for Cooperative Edge Systems: A Novel Compressed Sensing BottleneckHailin Zhong, Donglong Chen2025-04-15下载The advent of big data and AI has precipitated a demand for computational frameworks that ensure real-time performance, accuracy, and privacy.
Mosaic: Client-driven Account Allocation Framework in Sharded BlockchainsYuanzhe Zhang, Shirui Pan, Jiangshan Yu2025-04-15下载Recent account allocation studies in sharded blockchains are typically miner-driven, requiring miners to perform global optimizations for all accounts to enhance system-wide performance.
Matrix representation and GPU-optimized parallel B-spline computingJiayu Wu, Qiang Zou2025-04-15下载B-spline modeling is fundamental to CAD systems, and its evaluation and manipulation algorithms currently in use were developed decades ago, specifically for CPU architectures.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Beam Misalignment in 3GPP mmWave NRNoe Bernadas i Busquets, Xavier Gelabert, Bleron Klaiqi, Ki Won Sung, Slimane Ben Slimane2025-04-15下载This paper presents an analytical framework for evaluating beam misalignment in 3GPP mmWave NR systems implementing analog beamforming. Our approach captures the interaction between user mobility, bea...
Fuzzy Based Secure Clustering Schemes for Wireless Sensor NetworksMohd Adnan2025-04-15下载This dissertation presents three independent novel approaches for distinct scenarios to solve one or more open challenges. The first concern explains the focus on the lifetime of the networks: this di...
A Mathematical Framework of Semantic Communication based on Category TheoryShuheng Hua, Yao Sun, Kairong Ma, Dusit Niyato, Muhammad Ali Imran2025-04-15下载While semantic communication (SemCom) has recently demonstrated great potential to enhance transmission efficiency and reliability by leveraging machine learning (ML) and knowledge base (KB), there is...
Reconstructing Fine-Grained Network Data using Autoencoder Architectures with Domain Knowledge PenaltiesMark Cheung, Sridhar Venkatesan2025-04-15下载The ability to reconstruct fine-grained network session data, including individual packets, from coarse-grained feature vectors is crucial for improving network security models.
AutoRAN: Automated and Zero-Touch Open RAN SystemsStefano Maxenti, Ravis Shirkhani, Maxime Elkael, Leonardo Bonati, Salvatore D'Oro, Tommaso Melodia, Michele Polese2025-04-15下载[...] This paper presents AutoRAN, an automated, intent-driven framework for zero-touch provisioning of open, programmable cellular networks. Leveraging cloud-native principles, AutoRAN employs virtua...
A Quantum Speedup in Localizing Transmission Loss Change in Optical NetworksYufei Zheng, Yu-Zhen Janice Chen, Prithwish Basu, Don Towsley2025-04-15下载The ability to localize transmission loss change to a subset of links in optical networks is crucial for maintaining network reliability, performance and security.

cs.PF - Performance

标题作者发布日期PDF摘要
Engineering MultiQueues: Fast Relaxed Concurrent Priority QueuesMarvin Williams, Peter Sanders2025-04-15下载Priority queues are used in a wide range of applications, including prioritized online scheduling, discrete event simulation, and greedy algorithms.
Denoising Application Performance Models with Noise-Resilient PriorsGustavo de Morais, Alexander Geiß, Alexandru Calotoiu, Gregor Corbin, Ahmad Tarraf, Torsten Hoefler, Bernd Mohr, Felix Wolf2025-04-15下载As parallel codes are scaled to larger computing systems, performance models play a crucial role in identifying potential bottlenecks. However, constructing these models analytically is often challeng...

基于 VitePress 构建