Appearance
2025-04-15
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| E-morphic: Scalable Equality Saturation for Structural Exploration in Logic Synthesis | Chen Chen, Guangyu HU, Cunxi Yu, Yuzhe Ma, Hongce Zhang | 2025-04-15 | 下载 | In technology mapping, the quality of the final implementation heavily relies on the circuit structure after technology-independent optimization. |
| HeatSense: Intelligent Thermal Anomaly Detection for Securing NoC-Enabled MPSoCs | Mahdi Hasanzadeh, Kasem Khalil, Cynthia Sturton, Ahmad Patooghy | 2025-04-15 | 下载 | Multi-Processor System-on-Chips (MPSoCs) are highly vulnerable to thermal attacks that manipulate dynamic thermal management systems. To counter this, we propose an adaptive real-time monitoring mecha... |
| A Multi-Stage Potts Machine based on Coupled CMOS Ring Oscillators | Yilmaz Ege Gonul, Baris Taskin | 2025-04-15 | 下载 | This work presents a multi-stage coupled ring oscillator based Potts machine, designed with phase-shifted Sub Harmonic-Injection-Locking (SHIL) to represent multi valued Potts spins at different sol... |
| VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers | Run Wang, Gamze Islamoglu, Andrea Belano, Viviane Potocnik, Francesco Conti, Angelo Garofalo, Luca Benini | 2025-04-15 | 下载 | While Transformers are dominated by Floating-Point (FP) Matrix-Multiplications, their aggressive acceleration through dedicated hardware or many-core programmable systems has shifted the performance b... |
| Unlimited Vector Processing for Wireless Baseband Based on RISC-V Extension | Limin Jiang, Yi Shi, Yihao Shen, Shan Cao, Zhiyuan Jiang, Sheng Zhou | 2025-04-15 | 下载 | Wireless baseband processing (WBP) serves as an ideal scenario for utilizing vector processing, which excels in managing data-parallel operations due to its parallel structure. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Engineering MultiQueues: Fast Relaxed Concurrent Priority Queues | Marvin Williams, Peter Sanders | 2025-04-15 | 下载 | Priority queues are used in a wide range of applications, including prioritized online scheduling, discrete event simulation, and greedy algorithms. |
| 70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float (DFloat11) | Tianyi Zhang, Mohsen Hariri, Shaochen Zhong, Vipin Chaudhary, Yang Sui, Xia Hu, Anshumali Shrivastava | 2025-04-15 | 下载 | Large-scale AI models, such as Large Language Models (LLMs) and Diffusion Models (DMs), have grown rapidly in size, creating significant challenges for efficient deployment on resource-constrained har... |
| FlowUnits: Extending Dataflow for the Edge-to-Cloud Computing Continuum | Fabio Chini, Luca De Martini, Alessandro Margara, Gianpaolo Cugola | 2025-04-15 | 下载 | This paper introduces FlowUnits, a novel programming and deployment model that extends the traditional dataflow paradigm to address the unique challenges of edge-to-cloud computing environments. |
| Transformer-Based Model for Cold Start Mitigation in FaaS Architecture | Alexandre Savi Fayam Mbala Mouen, Jerry Lacmou Zeutouo, Vianney Kengne Tchendji | 2025-04-15 | 下载 | Serverless architectures, particularly the Function as a Service (FaaS) model, have become a cornerstone of modern cloud computing due to their ability to simplify resource management and enhance appl... |
| Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Ruicheng Ao, Gan Luo, David Simchi-Levi, Xinshang Wang | 2025-04-15 | 下载 | Large Language Models (LLMs) power many modern applications, but their inference procedure poses unique scheduling challenges: the Key-Value (KV) cache grows dynamically during response generation, an... |
| Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance | Shangyu Liu, Zhenzhe Zheng, Xiaoyao Huang, Fan Wu, Guihai Chen, Jie Wu | 2025-04-15 | 下载 | Small language models (SLMs) support efficient deployments on resource-constrained edge devices, but their limited capacity compromises inference performance. |
| Uma extensão de Raft com propagação epidémica | André Gonçalves, Ana Nunes Alonso, José Pereira, Rui Oliveira | 2025-04-15 | 下载 | The Raft agreement algorithm is recognized for its ease of understanding and practical implementation, and is currently adopted in systems such as Kubernetes. |
| Morphing-based Compression for Data-centric ML Pipelines | Sebastian Baunsgaard, Matthias Boehm | 2025-04-15 | 下载 | Data-centric ML pipelines extend traditional machine learning (ML) pipelines -- of feature transformations and ML model training -- by outer loops for data cleaning, augmentation, and feature engineer... |
| Kubernetes in the Cloud vs. Bare Metal: A Comparative Study of Network Costs | Rodrigo Mompo Redoli, Amjad Ullah | 2025-04-15 | 下载 | Modern cloud-native applications increasingly utilise managed cloud services and containerisation technologies, such as Kubernetes, to achieve rapid time-to-market and scalable deployments. |
| Denoising Application Performance Models with Noise-Resilient Priors | Gustavo de Morais, Alexander Geiß, Alexandru Calotoiu, Gregor Corbin, Ahmad Tarraf, Torsten Hoefler, Bernd Mohr, Felix Wolf | 2025-04-15 | 下载 | As parallel codes are scaled to larger computing systems, performance models play a crucial role in identifying potential bottlenecks. However, constructing these models analytically is often challeng... |
| High-Efficiency Split Computing for Cooperative Edge Systems: A Novel Compressed Sensing Bottleneck | Hailin Zhong, Donglong Chen | 2025-04-15 | 下载 | The advent of big data and AI has precipitated a demand for computational frameworks that ensure real-time performance, accuracy, and privacy. |
| Mosaic: Client-driven Account Allocation Framework in Sharded Blockchains | Yuanzhe Zhang, Shirui Pan, Jiangshan Yu | 2025-04-15 | 下载 | Recent account allocation studies in sharded blockchains are typically miner-driven, requiring miners to perform global optimizations for all accounts to enhance system-wide performance. |
| Matrix representation and GPU-optimized parallel B-spline computing | Jiayu Wu, Qiang Zou | 2025-04-15 | 下载 | B-spline modeling is fundamental to CAD systems, and its evaluation and manipulation algorithms currently in use were developed decades ago, specifically for CPU architectures. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Beam Misalignment in 3GPP mmWave NR | Noe Bernadas i Busquets, Xavier Gelabert, Bleron Klaiqi, Ki Won Sung, Slimane Ben Slimane | 2025-04-15 | 下载 | This paper presents an analytical framework for evaluating beam misalignment in 3GPP mmWave NR systems implementing analog beamforming. Our approach captures the interaction between user mobility, bea... |
| Fuzzy Based Secure Clustering Schemes for Wireless Sensor Networks | Mohd Adnan | 2025-04-15 | 下载 | This dissertation presents three independent novel approaches for distinct scenarios to solve one or more open challenges. The first concern explains the focus on the lifetime of the networks: this di... |
| A Mathematical Framework of Semantic Communication based on Category Theory | Shuheng Hua, Yao Sun, Kairong Ma, Dusit Niyato, Muhammad Ali Imran | 2025-04-15 | 下载 | While semantic communication (SemCom) has recently demonstrated great potential to enhance transmission efficiency and reliability by leveraging machine learning (ML) and knowledge base (KB), there is... |
| Reconstructing Fine-Grained Network Data using Autoencoder Architectures with Domain Knowledge Penalties | Mark Cheung, Sridhar Venkatesan | 2025-04-15 | 下载 | The ability to reconstruct fine-grained network session data, including individual packets, from coarse-grained feature vectors is crucial for improving network security models. |
| AutoRAN: Automated and Zero-Touch Open RAN Systems | Stefano Maxenti, Ravis Shirkhani, Maxime Elkael, Leonardo Bonati, Salvatore D'Oro, Tommaso Melodia, Michele Polese | 2025-04-15 | 下载 | [...] This paper presents AutoRAN, an automated, intent-driven framework for zero-touch provisioning of open, programmable cellular networks. Leveraging cloud-native principles, AutoRAN employs virtua... |
| A Quantum Speedup in Localizing Transmission Loss Change in Optical Networks | Yufei Zheng, Yu-Zhen Janice Chen, Prithwish Basu, Don Towsley | 2025-04-15 | 下载 | The ability to localize transmission loss change to a subset of links in optical networks is crucial for maintaining network reliability, performance and security. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Engineering MultiQueues: Fast Relaxed Concurrent Priority Queues | Marvin Williams, Peter Sanders | 2025-04-15 | 下载 | Priority queues are used in a wide range of applications, including prioritized online scheduling, discrete event simulation, and greedy algorithms. |
| Denoising Application Performance Models with Noise-Resilient Priors | Gustavo de Morais, Alexander Geiß, Alexandru Calotoiu, Gregor Corbin, Ahmad Tarraf, Torsten Hoefler, Bernd Mohr, Felix Wolf | 2025-04-15 | 下载 | As parallel codes are scaled to larger computing systems, performance models play a crucial role in identifying potential bottlenecks. However, constructing these models analytically is often challeng... |