2025-05-22

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Δ-Nets: Interaction-Based System for Optimal Parallel λ-Reduction	Daniel Augusto Rizzi Salvadori	2025-05-22	下载	I present a model of universal parallel computation called Δ-Nets, and a method to translate λ-terms into Δ-nets and back. Together, the model and the method constitute an algorithm for optimal ...
Modular SAIL: dream or reality?	Petr Kourzanov, Anmol	2025-05-22	下载	In order to truly benefit from RISC-V ISA modularity, the community has to address the issue of compositionality, going beyond modules at the specification level covering larger subsets of the RISC-V ...
CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark	Ahmed Heakl, Sarim Hashmi, Gustavo Bertolo Stahl, Seung Hun Eddie Han, Salman Khan, Abdulrahman Mahmoud	2025-05-22	下载	We introduce CASS, the first large-scale dataset and model suite for cross-architecture GPU code transpilation, targeting both source-level (CUDA <--> HIP) and assembly-level (Nvidia SASS <--> AMD RDN...
DAS-MP: Enabling High-Quality Macro Placement with Enhanced Dataflow Awareness	Xiaotian Zhao, Zixuan Li, Yichen Cai, Tianju Wang, Yushan Pan, Xinfei Guo	2025-05-22	下载	Dataflow is a critical yet underexplored factor in automatic macro placement, which is becoming increasingly important for developing intelligent design automation techniques that minimize reliance on...
How to keep pushing ML accelerator performance? Know your rooflines!	Marian Verhelst, Luca Benini, Naveen Verma	2025-05-22	下载	The rapidly growing importance of Machine Learning (ML) applications, coupled with their ever-increasing model size and inference energy footprint, has created a strong need for specialized ML hardwar...
Advanced Integration Strategies for ESD Protection and Termination in High-Speed LVDS Systems	Kavya Gaddipati	2025-05-22	下载	This technical article explores comprehensive strategies for integrating Electrostatic Discharge (ESD) protection diodes and termination resistors in LowVoltage Differential Signaling (LVDS) designs.
Cosmos: A CXL-Based Full In-Memory System for Approximate Nearest Neighbor Search	Seoyoung Ko, Hyunjeong Shim, Wanju Doh, Sungmin Yun, Jinin So, Yongsuk Kwon, Sang-Soo Park, Si-Dong Roh, Minyong Yoon, Taeksang Song, Jung Ho Ahn	2025-05-22	下载	Retrieval-Augmented Generation (RAG) is crucial for improving the quality of large language models by injecting proper contexts extracted from external sources.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Δ-Nets: Interaction-Based System for Optimal Parallel λ-Reduction	Daniel Augusto Rizzi Salvadori	2025-05-22	下载	I present a model of universal parallel computation called Δ-Nets, and a method to translate λ-terms into Δ-nets and back. Together, the model and the method constitute an algorithm for optimal ...
LogStamping: A blockchain-based log auditing approach for large-scale systems	Md Shariful Islam, M. Sohel Rahman	2025-05-22	下载	Log management is crucial for ensuring the security, integrity, and compliance of modern information systems. Traditional log management solutions face challenges in achieving tamper-proofing, scalabi...
Navigating the Edge-Cloud Continuum: A State-of-Practice Survey	Loris Belcastro, Fabrizio Marozzo, Alessio Orsino, Domenico Talia, Paolo Trunfio	2025-05-22	下载	The edge-cloud continuum has emerged as a transformative paradigm that meets the growing demand for low-latency, scalable, end-to-end service delivery by integrating decentralized edge resources with ...
Edge-First Language Model Inference: Models, Metrics, and Tradeoffs	SiYoung Jang, Roberto Morabito	2025-05-22	下载	The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge.
Recursive Offloading for LLM Serving in Multi-tier Networks	Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Jinda Lu, Zheming Yang, Tian Wen	2025-05-22	下载	Heterogeneous device-edge-cloud computing infrastructures have become widely adopted in telecommunication operators and Wide Area Networks (WANs), offering multi-tier computational support for emergin...
Smaller, Smarter, Closer: The Edge of Collaborative Generative AI	Roberto Morabito, SiYoung Jang	2025-05-22	下载	The rapid adoption of generative AI (GenAI), particularly Large Language Models (LLMs), has exposed critical limitations of cloud-centric deployments, including latency, cost, and privacy concerns.
Minimizing Energy in Reliability and Deadline-Ensured Workflow Scheduling in Cloud	Suvarthi Sarkar, Dhanesh V, Ketan Singh, Aryabartta Sahu	2025-05-22	下载	With the increasing prevalence of computationally intensive workflows in cloud environments, it has become crucial for cloud platforms to optimize energy consumption while ensuring the feasibility of ...
Redox: Improving I/O Efficiency of Model Training Through File Redirection	Yuhao Li, Xuanhua Shi, Yunfei Zhao, Yongluan Zhou, Yusheng Hua, Xuehai Qian	2025-05-22	下载	This paper proposes Redox, a training data management system designed to achieve high I/O efficiency. The key insight is a new observation of file redirection: for model training, when training data i...
On the Runtime of Local Mutual Exclusion for Anonymous Dynamic Networks	Anya Chaturvedi, Joshua J. Daymude, Andréa W. Richa	2025-05-22	下载	Algorithms for mutual exclusion aim to isolate potentially concurrent accesses to the same shared resources. Motivated by distributed computing research on programmable matter and population protocols...
Multimodal Online Federated Learning with Modality Missing in Internet of Things	Heqiang Wang, Xiang Liu, Xiaoxiong Zhong, Lixing Chen, Fangming Liu, Weizhe Zhang	2025-05-22	下载	The Internet of Things (IoT) ecosystem generates vast amounts of multimodal data from heterogeneous sources such as sensors, cameras, and microphones.
Towards Stream-Based Monitoring for EVM Networks	Emanuel Onica, Claudiu-Nicu Bărbieru, Andrei Arusoaie, Oana-Otilia Captarencu, Ciprian Amariei	2025-05-22	下载	We believe that leveraging real-time blockchain operational data is of particular interest in the context of the current rapid expansion of rollup networks in the Ethereum ecosystem.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
PREAMBLE and IMRECEIVING for Improved Large Message Handling in libp2p GossipSub	Muhammad Umar Farooq, Daniel Kaiser	2025-05-22	下载	Large message transmissions in libp2p GossipSub lead to longer than expected network-wide message dissemination times and very high bandwidth utilization.
LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols	Ziming Liu, Bryan Liu, Alvaro Valcarce, Xiaoli Chu	2025-05-22	下载	Integrating Large AI Models (LAMs) into 6G mobile networks is a key enabler of the AI-Native Air Interface (AI-AI), where protocol intelligence must scale beyond handcrafted logic.
Navigating the Edge-Cloud Continuum: A State-of-Practice Survey	Loris Belcastro, Fabrizio Marozzo, Alessio Orsino, Domenico Talia, Paolo Trunfio	2025-05-22	下载	The edge-cloud continuum has emerged as a transformative paradigm that meets the growing demand for low-latency, scalable, end-to-end service delivery by integrating decentralized edge resources with ...
SONIC: Cost-Effective Web Access for Developing Countries	Ayush Pandey, Rohail Asim, Jean Louis K. E. Fendji, Talal Rahwan, Matteo Varvello, Yasir Zaki	2025-05-22	下载	Over 2.6 billion people remain without access to the Internet in 2025. This phenomenon is especially pronounced in developing regions, where cost and infrastructure limitations are major barriers to c...
Edge-First Language Model Inference: Models, Metrics, and Tradeoffs	SiYoung Jang, Roberto Morabito	2025-05-22	下载	The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge.
Recursive Offloading for LLM Serving in Multi-tier Networks	Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Jinda Lu, Zheming Yang, Tian Wen	2025-05-22	下载	Heterogeneous device-edge-cloud computing infrastructures have become widely adopted in telecommunication operators and Wide Area Networks (WANs), offering multi-tier computational support for emergin...
Smaller, Smarter, Closer: The Edge of Collaborative Generative AI	Roberto Morabito, SiYoung Jang	2025-05-22	下载	The rapid adoption of generative AI (GenAI), particularly Large Language Models (LLMs), has exposed critical limitations of cloud-centric deployments, including latency, cost, and privacy concerns.
Graph Attention Network for Optimal User Association in Wireless Networks	Javad Mirzaei, Jeebak Mitra, Gwenael Poitau	2025-05-22	下载	With increased 5G deployments, network densification is higher than ever to support the exponentially high throughput requirements. However, this has meant a significant increase in energy consumption...
Resilient LLM-Empowered Semantic MAC Protocols via Zero-Shot Adaptation and Knowledge Distillation	Yongjun Kim, Jihong Park, Mehdi Bennis, Junil Choi	2025-05-22	下载	Neural network-based medium access control (MAC) protocol models (NPMs) improve goodput through site-specific operations but are vulnerable to shifts from their training network environments, such as ...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Edge-First Language Model Inference: Models, Metrics, and Tradeoffs	SiYoung Jang, Roberto Morabito	2025-05-22	下载	The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge.
Performance of Confidential Computing GPUs	Antonio Martínez Ibarra, Julian James Stephen, Aurora González Vidal, K. R. Jayaram, Antonio Fernando Skarmeta Gómez	2025-05-22	下载	This work examines latency, throughput, and other metrics when performing inference on confidential GPUs. We explore different traffic patterns and scheduling strategies using a single Virtual Machine...
Is Quantum Optimization Ready? An Effort Towards Neural Network Compression using Adiabatic Quantum Computing	Zhehui Wang, Benjamin Chen Ming Choong, Tian Huang, Daniel Gerlinghoff, Rick Siow Mong Goh, Cheng Liu, Tao Luo	2025-05-22	下载	Quantum optimization is the most mature quantum computing technology to date, providing a promising approach towards efficiently solving complex combinatorial problems.
Towards Stream-Based Monitoring for EVM Networks	Emanuel Onica, Claudiu-Nicu Bărbieru, Andrei Arusoaie, Oana-Otilia Captarencu, Ciprian Amariei	2025-05-22	下载	We believe that leveraging real-time blockchain operational data is of particular interest in the context of the current rapid expansion of rollup networks in the Ethereum ecosystem.