Skip to content

2025-05-22

cs.AR - Architecture

标题作者发布日期PDF摘要
Δ-Nets: Interaction-Based System for Optimal Parallel λ-ReductionDaniel Augusto Rizzi Salvadori2025-05-22下载I present a model of universal parallel computation called Δ-Nets, and a method to translate λ-terms into Δ-nets and back. Together, the model and the method constitute an algorithm for optimal ...
Modular SAIL: dream or reality?Petr Kourzanov, Anmol2025-05-22下载In order to truly benefit from RISC-V ISA modularity, the community has to address the issue of compositionality, going beyond modules at the specification level covering larger subsets of the RISC-V ...
CASS: Nvidia to AMD Transpilation with Data, Models, and BenchmarkAhmed Heakl, Sarim Hashmi, Gustavo Bertolo Stahl, Seung Hun Eddie Han, Salman Khan, Abdulrahman Mahmoud2025-05-22下载We introduce CASS, the first large-scale dataset and model suite for cross-architecture GPU code transpilation, targeting both source-level (CUDA <--> HIP) and assembly-level (Nvidia SASS <--> AMD RDN...
DAS-MP: Enabling High-Quality Macro Placement with Enhanced Dataflow AwarenessXiaotian Zhao, Zixuan Li, Yichen Cai, Tianju Wang, Yushan Pan, Xinfei Guo2025-05-22下载Dataflow is a critical yet underexplored factor in automatic macro placement, which is becoming increasingly important for developing intelligent design automation techniques that minimize reliance on...
How to keep pushing ML accelerator performance? Know your rooflines!Marian Verhelst, Luca Benini, Naveen Verma2025-05-22下载The rapidly growing importance of Machine Learning (ML) applications, coupled with their ever-increasing model size and inference energy footprint, has created a strong need for specialized ML hardwar...
Advanced Integration Strategies for ESD Protection and Termination in High-Speed LVDS SystemsKavya Gaddipati2025-05-22下载This technical article explores comprehensive strategies for integrating Electrostatic Discharge (ESD) protection diodes and termination resistors in LowVoltage Differential Signaling (LVDS) designs.
Cosmos: A CXL-Based Full In-Memory System for Approximate Nearest Neighbor SearchSeoyoung Ko, Hyunjeong Shim, Wanju Doh, Sungmin Yun, Jinin So, Yongsuk Kwon, Sang-Soo Park, Si-Dong Roh, Minyong Yoon, Taeksang Song, Jung Ho Ahn2025-05-22下载Retrieval-Augmented Generation (RAG) is crucial for improving the quality of large language models by injecting proper contexts extracted from external sources.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Δ-Nets: Interaction-Based System for Optimal Parallel λ-ReductionDaniel Augusto Rizzi Salvadori2025-05-22下载I present a model of universal parallel computation called Δ-Nets, and a method to translate λ-terms into Δ-nets and back. Together, the model and the method constitute an algorithm for optimal ...
LogStamping: A blockchain-based log auditing approach for large-scale systemsMd Shariful Islam, M. Sohel Rahman2025-05-22下载Log management is crucial for ensuring the security, integrity, and compliance of modern information systems. Traditional log management solutions face challenges in achieving tamper-proofing, scalabi...
Navigating the Edge-Cloud Continuum: A State-of-Practice SurveyLoris Belcastro, Fabrizio Marozzo, Alessio Orsino, Domenico Talia, Paolo Trunfio2025-05-22下载The edge-cloud continuum has emerged as a transformative paradigm that meets the growing demand for low-latency, scalable, end-to-end service delivery by integrating decentralized edge resources with ...
Edge-First Language Model Inference: Models, Metrics, and TradeoffsSiYoung Jang, Roberto Morabito2025-05-22下载The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge.
Recursive Offloading for LLM Serving in Multi-tier NetworksZhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Jinda Lu, Zheming Yang, Tian Wen2025-05-22下载Heterogeneous device-edge-cloud computing infrastructures have become widely adopted in telecommunication operators and Wide Area Networks (WANs), offering multi-tier computational support for emergin...
Smaller, Smarter, Closer: The Edge of Collaborative Generative AIRoberto Morabito, SiYoung Jang2025-05-22下载The rapid adoption of generative AI (GenAI), particularly Large Language Models (LLMs), has exposed critical limitations of cloud-centric deployments, including latency, cost, and privacy concerns.
Minimizing Energy in Reliability and Deadline-Ensured Workflow Scheduling in CloudSuvarthi Sarkar, Dhanesh V, Ketan Singh, Aryabartta Sahu2025-05-22下载With the increasing prevalence of computationally intensive workflows in cloud environments, it has become crucial for cloud platforms to optimize energy consumption while ensuring the feasibility of ...
Redox: Improving I/O Efficiency of Model Training Through File RedirectionYuhao Li, Xuanhua Shi, Yunfei Zhao, Yongluan Zhou, Yusheng Hua, Xuehai Qian2025-05-22下载This paper proposes Redox, a training data management system designed to achieve high I/O efficiency. The key insight is a new observation of file redirection: for model training, when training data i...
On the Runtime of Local Mutual Exclusion for Anonymous Dynamic NetworksAnya Chaturvedi, Joshua J. Daymude, Andréa W. Richa2025-05-22下载Algorithms for mutual exclusion aim to isolate potentially concurrent accesses to the same shared resources. Motivated by distributed computing research on programmable matter and population protocols...
Multimodal Online Federated Learning with Modality Missing in Internet of ThingsHeqiang Wang, Xiang Liu, Xiaoxiong Zhong, Lixing Chen, Fangming Liu, Weizhe Zhang2025-05-22下载The Internet of Things (IoT) ecosystem generates vast amounts of multimodal data from heterogeneous sources such as sensors, cameras, and microphones.
Towards Stream-Based Monitoring for EVM NetworksEmanuel Onica, Claudiu-Nicu Bărbieru, Andrei Arusoaie, Oana-Otilia Captarencu, Ciprian Amariei2025-05-22下载We believe that leveraging real-time blockchain operational data is of particular interest in the context of the current rapid expansion of rollup networks in the Ethereum ecosystem.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
PREAMBLE and IMRECEIVING for Improved Large Message Handling in libp2p GossipSubMuhammad Umar Farooq, Daniel Kaiser2025-05-22下载Large message transmissions in libp2p GossipSub lead to longer than expected network-wide message dissemination times and very high bandwidth utilization.
LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN ProtocolsZiming Liu, Bryan Liu, Alvaro Valcarce, Xiaoli Chu2025-05-22下载Integrating Large AI Models (LAMs) into 6G mobile networks is a key enabler of the AI-Native Air Interface (AI-AI), where protocol intelligence must scale beyond handcrafted logic.
Navigating the Edge-Cloud Continuum: A State-of-Practice SurveyLoris Belcastro, Fabrizio Marozzo, Alessio Orsino, Domenico Talia, Paolo Trunfio2025-05-22下载The edge-cloud continuum has emerged as a transformative paradigm that meets the growing demand for low-latency, scalable, end-to-end service delivery by integrating decentralized edge resources with ...
SONIC: Cost-Effective Web Access for Developing CountriesAyush Pandey, Rohail Asim, Jean Louis K. E. Fendji, Talal Rahwan, Matteo Varvello, Yasir Zaki2025-05-22下载Over 2.6 billion people remain without access to the Internet in 2025. This phenomenon is especially pronounced in developing regions, where cost and infrastructure limitations are major barriers to c...
Edge-First Language Model Inference: Models, Metrics, and TradeoffsSiYoung Jang, Roberto Morabito2025-05-22下载The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge.
Recursive Offloading for LLM Serving in Multi-tier NetworksZhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Jinda Lu, Zheming Yang, Tian Wen2025-05-22下载Heterogeneous device-edge-cloud computing infrastructures have become widely adopted in telecommunication operators and Wide Area Networks (WANs), offering multi-tier computational support for emergin...
Smaller, Smarter, Closer: The Edge of Collaborative Generative AIRoberto Morabito, SiYoung Jang2025-05-22下载The rapid adoption of generative AI (GenAI), particularly Large Language Models (LLMs), has exposed critical limitations of cloud-centric deployments, including latency, cost, and privacy concerns.
Graph Attention Network for Optimal User Association in Wireless NetworksJavad Mirzaei, Jeebak Mitra, Gwenael Poitau2025-05-22下载With increased 5G deployments, network densification is higher than ever to support the exponentially high throughput requirements. However, this has meant a significant increase in energy consumption...
Resilient LLM-Empowered Semantic MAC Protocols via Zero-Shot Adaptation and Knowledge DistillationYongjun Kim, Jihong Park, Mehdi Bennis, Junil Choi2025-05-22下载Neural network-based medium access control (MAC) protocol models (NPMs) improve goodput through site-specific operations but are vulnerable to shifts from their training network environments, such as ...

cs.PF - Performance

标题作者发布日期PDF摘要
Edge-First Language Model Inference: Models, Metrics, and TradeoffsSiYoung Jang, Roberto Morabito2025-05-22下载The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge.
Performance of Confidential Computing GPUsAntonio Martínez Ibarra, Julian James Stephen, Aurora González Vidal, K. R. Jayaram, Antonio Fernando Skarmeta Gómez2025-05-22下载This work examines latency, throughput, and other metrics when performing inference on confidential GPUs. We explore different traffic patterns and scheduling strategies using a single Virtual Machine...
Is Quantum Optimization Ready? An Effort Towards Neural Network Compression using Adiabatic Quantum ComputingZhehui Wang, Benjamin Chen Ming Choong, Tian Huang, Daniel Gerlinghoff, Rick Siow Mong Goh, Cheng Liu, Tao Luo2025-05-22下载Quantum optimization is the most mature quantum computing technology to date, providing a promising approach towards efficiently solving complex combinatorial problems.
Towards Stream-Based Monitoring for EVM NetworksEmanuel Onica, Claudiu-Nicu Bărbieru, Andrei Arusoaie, Oana-Otilia Captarencu, Ciprian Amariei2025-05-22下载We believe that leveraging real-time blockchain operational data is of particular interest in the context of the current rapid expansion of rollup networks in the Ethereum ecosystem.

基于 VitePress 构建