2025-05-22
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Δ-Nets: Interaction-Based System for Optimal Parallel λ-Reduction | Daniel Augusto Rizzi Salvadori | 2025-05-22 | Download | I present a model of universal parallel computation called Δ-Nets, and a method to translate λ-terms into Δ-nets and back. Together, the model and the method constitute an algorithm for optimal ... |
| Modular SAIL: dream or reality? | Petr Kourzanov, Anmol | 2025-05-22 | Download | In order to truly benefit from RISC-V ISA modularity, the community has to address the issue of compositionality, going beyond modules at the specification level covering larger subsets of the RISC-V ... |
| CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark | Ahmed Heakl, Sarim Hashmi, Gustavo Bertolo Stahl, Seung Hun Eddie Han, Salman Khan, Abdulrahman Mahmoud | 2025-05-22 | Download | We introduce CASS, the first large-scale dataset and model suite for cross-architecture GPU code transpilation, targeting both source-level (CUDA <--> HIP) and assembly-level (Nvidia SASS <--> AMD RDN... |
| DAS-MP: Enabling High-Quality Macro Placement with Enhanced Dataflow Awareness | Xiaotian Zhao, Zixuan Li, Yichen Cai, Tianju Wang, Yushan Pan, Xinfei Guo | 2025-05-22 | Download | Dataflow is a critical yet underexplored factor in automatic macro placement, which is becoming increasingly important for developing intelligent design automation techniques that minimize reliance on... |
| How to keep pushing ML accelerator performance? Know your rooflines! | Marian Verhelst, Luca Benini, Naveen Verma | 2025-05-22 | Download | The rapidly growing importance of Machine Learning (ML) applications, coupled with their ever-increasing model size and inference energy footprint, has created a strong need for specialized ML hardwar... |
| Advanced Integration Strategies for ESD Protection and Termination in High-Speed LVDS Systems | Kavya Gaddipati | 2025-05-22 | Download | This technical article explores comprehensive strategies for integrating Electrostatic Discharge (ESD) protection diodes and termination resistors in Low-Voltage Differential Signaling (LVDS) designs. |
| Cosmos: A CXL-Based Full In-Memory System for Approximate Nearest Neighbor Search | Seoyoung Ko, Hyunjeong Shim, Wanju Doh, Sungmin Yun, Jinin So, Yongsuk Kwon, Sang-Soo Park, Si-Dong Roh, Minyong Yoon, Taeksang Song, Jung Ho Ahn | 2025-05-22 | Download | Retrieval-Augmented Generation (RAG) is crucial for improving the quality of large language models by injecting proper contexts extracted from external sources. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Δ-Nets: Interaction-Based System for Optimal Parallel λ-Reduction | Daniel Augusto Rizzi Salvadori | 2025-05-22 | Download | I present a model of universal parallel computation called Δ-Nets, and a method to translate λ-terms into Δ-nets and back. Together, the model and the method constitute an algorithm for optimal ... |
| LogStamping: A blockchain-based log auditing approach for large-scale systems | Md Shariful Islam, M. Sohel Rahman | 2025-05-22 | Download | Log management is crucial for ensuring the security, integrity, and compliance of modern information systems. Traditional log management solutions face challenges in achieving tamper-proofing, scalabi... |
| Navigating the Edge-Cloud Continuum: A State-of-Practice Survey | Loris Belcastro, Fabrizio Marozzo, Alessio Orsino, Domenico Talia, Paolo Trunfio | 2025-05-22 | Download | The edge-cloud continuum has emerged as a transformative paradigm that meets the growing demand for low-latency, scalable, end-to-end service delivery by integrating decentralized edge resources with ... |
| Edge-First Language Model Inference: Models, Metrics, and Tradeoffs | SiYoung Jang, Roberto Morabito | 2025-05-22 | Download | The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge. |
| Recursive Offloading for LLM Serving in Multi-tier Networks | Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Jinda Lu, Zheming Yang, Tian Wen | 2025-05-22 | Download | Heterogeneous device-edge-cloud computing infrastructures have become widely adopted in telecommunication operators and Wide Area Networks (WANs), offering multi-tier computational support for emergin... |
| Smaller, Smarter, Closer: The Edge of Collaborative Generative AI | Roberto Morabito, SiYoung Jang | 2025-05-22 | Download | The rapid adoption of generative AI (GenAI), particularly Large Language Models (LLMs), has exposed critical limitations of cloud-centric deployments, including latency, cost, and privacy concerns. |
| Minimizing Energy in Reliability and Deadline-Ensured Workflow Scheduling in Cloud | Suvarthi Sarkar, Dhanesh V, Ketan Singh, Aryabartta Sahu | 2025-05-22 | Download | With the increasing prevalence of computationally intensive workflows in cloud environments, it has become crucial for cloud platforms to optimize energy consumption while ensuring the feasibility of ... |
| Redox: Improving I/O Efficiency of Model Training Through File Redirection | Yuhao Li, Xuanhua Shi, Yunfei Zhao, Yongluan Zhou, Yusheng Hua, Xuehai Qian | 2025-05-22 | Download | This paper proposes Redox, a training data management system designed to achieve high I/O efficiency. The key insight is a new observation of file redirection: for model training, when training data i... |
| On the Runtime of Local Mutual Exclusion for Anonymous Dynamic Networks | Anya Chaturvedi, Joshua J. Daymude, Andréa W. Richa | 2025-05-22 | Download | Algorithms for mutual exclusion aim to isolate potentially concurrent accesses to the same shared resources. Motivated by distributed computing research on programmable matter and population protocols... |
| Multimodal Online Federated Learning with Modality Missing in Internet of Things | Heqiang Wang, Xiang Liu, Xiaoxiong Zhong, Lixing Chen, Fangming Liu, Weizhe Zhang | 2025-05-22 | Download | The Internet of Things (IoT) ecosystem generates vast amounts of multimodal data from heterogeneous sources such as sensors, cameras, and microphones. |
| Towards Stream-Based Monitoring for EVM Networks | Emanuel Onica, Claudiu-Nicu Bărbieru, Andrei Arusoaie, Oana-Otilia Captarencu, Ciprian Amariei | 2025-05-22 | Download | We believe that leveraging real-time blockchain operational data is of particular interest in the context of the current rapid expansion of rollup networks in the Ethereum ecosystem. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| PREAMBLE and IMRECEIVING for Improved Large Message Handling in libp2p GossipSub | Muhammad Umar Farooq, Daniel Kaiser | 2025-05-22 | Download | Large message transmissions in libp2p GossipSub lead to longer than expected network-wide message dissemination times and very high bandwidth utilization. |
| LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols | Ziming Liu, Bryan Liu, Alvaro Valcarce, Xiaoli Chu | 2025-05-22 | Download | Integrating Large AI Models (LAMs) into 6G mobile networks is a key enabler of the AI-Native Air Interface (AI-AI), where protocol intelligence must scale beyond handcrafted logic. |
| Navigating the Edge-Cloud Continuum: A State-of-Practice Survey | Loris Belcastro, Fabrizio Marozzo, Alessio Orsino, Domenico Talia, Paolo Trunfio | 2025-05-22 | Download | The edge-cloud continuum has emerged as a transformative paradigm that meets the growing demand for low-latency, scalable, end-to-end service delivery by integrating decentralized edge resources with ... |
| SONIC: Cost-Effective Web Access for Developing Countries | Ayush Pandey, Rohail Asim, Jean Louis K. E. Fendji, Talal Rahwan, Matteo Varvello, Yasir Zaki | 2025-05-22 | Download | Over 2.6 billion people remain without access to the Internet in 2025. This phenomenon is especially pronounced in developing regions, where cost and infrastructure limitations are major barriers to c... |
| Edge-First Language Model Inference: Models, Metrics, and Tradeoffs | SiYoung Jang, Roberto Morabito | 2025-05-22 | Download | The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge. |
| Recursive Offloading for LLM Serving in Multi-tier Networks | Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Jinda Lu, Zheming Yang, Tian Wen | 2025-05-22 | Download | Heterogeneous device-edge-cloud computing infrastructures have become widely adopted in telecommunication operators and Wide Area Networks (WANs), offering multi-tier computational support for emergin... |
| Smaller, Smarter, Closer: The Edge of Collaborative Generative AI | Roberto Morabito, SiYoung Jang | 2025-05-22 | Download | The rapid adoption of generative AI (GenAI), particularly Large Language Models (LLMs), has exposed critical limitations of cloud-centric deployments, including latency, cost, and privacy concerns. |
| Graph Attention Network for Optimal User Association in Wireless Networks | Javad Mirzaei, Jeebak Mitra, Gwenael Poitau | 2025-05-22 | Download | With increased 5G deployments, network densification is higher than ever to support the exponentially high throughput requirements. However, this has meant a significant increase in energy consumption... |
| Resilient LLM-Empowered Semantic MAC Protocols via Zero-Shot Adaptation and Knowledge Distillation | Yongjun Kim, Jihong Park, Mehdi Bennis, Junil Choi | 2025-05-22 | Download | Neural network-based medium access control (MAC) protocol models (NPMs) improve goodput through site-specific operations but are vulnerable to shifts from their training network environments, such as ... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Edge-First Language Model Inference: Models, Metrics, and Tradeoffs | SiYoung Jang, Roberto Morabito | 2025-05-22 | Download | The widespread adoption of Language Models (LMs) across industries is driving interest in deploying these services across the computing continuum, from the cloud to the network edge. |
| Performance of Confidential Computing GPUs | Antonio Martínez Ibarra, Julian James Stephen, Aurora González Vidal, K. R. Jayaram, Antonio Fernando Skarmeta Gómez | 2025-05-22 | Download | This work examines latency, throughput, and other metrics when performing inference on confidential GPUs. We explore different traffic patterns and scheduling strategies using a single Virtual Machine... |
| Is Quantum Optimization Ready? An Effort Towards Neural Network Compression using Adiabatic Quantum Computing | Zhehui Wang, Benjamin Chen Ming Choong, Tian Huang, Daniel Gerlinghoff, Rick Siow Mong Goh, Cheng Liu, Tao Luo | 2025-05-22 | Download | Quantum optimization is the most mature quantum computing technology to date, providing a promising approach towards efficiently solving complex combinatorial problems. |
| Towards Stream-Based Monitoring for EVM Networks | Emanuel Onica, Claudiu-Nicu Bărbieru, Andrei Arusoaie, Oana-Otilia Captarencu, Ciprian Amariei | 2025-05-22 | Download | We believe that leveraging real-time blockchain operational data is of particular interest in the context of the current rapid expansion of rollup networks in the Ethereum ecosystem. |