2024-05-23

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Analog or Digital In-memory Computing? Benchmarking through Quantitative Modeling	Jiacong Sun, Pouya Houshmand, Marian Verhelst	2024-05-23	下载	In-Memory Computing (IMC) has emerged as a promising paradigm for energy-efficient, throughput-efficient and area-efficient machine learning at the edge.
Exploring and Evaluating Real-world CXL: Use Cases and System Adoption	Xi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li	2024-05-23	下载	Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Fast Transaction Scheduling in Blockchain Sharding	Ramesh Adhikari, Costas Busch, Miroslav Popovic	2024-05-23	下载	Sharding is a promising technique for addressing the scalability issues of blockchain, and this technique is especially important for IoT, edge, or mobile computing.
Dynamically Sharded Ledgers on a Distributed Hash Table	Christoffer Fink, Olov Schelén, Ulf Bodin	2024-05-23	下载	Distributed ledger technology such as blockchain is considered essential for supporting large numbers of micro-transactions in the Machine Economy, which is envisioned to involve billions of connected...
Recurrent Early Exits for Federated Learning with Heterogeneous Clients	Royson Lee, Javier Fernandez-Marques, Shell Xu Hu, Da Li, Stefanos Laskaridis, Łukasz Dudziak, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane	2024-05-23	下载	Federated learning (FL) has enabled distributed learning of a model across multiple clients in a privacy-preserving manner. One of the main challenges of FL is to accommodate clients with varying hard...
The integration of heterogeneous resources in the CMS Submission Infrastructure for the LHC Run 3 and beyond	Antonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem	2024-05-23	下载	While the computing landscape supporting LHC experiments is currently dominated by x86 processors at WLCG sites, this configuration will evolve in the coming years.
Adoption of a token-based authentication model for the CMS Submission Infrastructure	Antonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem, Frank Wurthwein	2024-05-23	下载	The CMS Submission Infrastructure (SI) is the main computing resource provisioning system for CMS workloads. A number of HTCondor pools are employed to manage this infrastructure, which aggregates geo...
GPU Implementations for Midsize Integer Addition and Multiplication	Cosmin E. Oancea, Stephen M. Watt	2024-05-23	下载	This paper explores practical aspects of using a high-level functional language for GPU-based arithmetic on ``midsize'' integers. By this we mean integers of up to about a quarter million bits, which ...
Repurposing of the Run 2 CMS High Level Trigger Infrastructure as a Cloud Resource for Offline Computing	Marco Mascheroni, Antonio Perez-Calero Yzquierdo, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem, Damiele Spiga, Christoph Wissing, Frank Wurthwein	2024-05-23	下载	The former CMS Run 2 High Level Trigger (HLT) farm is one of the largest contributors to CMS compute resources, providing about 25k job slots for offline computing.
PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services	Zheming Yang, Yuanhao Yang, Chang Zhao, Qi Guo, Wenkai He, Wen Ji	2024-05-23	下载	With the rapid growth in the number of large language model (LLM) users, it is difficult for bandwidth-constrained cloud servers to simultaneously process massive LLM services in real-time.
HPC resources for CMS offline computing: An integration and scalability challenge for the Submission Infrastructure	Antonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem	2024-05-23	下载	The computing resource needs of LHC experiments are expected to continue growing significantly during the Run 3 and into the HL-LHC era. The landscape of available resources will also evolve, as High ...
DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version]	Baotong Lu, Kaisong Huang, Chieh-Jan Mike Liang, Tianzheng Wang, Eric Lo	2024-05-23	下载	Memory disaggregation can potentially allow memory-optimized range indexes such as B+-trees to scale beyond one machine while attaining high hardware utilization and low cost.
Worldwide Federated Training of Language Models	Alex Iacob, Lorenzo Sani, Bill Marino, Preslav Aleksandrov, William F. Shen, Nicholas Donald Lane	2024-05-23	下载	The reliance of language model training on massive amounts of computation and vast datasets scraped from potentially low-quality, copyrighted, or sensitive data has come into question practically, leg...
GeoFaaS: An Edge-to-Cloud FaaS Platform	Mohammadreza Malekabbasi, Tobias Pfandzelter, Trever Schirmer, David Bermbach	2024-05-23	下载	The massive growth of mobile and IoT devices demands geographically distributed computing systems for optimal performance, privacy, and scalability.
EdgeShard: Efficient LLM Inference via Collaborative Edge Computing	Mingjin Zhang, Jiannong Cao, Xiaoming Shen, Zeyang Cui	2024-05-23	下载	Large language models (LLMs) have shown great potential in natural language processing and content generation. However, current LLMs heavily rely on cloud computing, leading to prolonged latency, high...
Variational Bayes for Federated Continual Learning	Dezhong Yao, Sanmu Li, Yutong Dai, Zhiqiang Xu, Shengshan Hu, Peilin Zhao, Lichao Sun	2024-05-23	下载	Federated continual learning (FCL) has received increasing attention due to its potential in handling real-world streaming data, characterized by evolving data distributions and varying client classes...
Exploring and Evaluating Real-world CXL: Use Cases and System Adoption	Xi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li	2024-05-23	下载	Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar...
Application of cloud computing platform in industrial big data processing	Ziyan Yao	2024-05-23	下载	With the rapid growth and increasing complexity of industrial big data, traditional data processing methods are facing many challenges. This article takes an in-depth look at the application of cloud ...
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference	Nadav Timor, Jonathan Mamou, Daniel Korat, Moshe Berchansky, Oren Pereg, Moshe Wasserblat, Tomer Galanti, Michal Gordon, David Harel	2024-05-23	下载	This paper introduces distributed speculative inference (DSI), a novel inference algorithm that is provably faster than speculative inference (SI) [leviathan2023, chen2023, miao2024, sun2025, timor202...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
PandORA: Automated Design and Comprehensive Evaluation of Deep Reinforcement Learning Agents for Open RAN	Maria Tsampazi, Salvatore D'Oro, Michele Polese, Leonardo Bonati, Gwenael Poitau, Michael Healy, Mohammad Alavirad, Tommaso Melodia	2024-05-23	下载	The highly heterogeneous ecosystem of NextG wireless communication systems calls for novel networking paradigms where functionalities and operations can be dynamically and optimally reconfigured in re...
Surveilling the Masses with Wi-Fi-Based Positioning Systems	Erik Rye, Dave Levin	2024-05-23	下载	Wi-Fi-based Positioning Systems (WPSes) are used by modern mobile devices to learn their position using nearby Wi-Fi access points as landmarks.
P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF	Osama Bajaber, Bo Ji, Peng Gao	2024-05-23	下载	Modern targeted attacks such as Advanced Persistent Threats use multiple hosts as stepping stones and move laterally across them to gain deeper access to the network.
A Duty-Cycle-Efficient Synchronization Protocol for Slotted-Aloha in LoRaWAN	Amavi Dossa, El Mehdi Amhoud	2024-05-23	下载	In the current context of massive IoT, the Pure-Aloha scheme used in LoRaWAN is reaching its limit, and Slotted-Aloha is being considered as an alternative, as it offers twice Pure-Aloha's packet succ...
PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services	Zheming Yang, Yuanhao Yang, Chang Zhao, Qi Guo, Wenkai He, Wen Ji	2024-05-23	下载	With the rapid growth in the number of large language model (LLM) users, it is difficult for bandwidth-constrained cloud servers to simultaneously process massive LLM services in real-time.
QoE-Aware and Secure UAV-Aided Rate-Splitting Multiple Access Based Communications	Abuzar B. M. Adam, Xiaoyu Wan, Mohammed Saleh Ali Muthanna	2024-05-23	下载	In this work, we address the issue of quality of experience (QoE) in unmanned aerial vehicle (UAV) aided multiuser rate-splitting multiple access (RSMA) networks under secrecy constraints.
Enhancing Critical Infrastructure Cybersecurity: Collaborative DNN Synthesis in the Cloud Continuum	Lav Gupta, Guoxing Yao	2024-05-23	下载	Researchers are exploring the integration of IoT and the cloud continuum, together with AI to enhance the cost-effectiveness and efficiency of critical infrastructure (CI) systems.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
PipeFusion: Patch-level Pipeline Parallelism for Diffusion Transformers Inference	Jiarui Fang, Jinzhe Pan, Aoyu Li, Xibo Sun, Jiannan Wang	2024-05-23	下载	This paper presents PipeFusion, an innovative parallel methodology to tackle the high latency issues associated with generating high-resolution images using diffusion transformers (DiTs) models.
Exploring and Evaluating Real-world CXL: Use Cases and System Adoption	Xi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li	2024-05-23	下载	Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar...
A Structure-Aware Framework for Learning Device Placements on Computation Graphs	Shukai Duan, Heng Ping, Nikos Kanakaris, Xiongye Xiao, Panagiotis Kyriakis, Nesreen K. Ahmed, Peiyu Zhang, Guixiang Ma, Mihai Capota, Shahin Nazarian, Theodore L. Willke, Paul Bogdan	2024-05-23	下载	Computation graphs are Directed Acyclic Graphs (DAGs) where the nodes correspond to mathematical operations and are used widely as abstractions in optimizations of neural networks.