Skip to content

2024-05-23

cs.AR - Architecture

标题作者发布日期PDF摘要
Analog or Digital In-memory Computing? Benchmarking through Quantitative ModelingJiacong Sun, Pouya Houshmand, Marian Verhelst2024-05-23下载In-Memory Computing (IMC) has emerged as a promising paradigm for energy-efficient, throughput-efficient and area-efficient machine learning at the edge.
Exploring and Evaluating Real-world CXL: Use Cases and System AdoptionXi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li2024-05-23下载Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Fast Transaction Scheduling in Blockchain ShardingRamesh Adhikari, Costas Busch, Miroslav Popovic2024-05-23下载Sharding is a promising technique for addressing the scalability issues of blockchain, and this technique is especially important for IoT, edge, or mobile computing.
Dynamically Sharded Ledgers on a Distributed Hash TableChristoffer Fink, Olov Schelén, Ulf Bodin2024-05-23下载Distributed ledger technology such as blockchain is considered essential for supporting large numbers of micro-transactions in the Machine Economy, which is envisioned to involve billions of connected...
Recurrent Early Exits for Federated Learning with Heterogeneous ClientsRoyson Lee, Javier Fernandez-Marques, Shell Xu Hu, Da Li, Stefanos Laskaridis, Łukasz Dudziak, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane2024-05-23下载Federated learning (FL) has enabled distributed learning of a model across multiple clients in a privacy-preserving manner. One of the main challenges of FL is to accommodate clients with varying hard...
The integration of heterogeneous resources in the CMS Submission Infrastructure for the LHC Run 3 and beyondAntonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem2024-05-23下载While the computing landscape supporting LHC experiments is currently dominated by x86 processors at WLCG sites, this configuration will evolve in the coming years.
Adoption of a token-based authentication model for the CMS Submission InfrastructureAntonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem, Frank Wurthwein2024-05-23下载The CMS Submission Infrastructure (SI) is the main computing resource provisioning system for CMS workloads. A number of HTCondor pools are employed to manage this infrastructure, which aggregates geo...
GPU Implementations for Midsize Integer Addition and MultiplicationCosmin E. Oancea, Stephen M. Watt2024-05-23下载This paper explores practical aspects of using a high-level functional language for GPU-based arithmetic on ``midsize'' integers. By this we mean integers of up to about a quarter million bits, which ...
Repurposing of the Run 2 CMS High Level Trigger Infrastructure as a Cloud Resource for Offline ComputingMarco Mascheroni, Antonio Perez-Calero Yzquierdo, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem, Damiele Spiga, Christoph Wissing, Frank Wurthwein2024-05-23下载The former CMS Run 2 High Level Trigger (HLT) farm is one of the largest contributors to CMS compute resources, providing about 25k job slots for offline computing.
PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM ServicesZheming Yang, Yuanhao Yang, Chang Zhao, Qi Guo, Wenkai He, Wen Ji2024-05-23下载With the rapid growth in the number of large language model (LLM) users, it is difficult for bandwidth-constrained cloud servers to simultaneously process massive LLM services in real-time.
HPC resources for CMS offline computing: An integration and scalability challenge for the Submission InfrastructureAntonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem2024-05-23下载The computing resource needs of LHC experiments are expected to continue growing significantly during the Run 3 and into the HL-LHC era. The landscape of available resources will also evolve, as High ...
DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version]Baotong Lu, Kaisong Huang, Chieh-Jan Mike Liang, Tianzheng Wang, Eric Lo2024-05-23下载Memory disaggregation can potentially allow memory-optimized range indexes such as B+-trees to scale beyond one machine while attaining high hardware utilization and low cost.
Worldwide Federated Training of Language ModelsAlex Iacob, Lorenzo Sani, Bill Marino, Preslav Aleksandrov, William F. Shen, Nicholas Donald Lane2024-05-23下载The reliance of language model training on massive amounts of computation and vast datasets scraped from potentially low-quality, copyrighted, or sensitive data has come into question practically, leg...
GeoFaaS: An Edge-to-Cloud FaaS PlatformMohammadreza Malekabbasi, Tobias Pfandzelter, Trever Schirmer, David Bermbach2024-05-23下载The massive growth of mobile and IoT devices demands geographically distributed computing systems for optimal performance, privacy, and scalability.
EdgeShard: Efficient LLM Inference via Collaborative Edge ComputingMingjin Zhang, Jiannong Cao, Xiaoming Shen, Zeyang Cui2024-05-23下载Large language models (LLMs) have shown great potential in natural language processing and content generation. However, current LLMs heavily rely on cloud computing, leading to prolonged latency, high...
Variational Bayes for Federated Continual LearningDezhong Yao, Sanmu Li, Yutong Dai, Zhiqiang Xu, Shengshan Hu, Peilin Zhao, Lichao Sun2024-05-23下载Federated continual learning (FCL) has received increasing attention due to its potential in handling real-world streaming data, characterized by evolving data distributions and varying client classes...
Exploring and Evaluating Real-world CXL: Use Cases and System AdoptionXi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li2024-05-23下载Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar...
Application of cloud computing platform in industrial big data processingZiyan Yao2024-05-23下载With the rapid growth and increasing complexity of industrial big data, traditional data processing methods are facing many challenges. This article takes an in-depth look at the application of cloud ...
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model InferenceNadav Timor, Jonathan Mamou, Daniel Korat, Moshe Berchansky, Oren Pereg, Moshe Wasserblat, Tomer Galanti, Michal Gordon, David Harel2024-05-23下载This paper introduces distributed speculative inference (DSI), a novel inference algorithm that is provably faster than speculative inference (SI) [leviathan2023, chen2023, miao2024, sun2025, timor202...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
PandORA: Automated Design and Comprehensive Evaluation of Deep Reinforcement Learning Agents for Open RANMaria Tsampazi, Salvatore D'Oro, Michele Polese, Leonardo Bonati, Gwenael Poitau, Michael Healy, Mohammad Alavirad, Tommaso Melodia2024-05-23下载The highly heterogeneous ecosystem of NextG wireless communication systems calls for novel networking paradigms where functionalities and operations can be dynamically and optimally reconfigured in re...
Surveilling the Masses with Wi-Fi-Based Positioning SystemsErik Rye, Dave Levin2024-05-23下载Wi-Fi-based Positioning Systems (WPSes) are used by modern mobile devices to learn their position using nearby Wi-Fi access points as landmarks.
P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPFOsama Bajaber, Bo Ji, Peng Gao2024-05-23下载Modern targeted attacks such as Advanced Persistent Threats use multiple hosts as stepping stones and move laterally across them to gain deeper access to the network.
A Duty-Cycle-Efficient Synchronization Protocol for Slotted-Aloha in LoRaWANAmavi Dossa, El Mehdi Amhoud2024-05-23下载In the current context of massive IoT, the Pure-Aloha scheme used in LoRaWAN is reaching its limit, and Slotted-Aloha is being considered as an alternative, as it offers twice Pure-Aloha's packet succ...
PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM ServicesZheming Yang, Yuanhao Yang, Chang Zhao, Qi Guo, Wenkai He, Wen Ji2024-05-23下载With the rapid growth in the number of large language model (LLM) users, it is difficult for bandwidth-constrained cloud servers to simultaneously process massive LLM services in real-time.
QoE-Aware and Secure UAV-Aided Rate-Splitting Multiple Access Based CommunicationsAbuzar B. M. Adam, Xiaoyu Wan, Mohammed Saleh Ali Muthanna2024-05-23下载In this work, we address the issue of quality of experience (QoE) in unmanned aerial vehicle (UAV) aided multiuser rate-splitting multiple access (RSMA) networks under secrecy constraints.
Enhancing Critical Infrastructure Cybersecurity: Collaborative DNN Synthesis in the Cloud ContinuumLav Gupta, Guoxing Yao2024-05-23下载Researchers are exploring the integration of IoT and the cloud continuum, together with AI to enhance the cost-effectiveness and efficiency of critical infrastructure (CI) systems.

cs.PF - Performance

标题作者发布日期PDF摘要
PipeFusion: Patch-level Pipeline Parallelism for Diffusion Transformers InferenceJiarui Fang, Jinzhe Pan, Aoyu Li, Xibo Sun, Jiannan Wang2024-05-23下载This paper presents PipeFusion, an innovative parallel methodology to tackle the high latency issues associated with generating high-resolution images using diffusion transformers (DiTs) models.
Exploring and Evaluating Real-world CXL: Use Cases and System AdoptionXi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li2024-05-23下载Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar...
A Structure-Aware Framework for Learning Device Placements on Computation GraphsShukai Duan, Heng Ping, Nikos Kanakaris, Xiongye Xiao, Panagiotis Kyriakis, Nesreen K. Ahmed, Peiyu Zhang, Guixiang Ma, Mihai Capota, Shahin Nazarian, Theodore L. Willke, Paul Bogdan2024-05-23下载Computation graphs are Directed Acyclic Graphs (DAGs) where the nodes correspond to mathematical operations and are used widely as abstractions in optimizations of neural networks.

基于 VitePress 构建