2024-05-23
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Analog or Digital In-memory Computing? Benchmarking through Quantitative Modeling | Jiacong Sun, Pouya Houshmand, Marian Verhelst | 2024-05-23 | Download | In-Memory Computing (IMC) has emerged as a promising paradigm for energy-efficient, throughput-efficient and area-efficient machine learning at the edge. |
| Exploring and Evaluating Real-world CXL: Use Cases and System Adoption | Xi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li | 2024-05-23 | Download | Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Fast Transaction Scheduling in Blockchain Sharding | Ramesh Adhikari, Costas Busch, Miroslav Popovic | 2024-05-23 | Download | Sharding is a promising technique for addressing the scalability issues of blockchain, and this technique is especially important for IoT, edge, or mobile computing. |
| Dynamically Sharded Ledgers on a Distributed Hash Table | Christoffer Fink, Olov Schelén, Ulf Bodin | 2024-05-23 | Download | Distributed ledger technology such as blockchain is considered essential for supporting large numbers of micro-transactions in the Machine Economy, which is envisioned to involve billions of connected... |
| Recurrent Early Exits for Federated Learning with Heterogeneous Clients | Royson Lee, Javier Fernandez-Marques, Shell Xu Hu, Da Li, Stefanos Laskaridis, Łukasz Dudziak, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane | 2024-05-23 | Download | Federated learning (FL) has enabled distributed learning of a model across multiple clients in a privacy-preserving manner. One of the main challenges of FL is to accommodate clients with varying hard... |
| The integration of heterogeneous resources in the CMS Submission Infrastructure for the LHC Run 3 and beyond | Antonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem | 2024-05-23 | Download | While the computing landscape supporting LHC experiments is currently dominated by x86 processors at WLCG sites, this configuration will evolve in the coming years. |
| Adoption of a token-based authentication model for the CMS Submission Infrastructure | Antonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem, Frank Wurthwein | 2024-05-23 | Download | The CMS Submission Infrastructure (SI) is the main computing resource provisioning system for CMS workloads. A number of HTCondor pools are employed to manage this infrastructure, which aggregates geo... |
| GPU Implementations for Midsize Integer Addition and Multiplication | Cosmin E. Oancea, Stephen M. Watt | 2024-05-23 | Download | This paper explores practical aspects of using a high-level functional language for GPU-based arithmetic on "midsize" integers. By this we mean integers of up to about a quarter million bits, which ... |
| Repurposing of the Run 2 CMS High Level Trigger Infrastructure as a Cloud Resource for Offline Computing | Marco Mascheroni, Antonio Perez-Calero Yzquierdo, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem, Daniele Spiga, Christoph Wissing, Frank Wurthwein | 2024-05-23 | Download | The former CMS Run 2 High Level Trigger (HLT) farm is one of the largest contributors to CMS compute resources, providing about 25k job slots for offline computing. |
| PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services | Zheming Yang, Yuanhao Yang, Chang Zhao, Qi Guo, Wenkai He, Wen Ji | 2024-05-23 | Download | With the rapid growth in the number of large language model (LLM) users, it is difficult for bandwidth-constrained cloud servers to simultaneously process massive LLM services in real-time. |
| HPC resources for CMS offline computing: An integration and scalability challenge for the Submission Infrastructure | Antonio Perez-Calero Yzquierdo, Marco Mascheroni, Edita Kizinevic, Farrukh Aftab Khan, Hyunwoo Kim, Maria Acosta Flechas, Nikos Tsipinakis, Saqib Haleem | 2024-05-23 | Download | The computing resource needs of LHC experiments are expected to continue growing significantly during the Run 3 and into the HL-LHC era. The landscape of available resources will also evolve, as High ... |
| DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version] | Baotong Lu, Kaisong Huang, Chieh-Jan Mike Liang, Tianzheng Wang, Eric Lo | 2024-05-23 | Download | Memory disaggregation can potentially allow memory-optimized range indexes such as B+-trees to scale beyond one machine while attaining high hardware utilization and low cost. |
| Worldwide Federated Training of Language Models | Alex Iacob, Lorenzo Sani, Bill Marino, Preslav Aleksandrov, William F. Shen, Nicholas Donald Lane | 2024-05-23 | Download | The reliance of language model training on massive amounts of computation and vast datasets scraped from potentially low-quality, copyrighted, or sensitive data has come into question practically, leg... |
| GeoFaaS: An Edge-to-Cloud FaaS Platform | Mohammadreza Malekabbasi, Tobias Pfandzelter, Trever Schirmer, David Bermbach | 2024-05-23 | Download | The massive growth of mobile and IoT devices demands geographically distributed computing systems for optimal performance, privacy, and scalability. |
| EdgeShard: Efficient LLM Inference via Collaborative Edge Computing | Mingjin Zhang, Jiannong Cao, Xiaoming Shen, Zeyang Cui | 2024-05-23 | Download | Large language models (LLMs) have shown great potential in natural language processing and content generation. However, current LLMs heavily rely on cloud computing, leading to prolonged latency, high... |
| Variational Bayes for Federated Continual Learning | Dezhong Yao, Sanmu Li, Yutong Dai, Zhiqiang Xu, Shengshan Hu, Peilin Zhao, Lichao Sun | 2024-05-23 | Download | Federated continual learning (FCL) has received increasing attention due to its potential in handling real-world streaming data, characterized by evolving data distributions and varying client classes... |
| Exploring and Evaluating Real-world CXL: Use Cases and System Adoption | Xi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li | 2024-05-23 | Download | Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar... |
| Application of cloud computing platform in industrial big data processing | Ziyan Yao | 2024-05-23 | Download | With the rapid growth and increasing complexity of industrial big data, traditional data processing methods are facing many challenges. This article takes an in-depth look at the application of cloud ... |
| Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference | Nadav Timor, Jonathan Mamou, Daniel Korat, Moshe Berchansky, Oren Pereg, Moshe Wasserblat, Tomer Galanti, Michal Gordon, David Harel | 2024-05-23 | Download | This paper introduces distributed speculative inference (DSI), a novel inference algorithm that is provably faster than speculative inference (SI) [leviathan2023, chen2023, miao2024, sun2025, timor202... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| PandORA: Automated Design and Comprehensive Evaluation of Deep Reinforcement Learning Agents for Open RAN | Maria Tsampazi, Salvatore D'Oro, Michele Polese, Leonardo Bonati, Gwenael Poitau, Michael Healy, Mohammad Alavirad, Tommaso Melodia | 2024-05-23 | Download | The highly heterogeneous ecosystem of NextG wireless communication systems calls for novel networking paradigms where functionalities and operations can be dynamically and optimally reconfigured in re... |
| Surveilling the Masses with Wi-Fi-Based Positioning Systems | Erik Rye, Dave Levin | 2024-05-23 | Download | Wi-Fi-based Positioning Systems (WPSes) are used by modern mobile devices to learn their position using nearby Wi-Fi access points as landmarks. |
| P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF | Osama Bajaber, Bo Ji, Peng Gao | 2024-05-23 | Download | Modern targeted attacks such as Advanced Persistent Threats use multiple hosts as stepping stones and move laterally across them to gain deeper access to the network. |
| A Duty-Cycle-Efficient Synchronization Protocol for Slotted-Aloha in LoRaWAN | Amavi Dossa, El Mehdi Amhoud | 2024-05-23 | Download | In the current context of massive IoT, the Pure-Aloha scheme used in LoRaWAN is reaching its limit, and Slotted-Aloha is being considered as an alternative, as it offers twice Pure-Aloha's packet succ... |
| PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services | Zheming Yang, Yuanhao Yang, Chang Zhao, Qi Guo, Wenkai He, Wen Ji | 2024-05-23 | Download | With the rapid growth in the number of large language model (LLM) users, it is difficult for bandwidth-constrained cloud servers to simultaneously process massive LLM services in real-time. |
| QoE-Aware and Secure UAV-Aided Rate-Splitting Multiple Access Based Communications | Abuzar B. M. Adam, Xiaoyu Wan, Mohammed Saleh Ali Muthanna | 2024-05-23 | Download | In this work, we address the issue of quality of experience (QoE) in unmanned aerial vehicle (UAV) aided multiuser rate-splitting multiple access (RSMA) networks under secrecy constraints. |
| Enhancing Critical Infrastructure Cybersecurity: Collaborative DNN Synthesis in the Cloud Continuum | Lav Gupta, Guoxing Yao | 2024-05-23 | Download | Researchers are exploring the integration of IoT and the cloud continuum, together with AI to enhance the cost-effectiveness and efficiency of critical infrastructure (CI) systems. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| PipeFusion: Patch-level Pipeline Parallelism for Diffusion Transformers Inference | Jiarui Fang, Jinzhe Pan, Aoyu Li, Xibo Sun, Jiannan Wang | 2024-05-23 | Download | This paper presents PipeFusion, an innovative parallel methodology to tackle the high latency issues associated with generating high-resolution images using diffusion transformers (DiTs) models. |
| Exploring and Evaluating Real-world CXL: Use Cases and System Adoption | Xi Wang, Jie Liu, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li | 2024-05-23 | Download | Compute eXpress Link (CXL) is emerging as a promising memory interface technology. However, its performance characteristics remain largely unclear due to the limited availability of production hardwar... |
| A Structure-Aware Framework for Learning Device Placements on Computation Graphs | Shukai Duan, Heng Ping, Nikos Kanakaris, Xiongye Xiao, Panagiotis Kyriakis, Nesreen K. Ahmed, Peiyu Zhang, Guixiang Ma, Mihai Capota, Shahin Nazarian, Theodore L. Willke, Paul Bogdan | 2024-05-23 | Download | Computation graphs are Directed Acyclic Graphs (DAGs) where the nodes correspond to mathematical operations and are used widely as abstractions in optimizations of neural networks. |