Skip to content

2025-02-16

cs.AR - Architecture

标题作者发布日期PDF摘要
JExplore: Design Space Exploration Tool for Nvidia Jetson BoardsBasar Kutukcu, Sinan Xie, Sabur Baidya, Sujit Dey2025-02-16下载Nvidia Jetson boards are powerful systems for executing artificial intelligence workloads in edge and mobile environments due to their effective GPU hardware and widely supported software stack.
Unveiling Environmental Impacts of Large Language Model Serving: A Functional Unit ViewYanran Wu, Inez Hua, Yi Ding2025-02-16下载Large language models (LLMs) offer powerful capabilities but come with significant environmental impact, particularly in carbon emissions. Existing studies benchmark carbon emissions but lack a standa...
Enabling Efficient Transaction Processing on CXL-Based Memory SharingZhao Wang, Yiqi Chen, Cong Li, Dimin Niu, Tianchan Guan, Zhaoyang Du, Xingda Wei, Guangyu Sun2025-02-16下载Transaction processing systems are the crux for modern data-center applications, yet current multi-node systems are slow due to network overheads.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Grassroots Platforms with Atomic Transactions: Social Networks, Cryptocurrencies, and Democratic FederationsEhud Shapiro2025-02-16下载Grassroots platforms aim to offer an egalitarian alternative to global platforms. Whereas global platforms can have only a single instance, grassroots platforms can have multiple instances that emerge...
JExplore: Design Space Exploration Tool for Nvidia Jetson BoardsBasar Kutukcu, Sinan Xie, Sabur Baidya, Sujit Dey2025-02-16下载Nvidia Jetson boards are powerful systems for executing artificial intelligence workloads in edge and mobile environments due to their effective GPU hardware and widely supported software stack.
Combining GPU and CPU for accelerating evolutionary computing workloadsRustam Eynaliyev, Houcen Liu2025-02-16下载Evolutionary computing (EC) has proven to be effective in solving complex optimization and robotics problems. Unfortunately, typical Evolutionary Algorithms (EAs) are constrained by the computational ...
DreamDDP: Accelerating Data Parallel Distributed LLM Training with Layer-wise Scheduled Partial SynchronizationZhenheng Tang, Zichen Tang, Junlin Huang, Xinglin Pan, Rudan Yan, Yuxin Wang, Amelie Chi Zhou, Shaohuai Shi, Xiaowen Chu, Bo Li2025-02-16下载The growth of large language models (LLMs) increases challenges of accelerating distributed training across multiple GPUs in different data centers.
Local-Cloud Inference Offloading for LLMs in Multi-Modal, Multi-Task, Multi-Dialogue SettingsLiangqi Yuan, Dong-Jun Han, Shiqiang Wang, Christopher G. Brinton2025-02-16下载Compared to traditional machine learning models, recent large language models (LLMs) can exhibit multi-task-solving capabilities through multiple dialogues and multi-modal data sources.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Serverless Edge Computing: A Taxonomy, Systematic Literature Review, Current Trends and Research ChallengesIqra Batool, Sania Kanwal2025-02-16下载In recent years, the rapid expansion of Internet of Things (IoT) nodes and devices has seamlessly integrated technology into everyday life, amplifying the demand for optimized computing solutions.
Grassroots Platforms with Atomic Transactions: Social Networks, Cryptocurrencies, and Democratic FederationsEhud Shapiro2025-02-16下载Grassroots platforms aim to offer an egalitarian alternative to global platforms. Whereas global platforms can have only a single instance, grassroots platforms can have multiple instances that emerge...
Integrating Language Models for Enhanced Network State Monitoring in DRL-Based SFC ProvisioningParisa Fard Moshiri, Murat Arda Onsu, Poonam Lohan, Burak Kantarci, Emil Janulewicz2025-02-16下载Efficient Service Function Chain (SFC) provisioning and Virtual Network Function (VNF) placement are critical for enhancing network performance in modern architectures such as Software-Defined Network...
Evaluating the Potential of Quantum Machine Learning in Cybersecurity: A Case-Study on PCA-based Intrusion Detection SystemsArmando Bellante, Tommaso Fioravanti, Michele Carminati, Stefano Zanero, Alessandro Luongo2025-02-16下载Quantum computing promises to revolutionize our understanding of the limits of computation, and its implications in cryptography have long been evident.
Leveraging Uncertainty Estimation for Efficient LLM RoutingTuo Zhang, Asal Mehradfar, Dimitrios Dimitriadis, Salman Avestimehr2025-02-16下载Deploying large language models (LLMs) in edge-cloud environments requires an efficient routing strategy to balance cost and response quality.

cs.PF - Performance

标题作者发布日期PDF摘要
Scalable Binary CUR Low-Rank Approximation AlgorithmBowen Su2025-02-16下载This paper proposes a scalable binary CUR low-rank approximation algorithm that leverages parallel selection of representative rows and columns within a deterministic framework.

基于 VitePress 构建