Appearance
2024-05-29
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Optimizing Foundation Model Inference on a Many-tiny-core Open-source RISC-V Platform | Viviane Potocnik, Luca Colagrande, Tim Fischer, Luca Bertaccini, Daniele Jahier Pagliari, Alessio Burrello, Luca Benini | 2024-05-29 | 下载 | Transformer-based foundation models have become crucial for various domains, most notably natural language processing (NLP) or computer vision (CV). |
| xTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems | Georg Rutishauser, Joan Mihali, Moritz Scherer, Luca Benini | 2024-05-29 | 下载 | Ternary neural networks (TNNs) offer a superior accuracy-energy trade-off compared to binary neural networks. However, until now, they have required specialized accelerators to realize their efficienc... |
| An Open-Source Framework for Efficient Numerically-Tailored Computations | Louis Ledoux, Marc Casas | 2024-05-29 | 下载 | We present a versatile open-source framework designed to facilitate efficient, numerically-tailored Matrix-Matrix Multiplications (MMMs). The framework offers two primary contributions: first, a fine-... |
| MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models | Taehyun Kim, Kwanseok Choi, Youngmock Cho, Jaehoon Cho, Hyuk-Jae Lee, Jaewoong Sim | 2024-05-29 | 下载 | Mixture-of-Experts (MoE) large language models (LLM) have memory requirements that often exceed the GPU memory capacity, requiring costly parameter movement from secondary memories to the GPU for expe... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution | Yechen Xu, Xinhao Kong, Tingjun Chen, Danyang Zhuo | 2024-05-29 | 下载 | The complexity of large language model (LLM) serving workloads has substantially increased due to the integration with external tool invocations, such as ChatGPT plugins. |
| Decentralized Optimization in Time-Varying Networks with Arbitrary Delays | Tomas Ortega, Hamid Jafarkhani | 2024-05-29 | 下载 | We consider a decentralized optimization problem for networks affected by communication delays. Examples of such networks include collaborative machine learning, sensor networks, and multi-agent syste... |
| Construction of a Byzantine Linearizable SWMR Atomic Register from SWSR Atomic Registers | Ajay D. Kshemkalyani, Manaswini Piduguralla, Sathya Peri, Anshuman Misra | 2024-05-29 | 下载 | The SWMR atomic register is a fundamental building block in shared memory distributed systems and implementing it from SWSR atomic registers is an important problem. |
| Optimizing Foundation Model Inference on a Many-tiny-core Open-source RISC-V Platform | Viviane Potocnik, Luca Colagrande, Tim Fischer, Luca Bertaccini, Daniele Jahier Pagliari, Alessio Burrello, Luca Benini | 2024-05-29 | 下载 | Transformer-based foundation models have become crucial for various domains, most notably natural language processing (NLP) or computer vision (CV). |
| Differentially Private Clustered Federated Learning | Saber Malekmohammadi, Afaf Taik, Golnoosh Farnadi | 2024-05-29 | 下载 | Federated learning (FL), which is a decentralized machine learning (ML) approach, often incorporates differential privacy (DP) to provide rigorous data privacy guarantees. |
| Hybrid-Parallel: Achieving High Performance and Energy Efficient Distributed Inference on Robots | Zekai Sun, Xiuxian Guan, Junming Wang, Haoze Song, Yuhao Qing, Tianxiang Shen, Dong Huang, Fangming Liu, Heming Cui | 2024-05-29 | 下载 | The rapid advancements in machine learning techniques have led to significant achievements in various real-world robotic tasks. These tasks heavily rely on fast and energy-efficient inference of deep ... |
| Accelerating Lattice QCD Simulations using GPUs | Tilmann Matthaei | 2024-05-29 | 下载 | Solving discretized versions of the Dirac equation represents a large share of execution time in lattice Quantum Chromodynamics (QCD) simulations. |
| LoByITFL: Low Communication Secure and Private Federated Learning | Yue Xia, Maximilian Egger, Christoph Hofmeister, Rawad Bitar | 2024-05-29 | 下载 | Privacy of the clients' data and security against Byzantine clients are key challenges in Federated Learning (FL). Existing solutions to joint privacy and security incur sacrifices on the privacy guar... |
| Multi-Source Coflow Scheduling in Collaborative Edge Computing with Multihop Network | Yuvraj Sahni, Jiannong Cao, Lei Yang, Shengwei Wang | 2024-05-29 | 下载 | Collaborative edge computing has become a popular paradigm where edge devices collaborate by sharing resources. Data dissemination is a fundamental problem in CEC to decide what data is transmitted fr... |
| Learning Interpretable Scheduling Algorithms for Data Processing Clusters | Zhibo Hu, Chen Wang, Helen, Paik, Yanfeng Shu, Liming Zhu | 2024-05-29 | 下载 | Workloads in data processing clusters are often represented in the form of DAG (Directed Acyclic Graph) jobs. Scheduling DAG jobs is challenging. |
| Federated Assemblies | Daniel Halpern, Ariel D. Procaccia, Ehud Shapiro, Nimrod Talmon | 2024-05-29 | 下载 | A citizens' assembly is a group of people who are randomly selected to represent a larger population in a deliberation. While this approach has successfully strengthened democracy, it has certain limi... |
| Optimization-based Proof of Useful Work: Framework, Modeling, and Security Analysis | Weihang Cao, Xintong Ling, Jiaheng Wang, Xiqi Gao, Zhi Ding | 2024-05-29 | 下载 | Proof of Work (PoW) has extensively served as the foundation of blockchain's security, consistency, and tamper-resistance. However, long has it been criticized for its tremendous and inefficient utili... |
| Federated Learning under Partially Class-Disjoint Data via Manifold Reshaping | Ziqing Fan, Jiangchao Yao, Ruipeng Zhang, Lingjuan Lyu, Ya Zhang, Yanfeng Wang | 2024-05-29 | 下载 | Statistical heterogeneity severely limits the performance of federated learning (FL), motivating several explorations e.g., FedProx, MOON and FedDyn, to alleviate this problem. |
| Federated Learning with Bilateral Curation for Partially Class-Disjoint Data | Ziqing Fan, Ruipeng Zhang, Jiangchao Yao, Bo Han, Ya Zhang, Yanfeng Wang | 2024-05-29 | 下载 | Partially class-disjoint data (PCDD), a common yet under-explored data formation where each client contributes a part of classes (instead of all classes) of samples, severely challenges the performanc... |
| Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization | Ziqing Fan, Shengchao Hu, Jiangchao Yao, Gang Niu, Ya Zhang, Masashi Sugiyama, Yanfeng Wang | 2024-05-29 | 下载 | In federated learning (FL), the multi-step update and data heterogeneity among clients often lead to a loss landscape with sharper minima, degenerating the performance of the resulted global model. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| RANFusion: A Comprehensive Tool for Simulating Handover In Next-G RAN | Seyed Bagher Hashemi Natanzi, Bo Tang | 2024-05-29 | 下载 | The rapid advancement of 5G networks and the upcoming transition to 6G necessitate the use of the Open Radio Access Network (O-RAN) architecture to enable greater flexibility, interoperability, and in... |
| Network Connectivity--Information Freshness Tradeoff in Information Dissemination Over Networks | Arunabh Srivastava, Sennur Ulukus | 2024-05-29 | 下载 | We consider a gossip network consisting of a source generating updates and nodes connected according to a given graph structure. The source keeps updates of a process, that might be generated or o... |
| EdgeSight: Enabling Modeless and Cost-Efficient Inference at the Edge | ChonLam Lao, Jiaqi Gao, Ganesh Ananthanarayanan, Aditya Akella, Minlan Yu | 2024-05-29 | 下载 | Traditional ML inference is evolving toward modeless inference, which abstracts the complexity of model selection from users, allowing the system to automatically choose the most appropriate model for... |
| Multi-Source Coflow Scheduling in Collaborative Edge Computing with Multihop Network | Yuvraj Sahni, Jiannong Cao, Lei Yang, Shengwei Wang | 2024-05-29 | 下载 | Collaborative edge computing has become a popular paradigm where edge devices collaborate by sharing resources. Data dissemination is a fundamental problem in CEC to decide what data is transmitted fr... |
| Preamble Design and Burst-Mode DSP for Upstream Reception of 200G Coherent TDM-PON | Haide Wang, Ji Zhou, Jinyang Yang, Zhiyang Liu, Cheng Li, Weiping Liu, Changyuan Yu | 2024-05-29 | 下载 | Burst-mode DSP based on 10ns preamble is proposed for upstream reception of 200G coherent TDM-PON. The 128-symbol tone preamble is used for SOP, frequency offset, and sampling phase estimation, while ... |
| Quantum Circuit Switching with One-Way Repeaters in Star Networks | Álvaro G. Iñesta, Hyeongrak Choi, Dirk Englund, Stephanie Wehner | 2024-05-29 | 下载 | Distributing quantum states reliably among distant locations is a key challenge in the field of quantum networks. One-way quantum networks address this by using one-way communication and quantum error... |
| To RL or not to RL? An Algorithmic Cheat-Sheet for AI-Based Radio Resource Management | Lorenzo Maggi, Matthew Andrews, Ryo Koblitz | 2024-05-29 | 下载 | Several Radio Resource Management (RRM) use cases can be framed as sequential decision planning problems, where an agent (the base station, typically) makes decisions that influence the network utilit... |
| Optimizing Vehicular Networks with Variational Quantum Circuits-based Reinforcement Learning | Zijiang Yan, Ramsundar Tanikella, Hina Tabassum | 2024-05-29 | 下载 | In vehicular networks (VNets), ensuring both road safety and dependable network connectivity is of utmost importance. Achieving this necessitates the creation of resilient and efficient decision-makin... |
| User Association and Channel Allocation in 5G Mobile Asymmetric Multi-band Heterogeneous Networks | Miao Dai, Gang Sun, Hongfang Yu, Sheng Wang, Dusit Niyato | 2024-05-29 | 下载 | With the proliferation of mobile terminals and the continuous upgrading of services, 4G LTE networks are showing signs of weakness. To enhance the capacity of wireless networks, millimeter waves are i... |
| FlocOff: Data Heterogeneity Resilient Federated Learning with Communication-Efficient Edge Offloading | Mulei Ma, Chenyu Gong, Liekang Zeng, Yang Yang, Liantao Wu | 2024-05-29 | 下载 | Federated Learning (FL) has emerged as a fundamental learning paradigm to harness massive data scattered at geo-distributed edge devices in a privacy-preserving way. |
| Adaptive and Parallel Split Federated Learning in Vehicular Edge Computing | Xianke Qiang, Zheng Chang, Yun Hu, Lei Liu, Timo Hamalainen | 2024-05-29 | 下载 | Vehicular edge intelligence (VEI) is a promising paradigm for enabling future intelligent transportation systems by accommodating artificial intelligence (AI) at the vehicular edge computing (VEC) sys... |