2024-06-24
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| LLM-Aided Testbench Generation and Bug Detection for Finite-State Machines | Jitendra Bhandari, Johann Knechtel, Ramesh Narayanaswamy, Siddharth Garg, Ramesh Karri | 2024-06-24 | Download | This work investigates the potential of tailoring Large Language Models (LLMs), specifically GPT3.5 and GPT4, for the domain of chip testing. A key aspect of chip design is functional testing, which r... |
| Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving | Ruoyu Qin, Zheming Li, Weiran He, Mingxing Zhang, Yongwei Wu, Weimin Zheng, Xinran Xu | 2024-06-24 | Download | Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. It features a KVCache-centric disaggregated architecture that separates the prefill and decoding clusters. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Robust Zero Trust Architecture: Joint Blockchain based Federated learning and Anomaly Detection based Framework | Shiva Raj Pokhrel, Luxing Yang, Sutharshan Rajasegarar, Gang Li | 2024-06-24 | Download | This paper introduces a robust zero-trust architecture (ZTA) tailored for the decentralized system that empowers efficient remote work and collaboration within IoT networks. |
| GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism | Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia | 2024-06-24 | Download | Deep neural networks (DNNs) continue to grow rapidly in size, making them infeasible to train on a single device. Pipeline parallelism is commonly used in existing DNN systems to support large-scale D... |
| Scalable Artificial Intelligence for Science: Perspectives, Methods and Exemplars | Wesley Brewer, Aditya Kashi, Sajal Dash, Aristeidis Tsaris, Junqi Yin, Mallikarjun Shankar, Feiyi Wang | 2024-06-24 | Download | In a post-ChatGPT world, this paper explores the potential of leveraging scalable artificial intelligence for scientific discovery. We propose that scaling up artificial intelligence on high-performan... |
| Fast Switching Serial and Parallel Paradigms of SNN Inference on Multi-core Heterogeneous Neuromorphic Platform SpiNNaker2 | Jiaxin Huang, Bernhard Vogginger, Florian Kelber, Hector Gonzalez, Klaus Knobloch, Christian Georg Mayr | 2024-06-24 | Download | With serial and parallel processors introduced into Spiking Neural Networks (SNNs) execution, more and more researchers are dedicated to improving the performance of the computing paradigms by taking ... |
| A Multi-Party, Multi-Blockchain Atomic Swap Protocol with Universal Adaptor Secret | Shengewei You, Aditya Joshi, Andrey Kuehlkamp, Jarek Nabrzyski | 2024-06-24 | Download | The increasing complexity of digital asset transactions across multiple blockchains necessitates a robust atomic swap protocol that can securely handle more than two participants. |
| Bisimulation for Impure Simplicial Complexes | Marta Bílková, Hans van Ditmarsch, Roman Kuznets, Rojo Randrianomentsoa | 2024-06-24 | Download | As an alternative to Kripke models, simplicial complexes are a versatile semantic primitive on which to interpret epistemic logic. Given a set of vertices, a simplicial complex is a downward closed se... |
| Towards Communication-Efficient Peer-to-Peer Networks | Khalid Hourani, William K. Moses, Gopal Pandurangan | 2024-06-24 | Download | We focus on designing Peer-to-Peer (P2P) networks that enable efficient communication. Over the last two decades, there has been substantial algorithmic research on distributed protocols for building ... |
| Digital Twinning of a Pressurized Water Reactor Startup Operation and Partial Computational Offloading in In-network Computing-Assisted Multiaccess Edge Computing | Ibrahim Aliyu, Awwal M. Arigi, Tai-Won Um, Jinsul Kim | 2024-06-24 | Download | This paper addresses the challenge of representing complex human action (HA) in a nuclear power plant (NPP) digital twin (DT) and minimizing latency in partial computation offloading (PCO) in sixth-ge... |
| Semantic Revolution from Communications to Orchestration for 6G: Challenges, Enablers, and Research Directions | Masoud Shokrnezhad, Hamidreza Mazandarani, Tarik Taleb, Jaeseung Song, Richard Li | 2024-06-24 | Download | In the context of emerging 6G services, the realization of everything-to-everything interactions involving a myriad of physical and digital entities presents a crucial challenge. |
| Decentralized Task Offloading and Load-Balancing for Mobile Edge Computing in Dense Networks | Mariam Yahya, Alexander Conzelmann, Setareh Maghsudi | 2024-06-24 | Download | We study the problem of decentralized task offloading and load-balancing in a dense network with numerous devices and a set of edge servers. Solving this problem optimally is complicated due to the un... |
| Placing Timely Refreshing Services at the Network Edge | Xishuo Li, Shan Zhang, Hongbin Luo, Xiao Ma, Junyi He | 2024-06-24 | Download | Accommodating services at the network edge is favorable for time-sensitive applications. However, maintaining service usability is resource-consuming in terms of pulling service images to the edge, sy... |
| Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving | Ruoyu Qin, Zheming Li, Weiran He, Mingxing Zhang, Yongwei Wu, Weimin Zheng, Xinran Xu | 2024-06-24 | Download | Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. It features a KVCache-centric disaggregated architecture that separates the prefill and decoding clusters. |
| Evaluating Serverless Machine Learning Performance on Google Cloud Run | Prerana Khatiwada, Pranjal Dhakal | 2024-06-24 | Download | End-users can get functions-as-a-service from serverless platforms, which promise lower hosting costs, high availability, fault tolerance, and dynamic flexibility for hosting individual functions know... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Integrating Generative AI with Network Digital Twins for Enhanced Network Operations | Kassi Muhammad, Teef David, Giulia Nassisid, Tina Farus | 2024-06-24 | Download | As telecommunications networks become increasingly complex, the integration of advanced technologies such as network digital twins and generative artificial intelligence (AI) emerges as a pivotal solu... |
| Digital Twinning of a Pressurized Water Reactor Startup Operation and Partial Computational Offloading in In-network Computing-Assisted Multiaccess Edge Computing | Ibrahim Aliyu, Awwal M. Arigi, Tai-Won Um, Jinsul Kim | 2024-06-24 | Download | This paper addresses the challenge of representing complex human action (HA) in a nuclear power plant (NPP) digital twin (DT) and minimizing latency in partial computation offloading (PCO) in sixth-ge... |
| Semantic Revolution from Communications to Orchestration for 6G: Challenges, Enablers, and Research Directions | Masoud Shokrnezhad, Hamidreza Mazandarani, Tarik Taleb, Jaeseung Song, Richard Li | 2024-06-24 | Download | In the context of emerging 6G services, the realization of everything-to-everything interactions involving a myriad of physical and digital entities presents a crucial challenge. |
| A Queuing Envelope Model for Estimating Latency Guarantees in Deterministic Networking Scenarios | Nataliia Koneva, Alfonso Sánchez-Macián, José Alberto Hernández, Farhad Arpanaei, Óscar González de Dios | 2024-06-24 | Download | Accurate estimation of queuing delays is crucial for designing and optimizing communication networks, particularly in the context of Deterministic Networking (DetNet) scenarios. |
| Placing Timely Refreshing Services at the Network Edge | Xishuo Li, Shan Zhang, Hongbin Luo, Xiao Ma, Junyi He | 2024-06-24 | Download | Accommodating services at the network edge is favorable for time-sensitive applications. However, maintaining service usability is resource-consuming in terms of pulling service images to the edge, sy... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Evaluating Serverless Machine Learning Performance on Google Cloud Run | Prerana Khatiwada, Pranjal Dhakal | 2024-06-24 | Download | End-users can get functions-as-a-service from serverless platforms, which promise lower hosting costs, high availability, fault tolerance, and dynamic flexibility for hosting individual functions know... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournaments | Grigori Fursin | 2024-06-24 | Download | This white paper introduces my educational community initiative to learn how to run AI, ML and other emerging workloads in the most efficient and cost-effective way across diverse models, data sets, s... |