Skip to content

2024-08-08

cs.AR - Architecture

标题作者发布日期PDF摘要
Deeploy: Enabling Energy-Efficient Deployment of Small Language Models On Heterogeneous MicrocontrollersMoritz Scherer, Luka Macan, Victor Jung, Philip Wiese, Luca Bompani, Alessio Burrello, Francesco Conti, Luca Benini2024-08-08下载With the rise of Embodied Foundation Models (EFMs), most notably Small Language Models (SLMs), adapting Transformers for edge applications has become a very active field of research.
A Node-Based Polar List Decoder with Frame Interleaving and Ensemble Decoding SupportYuqing Ren, Leyu Zhang, Ludovic Damien Blanc, Yifei Shen, Xinwei Li, Alexios Balatsoukas-Stimming, Chuan Zhang, Andreas Burg2024-08-08下载Node-based successive cancellation list (SCL) decoding has received considerable attention in wireless communications for its significant reduction in decoding latency, particularly with 5G New Radio ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Noise-augmented Chaotic Ising Machines for Combinatorial Optimization and SamplingKyle Lee, Shuvro Chowdhury, Kerem Y. Camsari2024-08-08下载Ising machines, hardware accelerators for combinatorial optimization and probabilistic sampling problems, have gained significant interest recently.
Early-Exit meets Model-Distributed Inference at Edge NetworksMarco Colocrese, Erdem Koyuncu, Hulya Seferoglu2024-08-08下载Distributed inference techniques can be broadly classified into data-distributed and model-distributed schemes. In data-distributed inference (DDI), each worker carries the entire deep neural network ...
Sparse Spiking Neural-like Membrane Systems on Graphics Processing UnitsJavier Hernández-Tello, Miguel Ángel Martínez-del-Amor, David Orellana-Martín, Francis George C. Cabarle2024-08-08下载The parallel simulation of Spiking Neural P systems is mainly based on a matrix representation, where the graph inherent to the neural model is encoded in an adjacency matrix.
SCOOT: SLO-Oriented Performance Tuning for LLM Inference EnginesKe Cheng, Zhi Wang, Wen Hu, Tiannuo Yang, Jianguo Li, Sheng Zhang2024-08-08下载As large language models (LLMs) are gaining increasing popularity across a wide range of web applications, it is of great importance to optimize service-level objectives (SLOs) for LLM inference servi...
Qonductor: A Cloud Orchestrator for Quantum ComputingEmmanouil Giortamis, Francisco Romão, Nathaniel Tornow, Dmitry Lugovoy, Pramod Bhatotia2024-08-08下载We describe Qonductor, a cloud orchestrator for hybrid quantum-classical applications that run on heterogeneous hybrid resources. Qonductor abstracts away the complexity of hybrid programming and reso...
MoC-System: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model TrainingWeilin Cai, Le Qin, Jiayi Huang2024-08-08下载As large language models continue to scale up, distributed training systems have expanded beyond 10k nodes, intensifying the importance of fault tolerance.
DistTrain: Addressing Model and Data Heterogeneity with Disaggregated Training for Multimodal Large Language ModelsZili Zhang, Yinmin Zhong, Yimin Jiang, Hanpeng Hu, Jianjian Sun, Zheng Ge, Yibo Zhu, Daxin Jiang, Xin Jin2024-08-08下载Multimodal large language models (LLMs) empower LLMs to ingest inputs and generate outputs in multiple forms, such as text, image, and audio. However, the integration of multiple modalities introduces...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Redefining Accountability: Navigating Legal Challenges of Participant Liability in Decentralized Autonomous OrganizationsAneta Napieralska, Przemysław Kępczyński2024-08-08下载In the digital era, where innovative technologies like blockchain are revolutionizing traditional organizational paradigms, Decentralized Autonomous Organizations (DAOs) emerge as avant-garde models o...
Overlay-based Decentralized Federated Learning in Bandwidth-limited NetworksYudi Huang, Tingyang Sun, Ting He2024-08-08下载The emerging machine learning paradigm of decentralized federated learning (DFL) has the promise of greatly boosting the deployment of artificial intelligence (AI) by directly learning across distribu...
Role of Error Syndromes in Teleportation SchedulingAparimit Chandra, Filip Rozpędek, Don Towsley2024-08-08下载Quantum teleportation enables quantum information transmission, but requires distribution of entangled resource states. Unfortunately, decoherence, caused by environmental interference during quantum ...
Advancements in UWB: Paving the Way for Sovereign Data Networks in Healthcare FacilitiesKhan Reaz, Thibaud Ardoin, Lea Muth, Marian Margraf, Gerhard Wunder, Mahsa Kholghi, Kai Jansen, Christian Zenger, Julian Schmidt, Enrico Köppe, Zoran Utkovski, Igor Bjelakovic, Mathis Schmieder, Olaf Dressel2024-08-08下载Ultra-Wideband (UWB) technology re-emerges as a groundbreaking ranging technology with its precise micro-location capabilities and robustness.
Early-Exit meets Model-Distributed Inference at Edge NetworksMarco Colocrese, Erdem Koyuncu, Hulya Seferoglu2024-08-08下载Distributed inference techniques can be broadly classified into data-distributed and model-distributed schemes. In data-distributed inference (DDI), each worker carries the entire deep neural network ...
TupleChain: Fast Lookup of OpenFlow Table with Multifaceted ScalabilityYanbiao Li, Neng Ren, Xin Wang, Yuxuan Chen, Xinyi Zhang, Lingbo Guo, Gaogang Xie2024-08-08下载OpenFlow switches are fundamental components of software defined networking, where the key operation is to look up flow tables to determine which flow an incoming packet belongs to.
Towards Explainable Network Intrusion Detection using Large Language ModelsPaul R. B. Houssel, Priyanka Singh, Siamak Layeghy, Marius Portmann2024-08-08下载Large Language Models (LLMs) have revolutionised natural language processing tasks, particularly as chat agents. However, their applicability to threat detection problems remains unclear.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Columbo: Low Level End-to-End System Traces through Modular Full-System SimulationJakob Görgen, Vaastav Anand, Hejing Li, Jialin Li, Antoine Kaufmann2024-08-08下载Fully understanding performance is a growing challenge when building next-generation cloud systems. Often these systems build on next-generation hardware, and evaluation in realistic physical testbeds...
Crash Consistency in DRAM-NVM-Disk Hybrid Storage SystemGuoyu Wang, Xilong Che, Haoyang Wei, Chenju Pei, Juncheng Hu2024-08-08下载NVM is used as a new hierarchy in the storage system, due to its intermediate speed and capacity between DRAM, and its byte granularity. However, consistency problems emerge when we attempt to put DRA...

cs.PF - Performance

标题作者发布日期PDF摘要
Columbo: Low Level End-to-End System Traces through Modular Full-System SimulationJakob Görgen, Vaastav Anand, Hejing Li, Jialin Li, Antoine Kaufmann2024-08-08下载Fully understanding performance is a growing challenge when building next-generation cloud systems. Often these systems build on next-generation hardware, and evaluation in realistic physical testbeds...
Evaluation of Hash Algorithm Performance for Cryptocurrency Exchanges Based on Blockchain SystemAbel C. H. Chen2024-08-08下载The blockchain system has emerged as one of the focal points of research in recent years, particularly in applications and services such as cryptocurrencies and smart contracts.

基于 VitePress 构建