Skip to content

2024-04-22

cs.AR - Architecture

标题作者发布日期PDF摘要
TDRAM: Tag-enhanced DRAM for Efficient CachingMaryam Babaie, Ayaz Akram, Wendy Elsasser, Brent Haukness, Michael Miller, Taeksang Song, Thomas Vogelsang, Steven Woo, Jason Lowe-Power2024-04-22下载As SRAM-based caches are hitting a scaling wall, manufacturers are integrating DRAM-based caches into system designs to continue increasing cache sizes.
Co-designing a Sub-millisecond Latency Event-based Eye Tracking System with Submanifold Sparse CNNBaoheng Zhang, Yizhao Gao, Jingyuan Li, Hayden Kwok-Hay So2024-04-22下载Eye-tracking technology is integral to numerous consumer electronics applications, particularly in the realm of virtual and augmented reality (VR/AR).
HomeLabGym: A real-world testbed for home energy management systemsToon Van Puyvelde, Marie-Sophie Verwee, Gargya Gokhale, Mehran Zareh Eshghdoust, Chris Develder2024-04-22下载Amid growing environmental concerns and resulting energy costs, there is a rising need for efficient Home Energy Management Systems (HEMS). Evaluating such innovative HEMS solutions typically relies o...
On the Systematic Creation of Faithfully Rounded Commutative Truncated Booth MultipliersTheo Drane, Samuel Coward, Mertcan Temel, Joe Leslie-Hurd2024-04-22下载In many instances of fixed-point multiplication, a full precision result is not required. Instead it is sufficient to return a faithfully rounded result.
A Stochastic Rounding-Enabled Low-Precision Floating-Point MAC for DNN TrainingSami Ben Ali, Silviu-Ioan Filip, Olivier Sentieys2024-04-22下载Training Deep Neural Networks (DNNs) can be computationally demanding, particularly when dealing with large models. Recent work has aimed to mitigate this computational challenge by introducing 8-bit ...
CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware ArchitectureJonas Ney, Christoph Füllner, Vincent Lauinger, Laurent Schmalen, Sebastian Randel, Norbert Wehn2024-04-22下载To satisfy the growing throughput demand of data-intensive applications, the performance of optical communication systems increased dramatically in recent years.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU HeterogeneityTyler Griggs, Xiaoxuan Liu, Jiaxiang Yu, Doyoung Kim, Wei-Lin Chiang, Alvin Cheung, Ion Stoica2024-04-22下载Large language models (LLMs) are increasingly integrated into many online services, yet they remain cost-prohibitive to deploy due to the requirement of expensive GPU instances.
Blockchain in a box: A portable blockchain network implementation on Raspberry Pi'sMatija Piškorec, Anton Ivashkevich, Said Haji Abukar, Lundrim Azemi, Md Rezuanul Haque, Mostafa Chegenizadeh, Claudio J. Tessone2024-04-22下载In this paper we describe a prototype of a blockchain-in-a-box system which allows users to easily bootstrap the whole Ethereum Proof-of-Work (PoW) network running on multiple Raspberry Pi nodes - an ...
Frosty: Bringing strong liveness guarantees to the Snow family of consensus protocolsAaron Buchwald, Stephen Buttolph, Andrew Lewis-Pye, Patrick O'Grady, Kevin Sekniqi2024-04-22下载Snowman is the consensus protocol implemented by the Avalanche blockchain and is part of the Snow family of protocols, first introduced through the original Avalanche leaderless consensus protocol.
Proceedings of 3rd Workshop on Heterogeneous Composable and Disaggregated SystemsChristian Pinto, Dong Li, Thaleia Dimitra Doudali, Christina Giannoula, Jie Ren2024-04-22下载The future of computing systems is inevitably embracing a disaggregated and composable pattern: from clusters of computers to pools of resources that can be dynamically combined together and tailored ...
LLAMP: Assessing Network Latency Tolerance of HPC Applications with Linear ProgrammingSiyuan Shen, Langwen Huang, Marcin Chrapek, Timo Schneider, Jai Dayal, Manisha Gajbe, Robert Wisniewski, Torsten Hoefler2024-04-22下载The shift towards high-bandwidth networks driven by AI workloads in data centers and HPC clusters has unintentionally aggravated network latency, adversely affecting the performance of communication-i...
New Solutions Based on the Generalized Eigenvalue Problem for the Data Collaboration AnalysisYuta Kawakami, Yuichi Takano, Akira Imakura2024-04-22下载In recent years, the accumulation of data across various institutions has garnered attention for the technology of confidential data analysis, which improves analytical accuracy by sharing data betwee...
Towards Proxy Staking Accounts Based on NFTs in EthereumViktor Valaštín, Roman Bitarovský, Kristián Košťál, Ivan Kotuliak2024-04-22下载Blockchain is a technology that is often used to share data and assets. However, in the decentralized ecosystem, blockchain-based systems can be utilized to share information and assets without the tr...
Apodotiko: Enabling Efficient Serverless Federated Learning in Heterogeneous EnvironmentsMohak Chadha, Alexander Jensen, Jianfeng Gu, Osama Abboud, Michael Gerndt2024-04-22下载Federated Learning (FL) is an emerging machine learning paradigm that enables the collaborative training of a shared global model across distributed clients while keeping the data decentralized.
HamilToniQ: An Open-Source Benchmark Toolkit for Quantum ComputersXiaotian Xu, Kuan-Cheng Chen, Robert Wille2024-04-22下载In this paper, we introduce HamilToniQ, an open-source, and application-oriented benchmarking toolkit for the comprehensive evaluation of Quantum Processing Units (QPUs).
Adaptive Heterogeneous Client Sampling for Federated Learning over Wireless NetworksBing Luo, Wenli Xiao, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas2024-04-22下载Federated learning (FL) algorithms usually sample a fraction of clients in each round (partial participation) when the number of participants is large and the server's communication bandwidth is limit...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Latency-Distortion Tradeoffs in Communicating Classification Results over Noisy ChannelsNoel Teku, Sudarshan Adiga, Ravi Tandon2024-04-22下载In this work, the problem of communicating decisions of a classifier over a noisy channel is considered. With machine learning based models being used in variety of time-sensitive applications, transm...
Mapping Wireless Networks into Digital Reality through Joint Vertical and Horizontal LearningZifan Zhang, Mingzhe Chen, Zhaohui Yang, Yuchen Liu2024-04-22下载In recent years, the complexity of 5G and beyond wireless networks has escalated, prompting a need for innovative frameworks to facilitate flexible management and efficient deployment.
Poisoning Attacks on Federated Learning-based Wireless Traffic PredictionZifan Zhang, Minghong Fang, Jiayuan Huang, Yuchen Liu2024-04-22下载Federated Learning (FL) offers a distributed framework to train a global control model across multiple base stations without compromising the privacy of their local network data.
DE-LIoT: The Data-Energy Networking Paradigm for Sustainable Light-Based Internet of ThingsAmila Perera, Roshan Godaliyadda, Marcos Katz2024-04-22下载The growing demand for Internet of Things (IoT) networks has sparked interest in sustainable, zero-energy designs through Energy Harvesting (EH) to extend the lifespans of IoT sensors.
Distributed Learning for Wi-Fi AP Load PredictionDariush Salami, Francesc Wilhelmi, Lorenzo Galati-Giordano, Mika Kasslin2024-04-22下载The increasing cloudification and softwarization of networks foster the interplay among multiple independently managed deployments. An appealing reason for such an interplay lies in distributed Machin...
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research TrajectoriesNing Yang, Shuo Chen, Haijun Zhang, Randall Berry2024-04-22下载Mobile Edge Computing (MEC) broadens the scope of computation and storage beyond the central network, incorporating edge nodes close to end devices.
EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML ModelsMathias Thorsager, Victor Croisfelt, Junya Shiraishi, Petar Popovski2024-04-22下载This paper introduces EcoPull, a sustainable Internet of Things (IoT) framework empowered by tiny machine learning (TinyML) models for fetching images from wireless visual sensor networks.
TrimCaching: Parameter-sharing Edge Caching for AI Model DownloadingGuanqiao Qu, Zheng Lin, Qian Chen, Jian Li, Fangming Liu, Xianhao Chen, Kaibin Huang2024-04-22下载Next-generation mobile networks are expected to facilitate fast AI model downloading to end users. By caching models on edge servers, mobile networks can deliver models to end users with low latency, ...
LLAMP: Assessing Network Latency Tolerance of HPC Applications with Linear ProgrammingSiyuan Shen, Langwen Huang, Marcin Chrapek, Timo Schneider, Jai Dayal, Manisha Gajbe, Robert Wisniewski, Torsten Hoefler2024-04-22下载The shift towards high-bandwidth networks driven by AI workloads in data centers and HPC clusters has unintentionally aggravated network latency, adversely affecting the performance of communication-i...
Dismantling Common Internet Services for Ad-Malware DetectionFlorian Nettersheim, Stephan Arlt, Michael Rademacher2024-04-22下载Online advertising represents a main instrument for publishers to fund content on the World Wide Web. Unfortunately, a significant number of online advertisements often accommodates potentially malici...
Access-Point to Access-Point Connectivity for PON-based OWC Spine and Leaf Data Centre ArchitectureAbrar S. Alhazmi, Sanaa H. Mohamed, Ahmad Qidan, T. E. H. El-Gorashi, Jaafar M. H. Elmirghani2024-04-22下载In this paper, we propose incorporating Optical Wireless Communication (OWC) and Passive Optical Network (PON) technologies into next generation spine-and-leaf Data Centre Networks (DCNs).
5GC2^2ache: Improving 5G UPF Performance via Cache OptimizationHaonan Jia, Meng Wang, Biyi Li, Yirui Liu, Junchen Guo, Pengyu Zhang2024-04-22下载Last Level Cache (LLC) is a precious and critical resource that impacts the performance of applications running on top of CPUs. In this paper, we reveal the significant impact of LLC on the performanc...
Langues en danger et multilinguisme num{é}riqueMokhtar Ben Henda2024-04-22下载In the era of globalization and digital networks, the so-called ''minored'' or ''endangered'' languages are facing a twofold dilemma: either succeed in their digital modernity by accepting a ''painful...
Cross-Modal Generative Semantic Communications for Mobile AIGC: Joint Semantic Encoding and Prompt EngineeringYinqiu Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Shiwen Mao, Ping Zhang, Xuemin Shen2024-04-22下载Employing massive Mobile AI-Generated Content (AIGC) Service Providers (MASPs) with powerful models, high-quality AIGC services can become accessible for resource-constrained end users.
ICST-DNET: An Interpretable Causal Spatio-Temporal Diffusion Network for Traffic Speed PredictionYi Rong, Yingchi Mao, Yinqiu Liu, Ling Chen, Xiaoming He, Dusit Niyato2024-04-22下载Traffic speed prediction is significant for intelligent navigation and congestion alleviation. However, making accurate predictions is challenging due to three factors: 1) traffic diffusion, i.e.
Adaptive Heterogeneous Client Sampling for Federated Learning over Wireless NetworksBing Luo, Wenli Xiao, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas2024-04-22下载Federated learning (FL) algorithms usually sample a fraction of clients in each round (partial participation) when the number of participants is large and the server's communication bandwidth is limit...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Taming Server Memory TCO with Multiple Software-Defined Compressed TiersSandeep Kumar, Aravinda Prasad, Sreenivas Subramoney2024-04-22下载Memory accounts for 33 - 50% of the total cost of ownership (TCO) in modern data centers. We propose a novel solution to tame memory TCO through the novel creation and judicious management of multiple...

cs.PF - Performance

标题作者发布日期PDF摘要
Performance Characterization of Expert Router for Scalable LLM InferenceJosef Pichlmeier, Philipp Ross, Andre Luckow2024-04-22下载Large Language Models (LLMs) have experienced widespread adoption across scientific and industrial domains due to their versatility and utility for diverse tasks.
LLAMP: Assessing Network Latency Tolerance of HPC Applications with Linear ProgrammingSiyuan Shen, Langwen Huang, Marcin Chrapek, Timo Schneider, Jai Dayal, Manisha Gajbe, Robert Wisniewski, Torsten Hoefler2024-04-22下载The shift towards high-bandwidth networks driven by AI workloads in data centers and HPC clusters has unintentionally aggravated network latency, adversely affecting the performance of communication-i...

基于 VitePress 构建