2024-04-22

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
TDRAM: Tag-enhanced DRAM for Efficient Caching	Maryam Babaie, Ayaz Akram, Wendy Elsasser, Brent Haukness, Michael Miller, Taeksang Song, Thomas Vogelsang, Steven Woo, Jason Lowe-Power	2024-04-22	下载	As SRAM-based caches are hitting a scaling wall, manufacturers are integrating DRAM-based caches into system designs to continue increasing cache sizes.
Co-designing a Sub-millisecond Latency Event-based Eye Tracking System with Submanifold Sparse CNN	Baoheng Zhang, Yizhao Gao, Jingyuan Li, Hayden Kwok-Hay So	2024-04-22	下载	Eye-tracking technology is integral to numerous consumer electronics applications, particularly in the realm of virtual and augmented reality (VR/AR).
HomeLabGym: A real-world testbed for home energy management systems	Toon Van Puyvelde, Marie-Sophie Verwee, Gargya Gokhale, Mehran Zareh Eshghdoust, Chris Develder	2024-04-22	下载	Amid growing environmental concerns and resulting energy costs, there is a rising need for efficient Home Energy Management Systems (HEMS). Evaluating such innovative HEMS solutions typically relies o...
On the Systematic Creation of Faithfully Rounded Commutative Truncated Booth Multipliers	Theo Drane, Samuel Coward, Mertcan Temel, Joe Leslie-Hurd	2024-04-22	下载	In many instances of fixed-point multiplication, a full precision result is not required. Instead it is sufficient to return a faithfully rounded result.
A Stochastic Rounding-Enabled Low-Precision Floating-Point MAC for DNN Training	Sami Ben Ali, Silviu-Ioan Filip, Olivier Sentieys	2024-04-22	下载	Training Deep Neural Networks (DNNs) can be computationally demanding, particularly when dealing with large models. Recent work has aimed to mitigate this computational challenge by introducing 8-bit ...
CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware Architecture	Jonas Ney, Christoph Füllner, Vincent Lauinger, Laurent Schmalen, Sebastian Randel, Norbert Wehn	2024-04-22	下载	To satisfy the growing throughput demand of data-intensive applications, the performance of optical communication systems increased dramatically in recent years.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity	Tyler Griggs, Xiaoxuan Liu, Jiaxiang Yu, Doyoung Kim, Wei-Lin Chiang, Alvin Cheung, Ion Stoica	2024-04-22	下载	Large language models (LLMs) are increasingly integrated into many online services, yet they remain cost-prohibitive to deploy due to the requirement of expensive GPU instances.
Blockchain in a box: A portable blockchain network implementation on Raspberry Pi's	Matija Piškorec, Anton Ivashkevich, Said Haji Abukar, Lundrim Azemi, Md Rezuanul Haque, Mostafa Chegenizadeh, Claudio J. Tessone	2024-04-22	下载	In this paper we describe a prototype of a blockchain-in-a-box system which allows users to easily bootstrap the whole Ethereum Proof-of-Work (PoW) network running on multiple Raspberry Pi nodes - an ...
Frosty: Bringing strong liveness guarantees to the Snow family of consensus protocols	Aaron Buchwald, Stephen Buttolph, Andrew Lewis-Pye, Patrick O'Grady, Kevin Sekniqi	2024-04-22	下载	Snowman is the consensus protocol implemented by the Avalanche blockchain and is part of the Snow family of protocols, first introduced through the original Avalanche leaderless consensus protocol.
Proceedings of 3rd Workshop on Heterogeneous Composable and Disaggregated Systems	Christian Pinto, Dong Li, Thaleia Dimitra Doudali, Christina Giannoula, Jie Ren	2024-04-22	下载	The future of computing systems is inevitably embracing a disaggregated and composable pattern: from clusters of computers to pools of resources that can be dynamically combined together and tailored ...
LLAMP: Assessing Network Latency Tolerance of HPC Applications with Linear Programming	Siyuan Shen, Langwen Huang, Marcin Chrapek, Timo Schneider, Jai Dayal, Manisha Gajbe, Robert Wisniewski, Torsten Hoefler	2024-04-22	下载	The shift towards high-bandwidth networks driven by AI workloads in data centers and HPC clusters has unintentionally aggravated network latency, adversely affecting the performance of communication-i...
New Solutions Based on the Generalized Eigenvalue Problem for the Data Collaboration Analysis	Yuta Kawakami, Yuichi Takano, Akira Imakura	2024-04-22	下载	In recent years, the accumulation of data across various institutions has garnered attention for the technology of confidential data analysis, which improves analytical accuracy by sharing data betwee...
Towards Proxy Staking Accounts Based on NFTs in Ethereum	Viktor Valaštín, Roman Bitarovský, Kristián Košťál, Ivan Kotuliak	2024-04-22	下载	Blockchain is a technology that is often used to share data and assets. However, in the decentralized ecosystem, blockchain-based systems can be utilized to share information and assets without the tr...
Apodotiko: Enabling Efficient Serverless Federated Learning in Heterogeneous Environments	Mohak Chadha, Alexander Jensen, Jianfeng Gu, Osama Abboud, Michael Gerndt	2024-04-22	下载	Federated Learning (FL) is an emerging machine learning paradigm that enables the collaborative training of a shared global model across distributed clients while keeping the data decentralized.
HamilToniQ: An Open-Source Benchmark Toolkit for Quantum Computers	Xiaotian Xu, Kuan-Cheng Chen, Robert Wille	2024-04-22	下载	In this paper, we introduce HamilToniQ, an open-source, and application-oriented benchmarking toolkit for the comprehensive evaluation of Quantum Processing Units (QPUs).
Adaptive Heterogeneous Client Sampling for Federated Learning over Wireless Networks	Bing Luo, Wenli Xiao, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas	2024-04-22	下载	Federated learning (FL) algorithms usually sample a fraction of clients in each round (partial participation) when the number of participants is large and the server's communication bandwidth is limit...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Latency-Distortion Tradeoffs in Communicating Classification Results over Noisy Channels	Noel Teku, Sudarshan Adiga, Ravi Tandon	2024-04-22	下载	In this work, the problem of communicating decisions of a classifier over a noisy channel is considered. With machine learning based models being used in variety of time-sensitive applications, transm...
Mapping Wireless Networks into Digital Reality through Joint Vertical and Horizontal Learning	Zifan Zhang, Mingzhe Chen, Zhaohui Yang, Yuchen Liu	2024-04-22	下载	In recent years, the complexity of 5G and beyond wireless networks has escalated, prompting a need for innovative frameworks to facilitate flexible management and efficient deployment.
Poisoning Attacks on Federated Learning-based Wireless Traffic Prediction	Zifan Zhang, Minghong Fang, Jiayuan Huang, Yuchen Liu	2024-04-22	下载	Federated Learning (FL) offers a distributed framework to train a global control model across multiple base stations without compromising the privacy of their local network data.
DE-LIoT: The Data-Energy Networking Paradigm for Sustainable Light-Based Internet of Things	Amila Perera, Roshan Godaliyadda, Marcos Katz	2024-04-22	下载	The growing demand for Internet of Things (IoT) networks has sparked interest in sustainable, zero-energy designs through Energy Harvesting (EH) to extend the lifespans of IoT sensors.
Distributed Learning for Wi-Fi AP Load Prediction	Dariush Salami, Francesc Wilhelmi, Lorenzo Galati-Giordano, Mika Kasslin	2024-04-22	下载	The increasing cloudification and softwarization of networks foster the interplay among multiple independently managed deployments. An appealing reason for such an interplay lies in distributed Machin...
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories	Ning Yang, Shuo Chen, Haijun Zhang, Randall Berry	2024-04-22	下载	Mobile Edge Computing (MEC) broadens the scope of computation and storage beyond the central network, incorporating edge nodes close to end devices.
EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models	Mathias Thorsager, Victor Croisfelt, Junya Shiraishi, Petar Popovski	2024-04-22	下载	This paper introduces EcoPull, a sustainable Internet of Things (IoT) framework empowered by tiny machine learning (TinyML) models for fetching images from wireless visual sensor networks.
TrimCaching: Parameter-sharing Edge Caching for AI Model Downloading	Guanqiao Qu, Zheng Lin, Qian Chen, Jian Li, Fangming Liu, Xianhao Chen, Kaibin Huang	2024-04-22	下载	Next-generation mobile networks are expected to facilitate fast AI model downloading to end users. By caching models on edge servers, mobile networks can deliver models to end users with low latency, ...
LLAMP: Assessing Network Latency Tolerance of HPC Applications with Linear Programming	Siyuan Shen, Langwen Huang, Marcin Chrapek, Timo Schneider, Jai Dayal, Manisha Gajbe, Robert Wisniewski, Torsten Hoefler	2024-04-22	下载	The shift towards high-bandwidth networks driven by AI workloads in data centers and HPC clusters has unintentionally aggravated network latency, adversely affecting the performance of communication-i...
Dismantling Common Internet Services for Ad-Malware Detection	Florian Nettersheim, Stephan Arlt, Michael Rademacher	2024-04-22	下载	Online advertising represents a main instrument for publishers to fund content on the World Wide Web. Unfortunately, a significant number of online advertisements often accommodates potentially malici...
Access-Point to Access-Point Connectivity for PON-based OWC Spine and Leaf Data Centre Architecture	Abrar S. Alhazmi, Sanaa H. Mohamed, Ahmad Qidan, T. E. H. El-Gorashi, Jaafar M. H. Elmirghani	2024-04-22	下载	In this paper, we propose incorporating Optical Wireless Communication (OWC) and Passive Optical Network (PON) technologies into next generation spine-and-leaf Data Centre Networks (DCNs).
5GC $^2$ ache: Improving 5G UPF Performance via Cache Optimization	Haonan Jia, Meng Wang, Biyi Li, Yirui Liu, Junchen Guo, Pengyu Zhang	2024-04-22	下载	Last Level Cache (LLC) is a precious and critical resource that impacts the performance of applications running on top of CPUs. In this paper, we reveal the significant impact of LLC on the performanc...
Langues en danger et multilinguisme num{é}rique	Mokhtar Ben Henda	2024-04-22	下载	In the era of globalization and digital networks, the so-called ''minored'' or ''endangered'' languages are facing a twofold dilemma: either succeed in their digital modernity by accepting a ''painful...
Cross-Modal Generative Semantic Communications for Mobile AIGC: Joint Semantic Encoding and Prompt Engineering	Yinqiu Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Shiwen Mao, Ping Zhang, Xuemin Shen	2024-04-22	下载	Employing massive Mobile AI-Generated Content (AIGC) Service Providers (MASPs) with powerful models, high-quality AIGC services can become accessible for resource-constrained end users.
ICST-DNET: An Interpretable Causal Spatio-Temporal Diffusion Network for Traffic Speed Prediction	Yi Rong, Yingchi Mao, Yinqiu Liu, Ling Chen, Xiaoming He, Dusit Niyato	2024-04-22	下载	Traffic speed prediction is significant for intelligent navigation and congestion alleviation. However, making accurate predictions is challenging due to three factors: 1) traffic diffusion, i.e.
Adaptive Heterogeneous Client Sampling for Federated Learning over Wireless Networks	Bing Luo, Wenli Xiao, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas	2024-04-22	下载	Federated learning (FL) algorithms usually sample a fraction of clients in each round (partial participation) when the number of participants is large and the server's communication bandwidth is limit...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Taming Server Memory TCO with Multiple Software-Defined Compressed Tiers	Sandeep Kumar, Aravinda Prasad, Sreenivas Subramoney	2024-04-22	下载	Memory accounts for 33 - 50% of the total cost of ownership (TCO) in modern data centers. We propose a novel solution to tame memory TCO through the novel creation and judicious management of multiple...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Performance Characterization of Expert Router for Scalable LLM Inference	Josef Pichlmeier, Philipp Ross, Andre Luckow	2024-04-22	下载	Large Language Models (LLMs) have experienced widespread adoption across scientific and industrial domains due to their versatility and utility for diverse tasks.
LLAMP: Assessing Network Latency Tolerance of HPC Applications with Linear Programming	Siyuan Shen, Langwen Huang, Marcin Chrapek, Timo Schneider, Jai Dayal, Manisha Gajbe, Robert Wisniewski, Torsten Hoefler	2024-04-22	下载	The shift towards high-bandwidth networks driven by AI workloads in data centers and HPC clusters has unintentionally aggravated network latency, adversely affecting the performance of communication-i...