2024-08-14

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Development of simulation model for Single Carrier Transceiver for Nanosatellite	Pallewela R. C. K, Rohana Thilakuamra, Prabath Buddhika	2024-08-14	下载	CubeSat is a nanosatellite concept emerged from a paper published by Stanford University and with their low cost nature and extreme feasibility , more started researching on nano satellites.
Efficient Edge AI: Deploying Convolutional Neural Networks on FPGA with the Gemmini Accelerator	Federico Nicolas Peccia, Svetlana Pavlitska, Tobias Fleck, Oliver Bringmann	2024-08-14	下载	The growing concerns regarding energy consumption and privacy have prompted the development of AI solutions deployable on the edge, circumventing the substantial CO2 emissions associated with cloud se...
LPU: A Latency-Optimized and Highly Scalable Processor for Large Language Model Inference	Seungjae Moon, Jung-Hoon Kim, Junsoo Kim, Seongmin Hong, Junseo Cha, Minsu Kim, Sukbin Lim, Gyubin Choi, Dongjin Seo, Jongho Kim, Hunjong Lee, Hyunjun Park, Ryeowook Ko, Soongyu Choi, Jongse Park, Jinwon Lee, Joo-Young Kim	2024-08-14	下载	The explosive arrival of OpenAI's ChatGPT has fueled the globalization of large language model (LLM), which consists of billions of pretrained parameters that embodies the aspects of syntax and semant...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Learning-Augmented Competitive Algorithms for Spatiotemporal Online Allocation with Deadline Constraints	Adam Lechowicz, Nicolas Christianson, Bo Sun, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, Prashant Shenoy	2024-08-14	下载	We introduce and study spatiotemporal online allocation with deadline constraints ( $\mathsf{SOAD}$ ), a new online problem motivated by emerging challenges in sustainability and energy.
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference	Rohan Baskar Prabhakar, Hengrui Zhang, David Wentzlaff	2024-08-14	下载	Large Transformer networks are increasingly used in settings where low inference latency can improve the end-user experience and enable new applications.
Modernizing an Operational Real-time Tsunami Simulator to Support Diverse Hardware Platforms	Keichi Takahashi, Takashi Abe, Akihiro Musa, Yoshihiko Sato, Yoichi Shimomura, Hiroyuki Takizawa, Shunichi Koshimura	2024-08-14	下载	To issue early warnings and rapidly initiate disaster responses after tsunami damage, various tsunami inundation forecast systems have been deployed worldwide.
FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher	Alessio Mora, Lorenzo Valerio, Paolo Bellavista, Andrea Passarella	2024-08-14	下载	Federated Learning (FL) systems enable the collaborative training of machine learning models without requiring centralized collection of individual data.
Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems	Ning Lu, Qian Xie, Hao Zhang, Wenyi Fang, Yang Zheng, Zheng Hu, Jiantao Ma	2024-08-14	下载	Large Language Models (LLMs) are revolutionizing the AI industry with their superior capabilities. Training these models requires large-scale GPU clusters and significant computing time, leading to fr...
UNR: Unified Notifiable RMA Library for HPC	Guangnan Feng, Jiabin Xie, Dezun Dong, Yutong Lu	2024-08-14	下载	Remote Memory Access (RMA) enables direct access to remote memory to achieve high performance for HPC applications. However, most modern parallel programming models lack schemes for the remote process...
Efficient Edge AI: Deploying Convolutional Neural Networks on FPGA with the Gemmini Accelerator	Federico Nicolas Peccia, Svetlana Pavlitska, Tobias Fleck, Oliver Bringmann	2024-08-14	下载	The growing concerns regarding energy consumption and privacy have prompted the development of AI solutions deployable on the edge, circumventing the substantial CO2 emissions associated with cloud se...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
A Case for Enabling Delegation of 5G Core Decisions to the RAN	Lucas Vancina, Geoffrey Xie	2024-08-14	下载	Under conventional 5G system design, the authentication and continuous monitoring of user equipment (UE) demands a reliable backhaul connection between the radio access network (RAN) and the core netw...
Context-aware Container Orchestration in Serverless Edge Computing	Peiyuan Guan, Chen Chen, Ziru Chen, Lin X. Cai, Xing Hao, Amir Taherkordi	2024-08-14	下载	Adopting serverless computing to edge networks benefits end-users from the pay-as-you-use billing model and flexible scaling of applications. This paradigm extends the boundaries of edge computing and...
A First Look at Related Website Sets	Stephen McQuistin, Peter Snyder, Hamed Haddadi, Gareth Tyson	2024-08-14	下载	We present the first measurement of the user-effect and privacy impact of "Related Website Sets," a recent proposal to reduce browser privacy protections between two sites if those sites are related t...
Optical Networks	Varsha Lohani, Anjali Sharma, Yatindra Nath Singh, Kumari Akansha, Baljinder Singh Heera, Pallavi Athe	2024-08-14	下载	Optical networks play a crucial role in todays digital topography, enabling the high-speed and reliable transmission of vast amounts of data over optical fibre for long distances.
A Stability-first Approach to Running TCP over Starlink	Gregory Stock, Juan A. Fraire, Santiago Henn, Holger Hermanns, Andreas Schmidt	2024-08-14	下载	The end-to-end connectivity patterns between two points on Earth are highly volatile if mediated via a Low-Earth orbit (LEO) satellite constellation.
A MAC Protocol with Time Reversal for Wireless Networks within Computing Packages	Ama Bandara, Abhijit Das, Fátima Rodríguez-Galán, Eduard Alarcón, Sergi Abadal	2024-08-14	下载	Wireless Network-on-Chip (WNoC) is a promising concept which provides a solution to overcome the scalability issues in prevailing networks-in-package for many-core processors.

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Inspection of I/O Operations from System Call Traces using Directly-Follows-Graph	Aravind Sankaran, Ilya Zhukov, Wolfgang Frings, Paolo Bientinesi	2024-08-14	下载	We aim to identify the differences in Input/Output(I/O) behavior between multiple user programs through the inspection of system calls (i.e., requests made to the operating system).

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Portability of Fortran's `do concurrent' on GPUs	Ronald M. Caplan, Miko M. Stulajter, Jon A. Linker, Jeff Larkin, Henry A. Gabb, Shiquan Su, Ivan Rodriguez, Zachary Tschirhart, Nicholas Malaya	2024-08-14	下载	There is a continuing interest in using standard language constructs for accelerated computing in order to avoid (sometimes vendor-specific) external APIs.
Inspection of I/O Operations from System Call Traces using Directly-Follows-Graph	Aravind Sankaran, Ilya Zhukov, Wolfgang Frings, Paolo Bientinesi	2024-08-14	下载	We aim to identify the differences in Input/Output(I/O) behavior between multiple user programs through the inspection of system calls (i.e., requests made to the operating system).