
2024-08-14

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Development of simulation model for Single Carrier Transceiver for Nanosatellite | Pallewela R. C. K, Rohana Thilakuamra, Prabath Buddhika | 2024-08-14 | Download | CubeSat is a nanosatellite concept that emerged from a paper published by Stanford University; because of its low cost and high feasibility, more researchers have started working on nanosatellites. |
| Efficient Edge AI: Deploying Convolutional Neural Networks on FPGA with the Gemmini Accelerator | Federico Nicolas Peccia, Svetlana Pavlitska, Tobias Fleck, Oliver Bringmann | 2024-08-14 | Download | The growing concerns regarding energy consumption and privacy have prompted the development of AI solutions deployable on the edge, circumventing the substantial CO2 emissions associated with cloud se... |
| LPU: A Latency-Optimized and Highly Scalable Processor for Large Language Model Inference | Seungjae Moon, Jung-Hoon Kim, Junsoo Kim, Seongmin Hong, Junseo Cha, Minsu Kim, Sukbin Lim, Gyubin Choi, Dongjin Seo, Jongho Kim, Hunjong Lee, Hyunjun Park, Ryeowook Ko, Soongyu Choi, Jongse Park, Jinwon Lee, Joo-Young Kim | 2024-08-14 | Download | The explosive arrival of OpenAI's ChatGPT has fueled the globalization of large language model (LLM), which consists of billions of pretrained parameters that embodies the aspects of syntax and semant... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Learning-Augmented Competitive Algorithms for Spatiotemporal Online Allocation with Deadline Constraints | Adam Lechowicz, Nicolas Christianson, Bo Sun, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, Prashant Shenoy | 2024-08-14 | Download | We introduce and study spatiotemporal online allocation with deadline constraints ($\mathsf{SOAD}$), a new online problem motivated by emerging challenges in sustainability and energy. |
| Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference | Rohan Baskar Prabhakar, Hengrui Zhang, David Wentzlaff | 2024-08-14 | Download | Large Transformer networks are increasingly used in settings where low inference latency can improve the end-user experience and enable new applications. |
| Modernizing an Operational Real-time Tsunami Simulator to Support Diverse Hardware Platforms | Keichi Takahashi, Takashi Abe, Akihiro Musa, Yoshihiko Sato, Yoichi Shimomura, Hiroyuki Takizawa, Shunichi Koshimura | 2024-08-14 | Download | To issue early warnings and rapidly initiate disaster responses after tsunami damage, various tsunami inundation forecast systems have been deployed worldwide. |
| FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher | Alessio Mora, Lorenzo Valerio, Paolo Bellavista, Andrea Passarella | 2024-08-14 | Download | Federated Learning (FL) systems enable the collaborative training of machine learning models without requiring centralized collection of individual data. |
| Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems | Ning Lu, Qian Xie, Hao Zhang, Wenyi Fang, Yang Zheng, Zheng Hu, Jiantao Ma | 2024-08-14 | Download | Large Language Models (LLMs) are revolutionizing the AI industry with their superior capabilities. Training these models requires large-scale GPU clusters and significant computing time, leading to fr... |
| UNR: Unified Notifiable RMA Library for HPC | Guangnan Feng, Jiabin Xie, Dezun Dong, Yutong Lu | 2024-08-14 | Download | Remote Memory Access (RMA) enables direct access to remote memory to achieve high performance for HPC applications. However, most modern parallel programming models lack schemes for the remote process... |
| Efficient Edge AI: Deploying Convolutional Neural Networks on FPGA with the Gemmini Accelerator | Federico Nicolas Peccia, Svetlana Pavlitska, Tobias Fleck, Oliver Bringmann | 2024-08-14 | Download | The growing concerns regarding energy consumption and privacy have prompted the development of AI solutions deployable on the edge, circumventing the substantial CO2 emissions associated with cloud se... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Case for Enabling Delegation of 5G Core Decisions to the RAN | Lucas Vancina, Geoffrey Xie | 2024-08-14 | Download | Under conventional 5G system design, the authentication and continuous monitoring of user equipment (UE) demands a reliable backhaul connection between the radio access network (RAN) and the core netw... |
| Context-aware Container Orchestration in Serverless Edge Computing | Peiyuan Guan, Chen Chen, Ziru Chen, Lin X. Cai, Xing Hao, Amir Taherkordi | 2024-08-14 | Download | Adopting serverless computing to edge networks benefits end-users from the pay-as-you-use billing model and flexible scaling of applications. This paradigm extends the boundaries of edge computing and... |
| A First Look at Related Website Sets | Stephen McQuistin, Peter Snyder, Hamed Haddadi, Gareth Tyson | 2024-08-14 | Download | We present the first measurement of the user-effect and privacy impact of "Related Website Sets," a recent proposal to reduce browser privacy protections between two sites if those sites are related t... |
| Optical Networks | Varsha Lohani, Anjali Sharma, Yatindra Nath Singh, Kumari Akansha, Baljinder Singh Heera, Pallavi Athe | 2024-08-14 | Download | Optical networks play a crucial role in today's digital topography, enabling the high-speed and reliable transmission of vast amounts of data over optical fibre for long distances. |
| A Stability-first Approach to Running TCP over Starlink | Gregory Stock, Juan A. Fraire, Santiago Henn, Holger Hermanns, Andreas Schmidt | 2024-08-14 | Download | The end-to-end connectivity patterns between two points on Earth are highly volatile if mediated via a Low-Earth orbit (LEO) satellite constellation. |
| A MAC Protocol with Time Reversal for Wireless Networks within Computing Packages | Ama Bandara, Abhijit Das, Fátima Rodríguez-Galán, Eduard Alarcón, Sergi Abadal | 2024-08-14 | Download | Wireless Network-on-Chip (WNoC) is a promising concept which provides a solution to overcome the scalability issues in prevailing networks-in-package for many-core processors. |

cs.OS - Operating Systems

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Inspection of I/O Operations from System Call Traces using Directly-Follows-Graph | Aravind Sankaran, Ilya Zhukov, Wolfgang Frings, Paolo Bientinesi | 2024-08-14 | Download | We aim to identify the differences in Input/Output (I/O) behavior between multiple user programs through the inspection of system calls (i.e., requests made to the operating system). |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Portability of Fortran's `do concurrent` on GPUs | Ronald M. Caplan, Miko M. Stulajter, Jon A. Linker, Jeff Larkin, Henry A. Gabb, Shiquan Su, Ivan Rodriguez, Zachary Tschirhart, Nicholas Malaya | 2024-08-14 | Download | There is a continuing interest in using standard language constructs for accelerated computing in order to avoid (sometimes vendor-specific) external APIs. |
| Inspection of I/O Operations from System Call Traces using Directly-Follows-Graph | Aravind Sankaran, Ilya Zhukov, Wolfgang Frings, Paolo Bientinesi | 2024-08-14 | Download | We aim to identify the differences in Input/Output (I/O) behavior between multiple user programs through the inspection of system calls (i.e., requests made to the operating system). |
