
2024-01-17

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Élivágar: Efficient Quantum Circuit Search for Classification | Sashwat Anagolum, Narges Alavisamani, Poulami Das, Moinuddin Qureshi, Eric Kessler, Yunong Shi | 2024-01-17 | Download | Designing performant and noise-robust circuits for Quantum Machine Learning (QML) is challenging -- the design space scales exponentially with circuit size, and there are few well-supported guiding pr... |
| LRSCwait: Enabling Scalable and Efficient Synchronization in Manycore Systems through Polling-Free and Retry-Free Operation | Samuel Riedel, Marc Gantenbein, Alessandro Ottaviano, Torsten Hoefler, Luca Benini | 2024-01-17 | Download | Extensive polling in shared-memory manycore systems can lead to contention, decreased throughput, and poor energy efficiency. Both lock implementations and the general-purpose atomic operation, load-r... |
| Exploration of Activation Fault Reliability in Quantized Systolic Array-Based DNN Accelerators | Mahdi Taheri, Natalia Cherezova, Mohammad Saeed Ansari, Maksim Jenihhin, Ali Mahani, Masoud Daneshtalab, Jaan Raik | 2024-01-17 | Download | The stringent requirements for the Deep Neural Networks (DNNs) accelerator's reliability stand along with the need for reducing the computational burden on the hardware platforms, i.e. |
| VeriBug: An Attention-based Framework for Bug-Localization in Hardware Designs | Giuseppe Stracquadanio, Sourav Medya, Stefano Quer, Debjit Pal | 2024-01-17 | Download | In recent years, there has been an exponential growth in the size and complexity of System-on-Chip designs targeting different specialized applications. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native | Yao Lu, Song Bian, Lequn Chen, Yongjun He, Yulong Hui, Matthew Lentz, Beibin Li, Fei Liu, Jialin Li, Qi Liu, Rui Liu, Xiaoxuan Liu, Lin Ma, Kexin Rong, Jianguo Wang, Yingjun Wu, Yongji Wu, Huanchen Zhang, Minjia Zhang, Qizhen Zhang, Tianyi Zhou, Danyang Zhuo | 2024-01-17 | Download | In this paper, we investigate the intersection of large generative AI models and cloud-native computing architectures. Recent large models such as ChatGPT, while revolutionary in their capabilities, f... |
| Swing: Short-cutting Rings for Higher Bandwidth Allreduce | Daniele De Sensi, Tommaso Bonato, David Saam, Torsten Hoefler | 2024-01-17 | Download | The allreduce collective operation accounts for a significant fraction of the runtime of workloads running on distributed systems. One factor determining its performance is the distance between commun... |
| Guardian: Safe GPU Sharing in Multi-Tenant Environments | Manos Pavlidakis, Giorgos Vasiliadis, Stelios Mavridis, Anargyros Argyros, Antony Chazapis, Angelos Bilas | 2024-01-17 | Download | Modern GPU applications, such as machine learning (ML), can only partially utilize GPUs, leading to GPU underutilization in cloud environments. |
| PIM-STM: Software Transactional Memory for Processing-In-Memory Systems | André Lopes, Daniel Castro, Paolo Romano | 2024-01-17 | Download | Processing-In-Memory (PIM) is a novel approach that augments existing DRAM memory chips with lightweight logic. By allowing to offload computations to the PIM system, this architecture allows for circ... |
| A Blockchain-based Model for Securing Data Pipeline in a Heterogeneous Information System | MN Ramahlosi, Y Madani, A Akanbi | 2024-01-17 | Download | In our digital world, access to personal and public data has become an item of concern, with challenging security and privacy aspects. Modern information systems are heterogeneous in nature and have a... |
| Data Trading and Monetization: Challenges and Open Research Directions | Qusai Ramadan, Zeyd Boukhers, Muath AlShaikh, Christoph Lange, Jan Jürjens | 2024-01-17 | Download | Traditional data monetization approaches face challenges related to data protection and logistics. In response, digital data marketplaces have emerged as intermediaries simplifying data transactions. |
| InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding | Qiaoling Chen, Diandian Gu, Guoteng Wang, Xun Chen, YingTong Xiong, Ting Huang, Qinghao Hu, Xin Jin, Yonggang Wen, Tianwei Zhang, Peng Sun | 2024-01-17 | Download | Large language models (LLMs) with long sequences begin to power more and more fundamentally new applications we use every day. Existing methods for long-sequence LLM training are neither efficient nor... |
| cedar: Optimized and Unified Machine Learning Input Data Pipelines | Mark Zhao, Emanuel Adamiak, Christos Kozyrakis | 2024-01-17 | Download | The input data pipeline is an essential component of each machine learning (ML) training job. It is responsible for reading massive amounts of training data, processing batches of samples using comple... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Staggered Comb Reference Signal Design for Integrated Communication and Sensing | Rui Zhang, Shawn Tsai, Tzu-Han Chou, Jiaying Ren | 2024-01-17 | Download | Ambiguity performance is a critical criterion in radar sensor design, which indicates the ambiguities arising from multiple target estimation and detection. |
| OFDM Reference Signal Pattern Design Criteria for Integrated Communication and Sensing | Rui Zhang, Shawn Tsai, Tzu-Han Chou, Jiaying Ren, Wenze Qu, Oliver Sun | 2024-01-17 | Download | Extended ambiguity performance (EAP), which includes all grating lobes and side peaks, indicates the maximum detectable region without undesired peaks for target parameter estimation and is critical t... |
| Cost-effective and performant virtual WANs with CORNIFER | Anjali, Rachee Singh, Michael M. Swift | 2024-01-17 | Download | Virtual wide-area networks (WANs) are WAN-as-a-service cloud offerings that aim to bring the performance benefits of dedicated wide-area interconnects to enterprise customers. |
| Detection of Distributed Denial of Service Attacks Carried Out by Botnets in Software-Defined Networks | Jaime Tamayo, Lorena Isabel Barona López, Ángel Leonardo Valdivieso Caraguay | 2024-01-17 | Download | Recent years witnessed a surge in network traffic due to the emergence of new online services, causing periodic saturation and complexity problems. |
| Swing: Short-cutting Rings for Higher Bandwidth Allreduce | Daniele De Sensi, Tommaso Bonato, David Saam, Torsten Hoefler | 2024-01-17 | Download | The allreduce collective operation accounts for a significant fraction of the runtime of workloads running on distributed systems. One factor determining its performance is the distance between commun... |
| A Fast Control Plane for a Large-Scale and High-Speed Optical Circuit Switch System | Ryousei Takano, Kiyo Ishii, Toshiyuki Shimizu, Fumihiro Okazaki, Shu Namiki, Ken-ichi Sato | 2024-01-17 | Download | We experimentally verify a fast control plane with 100 microseconds of configuration time that can support more than 1000 racks, leveraged by a software-defined network controller and an industrial re... |
| Offloading platooning applications from 5.9 GHz V2X to Radar Communications: effects on safety and efficiency | Elena Haller, Galina Sidorenko, Oscar Amador, Emil Nilsson | 2024-01-17 | Download | V2X communications are nowadays performed at 5.9 GHz spectrum, either using WiFi-based or Cellular technology. The channel capacity is limited, and congestion control regulates the number of messages... |
| A Blockchain-based Model for Securing Data Pipeline in a Heterogeneous Information System | MN Ramahlosi, Y Madani, A Akanbi | 2024-01-17 | Download | In our digital world, access to personal and public data has become an item of concern, with challenging security and privacy aspects. Modern information systems are heterogeneous in nature and have a... |
| Scalable Resource Provisioning for Multi-user Communications in Next Generation Networks | Augusto Neto, Eduardo Cerqueira, Marilia Curado, Edmundo Monteiro, Paulo Mendes | 2024-01-17 | Download | The great demand for real-time multimedia sessions encompassing groups of users (multi-user), associated with the limitations of the current Internet in providing quality assurance, has raised challen... |
| Cross-Domain AI for Early Attack Detection and Defense Against Malicious Flows in O-RAN | Bruno Missi Xavier, Merim Dzaferagic, Irene Vilà, Magnos Martinello, Marco Ruffini | 2024-01-17 | Download | In the fight against cyber attacks, Network Softwarization (NS) is a flexible and adaptable shield, using advanced software to spot malicious activity in regular network traff... |
| The Mikado Filesystem: An experimental RPC filesystem running over gRPC | John D. Dougrez-Lewis | 2024-01-17 | Download | Computer applications seeking to persist files remotely across the Internet are faced with a bewildering choice of mechanisms which tend to boil down to monolithic proprietary closed-source Vendor sol... |
| Named Service Networking as a primer for the Metaverse | Paulo Mendes | 2024-01-17 | Download | Ubiquitous extended reality environments such as the Metaverse will have a significant impact on the Internet, which will evolve to interconnect a large number of mixed reality spaces. |
| On Optimization of Next-Generation Microservice-Based Core Networks | Andrea Tassi, Daniel Warren, Yue Wang, Deval Bhamare, Rasoul Behravesh | 2024-01-17 | Download | Next-generation mobile core networks are required to be scalable and capable of efficiently utilizing heterogeneous bare metal resources that may include edge servers. |
| An Improved Virtual Force Approach for UAV Deployment and Resource Allocation in Emergency Communications | Hongying Guo, Li Wang, Ruoguang Li, Luyang Hou, Lianming Xu, Aiguo Fei | 2024-01-17 | Download | In this paper, we consider an unmanned aerial vehicle (UAV)-enabled emergency communication system, which establishes temporary communication link with users equipment (UEs) in a typical disaster envi... |
| Characterizing TCP's Performance for Low-Priority Flows Inside a Cloud | Hafiz Mohsin Bashir, Abdullah Bin Faisal, Fahad R. Dogar | 2024-01-17 | Download | Many cloud systems utilize low-priority flows to achieve various performance objectives (e.g., low latency, high utilization), relying on TCP as their preferred transport protocol. |

cs.OS - Operating Systems

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Herding LLaMaS: Using LLMs as an OS Module | Aditya K Kamath, Sujay Yadalam | 2024-01-17 | Download | Computer systems are becoming increasingly heterogeneous with the emergence of new memory technologies and compute devices. GPUs alongside CPUs have become commonplace and CXL is poised to be a mainst... |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Cost-effective and performant virtual WANs with CORNIFER | Anjali, Rachee Singh, Michael M. Swift | 2024-01-17 | Download | Virtual wide-area networks (WANs) are WAN-as-a-service cloud offerings that aim to bring the performance benefits of dedicated wide-area interconnects to enterprise customers. |
| Swing: Short-cutting Rings for Higher Bandwidth Allreduce | Daniele De Sensi, Tommaso Bonato, David Saam, Torsten Hoefler | 2024-01-17 | Download | The allreduce collective operation accounts for a significant fraction of the runtime of workloads running on distributed systems. One factor determining its performance is the distance between commun... |
| Hierarchical Analyses Applied to Computer System Performance: Review and Call for Further Studies | Alexander Thomasian | 2024-01-17 | Download | We review studies based on analytic and simulation methods for hierarchical performance analysis of Queueing Network - QN models, which result in an order of magnitude reduction in performance evaluat... |
| cedar: Optimized and Unified Machine Learning Input Data Pipelines | Mark Zhao, Emanuel Adamiak, Christos Kozyrakis | 2024-01-17 | Download | The input data pipeline is an essential component of each machine learning (ML) training job. It is responsible for reading massive amounts of training data, processing batches of samples using comple... |
