2024-01-31

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
ConSmax: Hardware-Friendly Alternative Softmax with Learnable Parameters	Shiwei Liu, Guanchen Tao, Yifei Zou, Derek Chow, Zichen Fan, Kauna Lei, Bangfei Pan, Dennis Sylvester, Gregory Kielian, Mehdi Saligane	2024-01-31	下载	The self-attention mechanism distinguishes transformer-based large language models (LLMs) apart from convolutional and recurrent neural networks.
Makinote: An FPGA-Based HW/SW Platform for Pre-Silicon Emulation of RISC-V Designs	Elias Perdomo, Alexander Kropotov, Francelly Cano, Syed Zafar, Teresa Cervero, Xavier Martorell, Behzad Salami	2024-01-31	下载	Emulating chip functionality before silicon production is crucial, especially with the increasing prevalence of RISC-V-based designs. FPGAs are promising candidates for such purposes due to their high...
QTFlow: Quantitative Timing-Sensitive Information Flow for Security-Aware Hardware Design on RTL	Lennart M. Reimann, Anshul Prashar, Chiara Ghinami, Rebecca Pelke, Dominik Sisejkovic, Farhad Merchant, Rainer Leupers	2024-01-31	下载	In contemporary Electronic Design Automation (EDA) tools, security often takes a backseat to the primary goals of power, performance, and area optimization.
High-Performance Data Mapping for BNNs on PCM-based Integrated Photonics	Taha Shahroodi, Raphael Cardoso, Stephan Wong, Alberto Bosio, Ian O'Connor, Said Hamdioui	2024-01-31	下载	State-of-the-Art (SotA) hardware implementations of Deep Neural Networks (DNNs) incur high latencies and costs. Binary Neural Networks (BNNs) are potential alternative solutions to realize faster impl...
STAR: An Efficient Softmax Engine for Attention Model with RRAM Crossbar	Yifeng Zhai, Bing Li, Bonan Yan, Jing Wang	2024-01-31	下载	RRAM crossbars have been studied to construct in-memory accelerators for neural network applications due to their in-situ computing capability.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
FedCore: Straggler-Free Federated Learning with Distributed Coresets	Hongpeng Guo, Haotian Gu, Xiaoyang Wang, Bo Chen, Eun Kyung Lee, Tamar Eilam, Deming Chen, Klara Nahrstedt	2024-01-31	下载	Federated learning (FL) is a machine learning paradigm that allows multiple clients to collaboratively train a shared model while keeping their data on-premise.
MP-SL: Multihop Parallel Split Learning	Joana Tirana, Spyros Lalis, Dimitris Chatzopoulos	2024-01-31	下载	Federated Learning (FL) stands out as a widely adopted protocol facilitating the training of Machine Learning (ML) models while maintaining decentralized data.
Decomposable Submodular Maximization in Federated Setting	Akbar Rafiey	2024-01-31	下载	Submodular functions, as well as the sub-class of decomposable submodular functions, and their optimization appear in a wide range of applications in machine learning, recommendation systems, and welf...
Service Level Agreements and Security SLA: A Comprehensive Survey	Serena Nicolazzo, Antonino Nocera, Witold Pedrycz	2024-01-31	下载	A Service Level Agreement (SLA) is a formal contract between a service provider and a consumer, representing a crucial instrument to define, manage, and maintain relationships between these two partie...
Model-driven development of data intensive applications over cloud resources	Rafael Tolosana-Calasanz, José Ángel Bañares, José-Manuel Colom	2024-01-31	下载	The proliferation of sensors over the last years has generated large amounts of raw data, forming data streams that need to be processed. In many cases, cloud resources are used for such processing, e...
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems	Yuxin Wang, Yuhan Chen, Zeyu Li, Xueze Kang, Yuchu Fang, Yeju Zhou, Yang Zheng, Zhenheng Tang, Xin He, Rui Guo, Xin Wang, Qiang Wang, Amelie Chi Zhou, Xiaowen Chu	2024-01-31	下载	Serving systems for Large Language Models (LLMs) are often optimized to improve quality of service (QoS) and throughput. However, due to the lack of open-source LLM serving workloads, these systems ar...
Bitcoin Inscriptions: Foundations and Beyond	Ningran Li, Minfeng Qi, Qin Wang, Shiping Chen	2024-01-31	下载	Bitcoin inscription marks a pivotal moment in blockchain technology. This report presents a primary exploration of Bitcoin inscriptions. We dive into the technological underpinnings and offer a detail...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Decentralized Covert Routing in Heterogeneous Networks Using Reinforcement Learning	Justin Kong, Terrence J. Moore, Fikadu T. Dagefu	2024-01-31	下载	This letter investigates covert routing communications in a heterogeneous network where a source transmits confidential data to a destination with the aid of relaying nodes where each transmitter judi...
How to Measure TLS, X.509 Certificates, and Web PKI: A Tutorial and Brief Survey	Pouyan Fotouhi Tehrani, Eric Osterweil, Thomas C. Schmidt, Matthias Wählisch	2024-01-31	下载	Transport Layer Security (TLS) is the base for many Internet applications and services to achieve end-to-end security. In this paper, we provide guidance on how to measure TLS deployments, including X...
Deterministic Computing Power Networking: Architecture, Technologies and Prospects	Qingmin Jia, Yujiao Hu, Xiaomao Zhou, Qianpiao Ma, Kai Guo, Huayu Zhang, Renchao Xie, Tao Huang, Yunjie Liu	2024-01-31	下载	With the development of new Internet services such as computation-intensive and delay-sensitive tasks, the traditional "Best Effort" network transmission mode has been greatly challenged.
Design and Testbed Deployment of Frequency-Domain Equalization Full Duplex Radios	Manav Kohli, Mahmood Baraani Dastjerdi, Jin Zhou, Ivan Seskar, Harish Krishnaswamy, Gil Zussman, Tingjun Chen	2024-01-31	下载	Full-duplex (FD) wireless can significantly enhance spectrum efficiency but requires effective self-interference (SI) cancellers. RF SI cancellation (SIC) via frequency-domain equalization (FDE), wher...
Time Synchronization for 5G and TSN Integrated Networking	Zixiao Wang, Zonghui Li, Xuan Qiao, Yiming Zheng, Bo Ai, Xiaoyu Song	2024-01-31	下载	Emerging industrial applications involving robotic collaborative operations and mobile robots require a more reliable and precise wireless network for deterministic data transmission.
Version Innovation Age and Age of Incorrect Version for Monitoring Markovian Sources	Mehrdad Salimnejad, Marios Kountouris, Anthony Ephremides, Nikolaos Pappas	2024-01-31	下载	In this paper, we propose two new performance metrics, coined the Version Innovation Age (VIA) and the Age of Incorrect Version (AoIV) for real-time monitoring of a two-state Markov process over an un...
IQ Skew and Imbalance Estimation for Coherent Point-to-Multi-Point Optical Networks	Ji Zhou, Jianrui Zeng, Haide Wang, Dong Guo, Liangchuan Li, Weiping Liu, Changyuan Yu	2024-01-31	下载	Coherent point-to-multi-point (PtMP) optical network based on digital subcarrier multiplexing (DSCM) has been a promising technology for metro and access networks to achieve cost savings, low latency,...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Beyond Control: Exploring Novel File System Objects for Data-Only Attacks on Linux Systems	Jinmeng Zhou, Jiayi Hu, Ziyue Pan, Jiaxun Zhu, Wenbo Shen, Guoren Li, Zhiyun Qian	2024-01-31	下载	The widespread deployment of control-flow integrity has propelled non-control data attacks into the mainstream. In the domain of OS kernel exploits, by corrupting critical non-control data, local atta...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Makinote: An FPGA-Based HW/SW Platform for Pre-Silicon Emulation of RISC-V Designs	Elias Perdomo, Alexander Kropotov, Francelly Cano, Syed Zafar, Teresa Cervero, Xavier Martorell, Behzad Salami	2024-01-31	下载	Emulating chip functionality before silicon production is crucial, especially with the increasing prevalence of RISC-V-based designs. FPGAs are promising candidates for such purposes due to their high...
A Modular Graph-Native Query Optimization Framework	Bingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Wenyuan Yu, Jingren Zhou	2024-01-31	下载	Complex Graph Patterns (CGPs), which combine pattern matching with relational operations, are widely used in real-world applications. Existing systems rely on monolithic architectures for CGPs, which ...
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving Systems	Yuxin Wang, Yuhan Chen, Zeyu Li, Xueze Kang, Yuchu Fang, Yeju Zhou, Yang Zheng, Zhenheng Tang, Xin He, Rui Guo, Xin Wang, Qiang Wang, Amelie Chi Zhou, Xiaowen Chu	2024-01-31	下载	Serving systems for Large Language Models (LLMs) are often optimized to improve quality of service (QoS) and throughput. However, due to the lack of open-source LLM serving workloads, these systems ar...