Skip to content

2024-01-31

cs.AR - Architecture

标题作者发布日期PDF摘要
ConSmax: Hardware-Friendly Alternative Softmax with Learnable ParametersShiwei Liu, Guanchen Tao, Yifei Zou, Derek Chow, Zichen Fan, Kauna Lei, Bangfei Pan, Dennis Sylvester, Gregory Kielian, Mehdi Saligane2024-01-31下载The self-attention mechanism distinguishes transformer-based large language models (LLMs) apart from convolutional and recurrent neural networks.
Makinote: An FPGA-Based HW/SW Platform for Pre-Silicon Emulation of RISC-V DesignsElias Perdomo, Alexander Kropotov, Francelly Cano, Syed Zafar, Teresa Cervero, Xavier Martorell, Behzad Salami2024-01-31下载Emulating chip functionality before silicon production is crucial, especially with the increasing prevalence of RISC-V-based designs. FPGAs are promising candidates for such purposes due to their high...
QTFlow: Quantitative Timing-Sensitive Information Flow for Security-Aware Hardware Design on RTLLennart M. Reimann, Anshul Prashar, Chiara Ghinami, Rebecca Pelke, Dominik Sisejkovic, Farhad Merchant, Rainer Leupers2024-01-31下载In contemporary Electronic Design Automation (EDA) tools, security often takes a backseat to the primary goals of power, performance, and area optimization.
High-Performance Data Mapping for BNNs on PCM-based Integrated PhotonicsTaha Shahroodi, Raphael Cardoso, Stephan Wong, Alberto Bosio, Ian O'Connor, Said Hamdioui2024-01-31下载State-of-the-Art (SotA) hardware implementations of Deep Neural Networks (DNNs) incur high latencies and costs. Binary Neural Networks (BNNs) are potential alternative solutions to realize faster impl...
STAR: An Efficient Softmax Engine for Attention Model with RRAM CrossbarYifeng Zhai, Bing Li, Bonan Yan, Jing Wang2024-01-31下载RRAM crossbars have been studied to construct in-memory accelerators for neural network applications due to their in-situ computing capability.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
FedCore: Straggler-Free Federated Learning with Distributed CoresetsHongpeng Guo, Haotian Gu, Xiaoyang Wang, Bo Chen, Eun Kyung Lee, Tamar Eilam, Deming Chen, Klara Nahrstedt2024-01-31下载Federated learning (FL) is a machine learning paradigm that allows multiple clients to collaboratively train a shared model while keeping their data on-premise.
MP-SL: Multihop Parallel Split LearningJoana Tirana, Spyros Lalis, Dimitris Chatzopoulos2024-01-31下载Federated Learning (FL) stands out as a widely adopted protocol facilitating the training of Machine Learning (ML) models while maintaining decentralized data.
Decomposable Submodular Maximization in Federated SettingAkbar Rafiey2024-01-31下载Submodular functions, as well as the sub-class of decomposable submodular functions, and their optimization appear in a wide range of applications in machine learning, recommendation systems, and welf...
Service Level Agreements and Security SLA: A Comprehensive SurveySerena Nicolazzo, Antonino Nocera, Witold Pedrycz2024-01-31下载A Service Level Agreement (SLA) is a formal contract between a service provider and a consumer, representing a crucial instrument to define, manage, and maintain relationships between these two partie...
Model-driven development of data intensive applications over cloud resourcesRafael Tolosana-Calasanz, José Ángel Bañares, José-Manuel Colom2024-01-31下载The proliferation of sensors over the last years has generated large amounts of raw data, forming data streams that need to be processed. In many cases, cloud resources are used for such processing, e...
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving SystemsYuxin Wang, Yuhan Chen, Zeyu Li, Xueze Kang, Yuchu Fang, Yeju Zhou, Yang Zheng, Zhenheng Tang, Xin He, Rui Guo, Xin Wang, Qiang Wang, Amelie Chi Zhou, Xiaowen Chu2024-01-31下载Serving systems for Large Language Models (LLMs) are often optimized to improve quality of service (QoS) and throughput. However, due to the lack of open-source LLM serving workloads, these systems ar...
Bitcoin Inscriptions: Foundations and BeyondNingran Li, Minfeng Qi, Qin Wang, Shiping Chen2024-01-31下载Bitcoin inscription marks a pivotal moment in blockchain technology. This report presents a primary exploration of Bitcoin inscriptions. We dive into the technological underpinnings and offer a detail...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Decentralized Covert Routing in Heterogeneous Networks Using Reinforcement LearningJustin Kong, Terrence J. Moore, Fikadu T. Dagefu2024-01-31下载This letter investigates covert routing communications in a heterogeneous network where a source transmits confidential data to a destination with the aid of relaying nodes where each transmitter judi...
How to Measure TLS, X.509 Certificates, and Web PKI: A Tutorial and Brief SurveyPouyan Fotouhi Tehrani, Eric Osterweil, Thomas C. Schmidt, Matthias Wählisch2024-01-31下载Transport Layer Security (TLS) is the base for many Internet applications and services to achieve end-to-end security. In this paper, we provide guidance on how to measure TLS deployments, including X...
Deterministic Computing Power Networking: Architecture, Technologies and ProspectsQingmin Jia, Yujiao Hu, Xiaomao Zhou, Qianpiao Ma, Kai Guo, Huayu Zhang, Renchao Xie, Tao Huang, Yunjie Liu2024-01-31下载With the development of new Internet services such as computation-intensive and delay-sensitive tasks, the traditional "Best Effort" network transmission mode has been greatly challenged.
Design and Testbed Deployment of Frequency-Domain Equalization Full Duplex RadiosManav Kohli, Mahmood Baraani Dastjerdi, Jin Zhou, Ivan Seskar, Harish Krishnaswamy, Gil Zussman, Tingjun Chen2024-01-31下载Full-duplex (FD) wireless can significantly enhance spectrum efficiency but requires effective self-interference (SI) cancellers. RF SI cancellation (SIC) via frequency-domain equalization (FDE), wher...
Time Synchronization for 5G and TSN Integrated NetworkingZixiao Wang, Zonghui Li, Xuan Qiao, Yiming Zheng, Bo Ai, Xiaoyu Song2024-01-31下载Emerging industrial applications involving robotic collaborative operations and mobile robots require a more reliable and precise wireless network for deterministic data transmission.
Version Innovation Age and Age of Incorrect Version for Monitoring Markovian SourcesMehrdad Salimnejad, Marios Kountouris, Anthony Ephremides, Nikolaos Pappas2024-01-31下载In this paper, we propose two new performance metrics, coined the Version Innovation Age (VIA) and the Age of Incorrect Version (AoIV) for real-time monitoring of a two-state Markov process over an un...
IQ Skew and Imbalance Estimation for Coherent Point-to-Multi-Point Optical NetworksJi Zhou, Jianrui Zeng, Haide Wang, Dong Guo, Liangchuan Li, Weiping Liu, Changyuan Yu2024-01-31下载Coherent point-to-multi-point (PtMP) optical network based on digital subcarrier multiplexing (DSCM) has been a promising technology for metro and access networks to achieve cost savings, low latency,...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Beyond Control: Exploring Novel File System Objects for Data-Only Attacks on Linux SystemsJinmeng Zhou, Jiayi Hu, Ziyue Pan, Jiaxun Zhu, Wenbo Shen, Guoren Li, Zhiyun Qian2024-01-31下载The widespread deployment of control-flow integrity has propelled non-control data attacks into the mainstream. In the domain of OS kernel exploits, by corrupting critical non-control data, local atta...

cs.PF - Performance

标题作者发布日期PDF摘要
Makinote: An FPGA-Based HW/SW Platform for Pre-Silicon Emulation of RISC-V DesignsElias Perdomo, Alexander Kropotov, Francelly Cano, Syed Zafar, Teresa Cervero, Xavier Martorell, Behzad Salami2024-01-31下载Emulating chip functionality before silicon production is crucial, especially with the increasing prevalence of RISC-V-based designs. FPGAs are promising candidates for such purposes due to their high...
A Modular Graph-Native Query Optimization FrameworkBingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Wenyuan Yu, Jingren Zhou2024-01-31下载Complex Graph Patterns (CGPs), which combine pattern matching with relational operations, are widely used in real-world applications. Existing systems rely on monolithic architectures for CGPs, which ...
BurstGPT: A Real-world Workload Dataset to Optimize LLM Serving SystemsYuxin Wang, Yuhan Chen, Zeyu Li, Xueze Kang, Yuchu Fang, Yeju Zhou, Yang Zheng, Zhenheng Tang, Xin He, Rui Guo, Xin Wang, Qiang Wang, Amelie Chi Zhou, Xiaowen Chu2024-01-31下载Serving systems for Large Language Models (LLMs) are often optimized to improve quality of service (QoS) and throughput. However, due to the lack of open-source LLM serving workloads, these systems ar...

基于 VitePress 构建