2025-04-22

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
The Dawn of Disaggregation and the Coherence Conundrum: A Call for Federated Coherence	Jaewan Hong, Marcos K. Aguilera, Emmanuel Amaro, Vincent Liu, Aurojit Panda, Ion Stoica	2025-04-22	Download	Disaggregated memory is an upcoming data center technology that will allow nodes (servers) to share data efficiently. Sharing data creates a debate on the level of cache coherence the system should pr...
COBRA: Algorithm-Architecture Co-optimized Binary Transformer Accelerator for Edge Inference	Ye Qiao, Zhiheng Chen, Yian Wang, Yifan Zhang, Yunzhe Deng, Sitao Huang	2025-04-22	Download	Transformer-based models have demonstrated superior performance in various fields, including natural language processing and computer vision. However, their enormous model size and high demands in com...
TeLLMe: An Energy-Efficient Ternary LLM Accelerator for Prefilling and Decoding on Edge FPGAs	Ye Qiao, Zhiheng Chen, Yifan Zhang, Yian Wang, Sitao Huang	2025-04-22	Download	Deploying large language models (LLMs) on edge platforms is challenged by their high computational and memory demands. Although recent low-bit quantization methods (e.g.
FPGA-Based Neural Network Accelerators for Space Applications: A Survey	Pedro Antunes, Artur Podobas	2025-04-22	Download	Space missions are becoming increasingly ambitious, necessitating high-performance onboard spacecraft computing systems. In response, field-programmable gate arrays (FPGAs) have garnered significant i...
EFFACT: A Highly Efficient Full-Stack FHE Acceleration Platform	Yi Huang, Xinsheng Gong, Xiangyu Kong, Dibei Chen, Jianfeng Zhu, Wenping Zhu, Liangwei Li, Mingyu Gao, Shaojun Wei, Aoyang Zhang, Leibo Liu	2025-04-22	Download	Fully Homomorphic Encryption (FHE) is a set of powerful cryptographic schemes that allows computation to be performed directly on encrypted data with an unlimited depth.
Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback	Ning Wang, Bingkun Yao, Jie Zhou, Yuchen Hu, Xi Wang, Nan Guan, Zhe Jiang	2025-04-22	Download	Large language models (LLMs) have shown strong performance in Verilog generation from natural language description. However, ensuring the functional correctness of the generated code remains a signifi...
BBAL: A Bidirectional Block Floating Point-Based Quantisation Accelerator for Large Language Models	Xiaomeng Han, Yuan Cheng, Jing Wang, Junyang Lu, Hui Wang, X. x. Zhang, Ning Xu, Dawei Yang, Zhe Jiang	2025-04-22	Download	Large language models (LLMs), with their billions of parameters, pose substantial challenges for deployment on edge devices, straining both memory capacity and computational resources.
Zoozve: A Strip-Mining-Free RISC-V Vector Extension with Arbitrary Register Grouping Compilation Support (WIP)	Siyi Xu, Limin Jiang, Yintao Liu, Yihao Shen, Yi Shi, Shan Cao, Zhiyuan Jiang	2025-04-22	Download	Vector processing is crucial for boosting processor performance and efficiency, particularly with data-parallel tasks. The RISC-V "V" Vector Extension (RVV) enhances algorithm efficiency by supporting...
VeriCoder: Enhancing LLM-Based RTL Code Generation through Functional Correctness Validation	Anjiang Wei, Huanmi Tan, Tarun Suresh, Daniel Mendoza, Thiago S. F. X. Teixeira, Ke Wang, Caroline Trippel, Alex Aiken	2025-04-22	Download	Recent advances in Large Language Models (LLMs) have sparked growing interest in applying them to Electronic Design Automation (EDA) tasks, particularly Register Transfer Level (RTL) code generation.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
The Dawn of Disaggregation and the Coherence Conundrum: A Call for Federated Coherence	Jaewan Hong, Marcos K. Aguilera, Emmanuel Amaro, Vincent Liu, Aurojit Panda, Ion Stoica	2025-04-22	Download	Disaggregated memory is an upcoming data center technology that will allow nodes (servers) to share data efficiently. Sharing data creates a debate on the level of cache coherence the system should pr...
Two-Fold Byzantine Fault Tolerance Algorithm: Byzantine Consensus in Blockchain	Mohammad R. Shakournia, Pooya Jamshidi, Hamid Reza Faragardi, Nasser Yazdani	2025-04-22	Download	Blockchain technology offers a decentralized and secure method for storing and authenticating data, rendering it well-suited for various applications such as digital currencies, supply chain managemen...
Towards a Distributed Federated Learning Aggregation Placement using Particle Swarm Intelligence	Amir Ali-Pour, Sadra Bekrani, Laya Samizadeh, Julien Gascon-Samson	2025-04-22	Download	Federated learning has become a promising distributed learning concept with extra insurance on data privacy. Extensive studies on various models of Federated learning have been done since the coinage ...
Charting the Uncharted: The Landscape of Monero Peer-to-Peer Network	Yu Gao, Matija Piškorec, Yu Zhang, Nicolò Vallarano, Claudio J. Tessone	2025-04-22	Download	The Monero blockchain enables anonymous transactions through advanced cryptography in its peer-to-peer network, which underpins decentralization, security, and trustless interactions.
StreamRL: Scalable, Heterogeneous, and Elastic RL for LLMs with Disaggregated Stream Generation	Yinmin Zhong, Zili Zhang, Xiaoniu Song, Hanpeng Hu, Chao Jin, Bingyang Wu, Nuo Chen, Yukun Chen, Yu Zhou, Changyi Wan, Hongyu Zhou, Yimin Jiang, Yibo Zhu, Daxin Jiang	2025-04-22	Download	Reinforcement learning (RL) has become the core post-training technique for large language models (LLMs). RL for LLMs involves two stages: generation and training.
FailLite: Failure-Resilient Model Serving for Resource-Constrained Edge Environments	Li Wu, Walid A. Hanafy, Tarek Abdelzaher, David Irwin, Jesse Milzman, Prashant Shenoy	2025-04-22	Download	Model serving systems have become popular for deploying deep learning models for various latency-sensitive inference tasks. While traditional replication-based methods have been used for failure-resil...
Collaborative Split Federated Learning with Parallel Training and Aggregation	Yiannis Papageorgiou, Yannis Thomas, Alexios Filippakopoulos, Ramin Khalili, Iordanis Koutsopoulos	2025-04-22	Download	Federated learning (FL) operates based on model exchanges between the server and the clients, and it suffers from significant client-side computation and communication burden.
Residual-Evasive Attacks on ADMM in Distributed Optimization	Sabrina Bruckmeier, Huadong Mo, James Qin	2025-04-22	Download	This paper presents two attack strategies designed to evade detection in ADMM-based systems by preventing significant changes to the residual during the attacked iteration.
SeaLLM: Service-Aware and Latency-Optimized Resource Sharing for Large Language Model Inference	Yihao Zhao, Jiadun Chen, Peng Sun, Lei Li, Xuanzhe Liu, Xin Jin	2025-04-22	Download	Large language models (LLMs) with different architectures and sizes have been developed. Serving each LLM with dedicated GPUs leads to resource waste and service inefficiency due to the varying demand...
Towards True Work-Efficiency in Parallel Derandomization: MIS, Maximal Matching, and Hitting Set	Mohsen Ghaffari, Christoph Grunau	2025-04-22	Download	Derandomization is one of the classic topics studied in the theory of parallel computations, dating back to the early 1980s. Despite much work, all known techniques lead to deterministic algorithms th...
DR.FIX: Automatically Fixing Data Races at Industry Scale	Farnaz Behrang, Zhizhou Zhang, Georgian-Vlad Saioc, Peng Liu, Milind Chabbi	2025-04-22	Download	Data races are a prevalent class of concurrency bugs in shared-memory parallel programs, posing significant challenges to software reliability and reproducibility.
Scaling Neural-Network-Based Molecular Dynamics with Long-Range Electrostatic Interactions to 51 Nanoseconds per Day	Jianxiong Li, Beining Zhang, Mingzhen Li, Siyu Hu, Jinzhe Zeng, Lijun Liu, Guojun Yuan, Zhan Wang, Guangming Tan, Weile Jia	2025-04-22	Download	Neural network-based molecular dynamics (NNMD) simulations incorporating long-range electrostatic interactions have significantly extended the applicability to heterogeneous and ionic systems, enablin...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
BAROC: Concealing Packet Losses in LSNs with Bimodal Behavior Awareness for Livecast Ingestion	Haoyuan Zhao, Jianxin Shi, Guanzhen Wu, Hao Fang, Yi Ching Chou, Long Chen, Feng Wang, Jiangchuan Liu	2025-04-22	Download	The advent of Low-Earth Orbit satellite networks (LSNs), exemplified by initiatives like Starlink, OneWeb and Kuiper, has ushered in a new era of "Internet from Space" global con...
Two-Fold Byzantine Fault Tolerance Algorithm: Byzantine Consensus in Blockchain	Mohammad R. Shakournia, Pooya Jamshidi, Hamid Reza Faragardi, Nasser Yazdani	2025-04-22	Download	Blockchain technology offers a decentralized and secure method for storing and authenticating data, rendering it well-suited for various applications such as digital currencies, supply chain managemen...
Towards a Distributed Federated Learning Aggregation Placement using Particle Swarm Intelligence	Amir Ali-Pour, Sadra Bekrani, Laya Samizadeh, Julien Gascon-Samson	2025-04-22	Download	Federated learning has become a promising distributed learning concept with extra insurance on data privacy. Extensive studies on various models of Federated learning have been done since the coinage ...
Monero Peer-to-peer Network Topology Analysis	Yu Gao, Yu Zhang, Matija Piškorec, Claudio J. Tessone	2025-04-22	Download	Monero, a privacy-focused cryptocurrency, employs a decentralized peer-to-peer (P2P) network that plays a critical role in transaction propagation and consensus formation.
A Comparative and Measurement-Based Study on Real-Time Network KPI Extraction Methods for 5G and Beyond Applications	Batuhan Kaplan, Samed Keşir, Ahmet Faruk Coşkun	2025-04-22	Download	Key performance indicators (KPIs), which can be extracted from the standardized interfaces of network equipment defined by current standards, constitute a primary data source that can be leveraged in ...
Aerial Active STAR-RIS-assisted Satellite-Terrestrial Covert Communications	Chuang Zhang, Geng Sun, Jiahui Li, Jiacheng Wang, Ruichen Zhang, Dusit Niyato, Shiwen Mao, Abbas Jamalipour	2025-04-22	Download	An integration of satellites and terrestrial networks is crucial for enhancing performance of next generation communication systems. However, the networks are hindered by the long-distance path loss a...
Modelling and Performance Analysis of Non-Primary Channel Access in Wi-Fi Networks	Boris Bellalta, Francesc Wilhelmi, Lorenzo Galati-Giordano, Giovanni Geraci	2025-04-22	Download	This paper aims to improve our understanding of the performance of the Non-Primary Channel Access (NPCA) mechanism, a new feature introduced in IEEE 802.
RRC Signaling Storm Detection in O-RAN	Dang Kien Nguyen, Rim El Malki, Filippo Rebecchi	2025-04-22	Download	The Open Radio Access Network (O-RAN) marks a significant shift in the mobile network industry. By transforming a traditionally vertically integrated architecture into an open, data-driven one, O-RAN ...
Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models	Ze Yang, Yihong Jin, Juntian Liu, Xinhe Xu, Yihan Zhang, Shuyang Ji	2025-04-22	Download	The rapidly evolving cloud platforms and the escalating complexity of network traffic demand proper network traffic monitoring and anomaly detection to ensure network security and performance.
State-Aware IoT Scheduling Using Deep Q-Networks and Edge-Based Coordination	Qingyuan He, Chang Liu, Juecen Zhan, Weiqiang Huang, Ran Hao	2025-04-22	Download	This paper addresses the challenge of energy efficiency management faced by intelligent IoT devices in complex application environments. A novel optimization method is proposed, combining Deep Q-Netwo...

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Adaptive and Efficient Dynamic Memory Management for Hardware Enclaves	Vijay Dhanraj, Harpreet Singh Chawla, Tao Zhang, Daniel Manila, Eric Thomas Schneider, Erica Fu, Mona Vij, Chia-Che Tsai, Donald E. Porter	2025-04-22	Download	The second version of Intel Software Guard Extensions (Intel SGX), or SGX2, adds dynamic management of enclave memory and threads. The first version required the address space and thread counts to be ...
Guillotine: Hypervisors for Isolating Malicious AIs	James Mickens, Sarah Radway, Ravi Netravali	2025-04-22	Download	As AI models become more embedded in critical sectors like finance, healthcare, and the military, their inscrutable behavior poses ever-greater risks to society.