Skip to content

2025-03-25

cs.AR - Architecture

标题作者发布日期PDF摘要
Hardware Efficient Accelerator for Spiking Transformer With Reconfigurable Parallel Time Step ComputingBo-Yu Chen, Tian-Sheuan Chang2025-03-25下载This paper introduces the first low-power hardware accelerator for Spiking Transformers, an emerging alternative to traditional artificial neural networks.
An Efficient Data Reuse with Tile-Based Adaptive Stationary for Transformer AcceleratorsTseng-Jen Li, Tian-Sheuan Chang2025-03-25下载Transformer-based models have become the \textit{de facto} backbone across many fields, such as computer vision and natural language processing.
A Low-Power Sparse Deep Learning Accelerator with Optimized Data ReuseKai-Chieh Hsu, Tian-Sheuan Chang2025-03-25下载Sparse deep learning has reduced computation significantly, but its irregular non-zero data distribution complicates the data flow and hinders data reuse, increasing on-chip SRAM access and thus power...
Anvil: A General-Purpose Timing-Safe Hardware Description LanguageJason Zhijingcheng Yu, Aditya Ranjan Jha, Umang Mathur, Trevor E. Carlson, Prateek Saxena2025-03-25下载Expressing hardware designs using hardware description languages (HDLs) routinely involves using stateless signals whose values change according to their underlying registers.
Integrating Prefetcher Selection with Dynamic Request Allocation Improves Prefetching EfficiencyMengming Li, Qijun Zhang, Yongqing Ren, Zhiyao Xie2025-03-25下载Hardware prefetching plays a critical role in hiding the off-chip DRAM latency. The complexity of applications results in a wide variety of memory access patterns, prompting the development of numerou...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Exact and Linear Convergence for Federated Learning under Arbitrary Client Participation is AttainableBicheng Ying, Zhe Li, Haibo Yang2025-03-25下载This work tackles the fundamental challenges in Federated Learning (FL) posed by arbitrary client participation and data heterogeneity, prevalent characteristics in practical FL settings.
ARGO-SLSA: Software Supply Chain Security in Argo WorkflowsMohomed Thariq, Indrajith Ekanayake2025-03-25下载Distributed systems widely adopt microservice architecture to handle growing complexity and scale. This approach breaks applications into independent, loosely coupled services.
RCC-PFL: Robust Client Clustering under Noisy Labels in Personalized Federated LearningAbdulmoneam Ali, Ahmed Arafa2025-03-25下载We address the problem of cluster identity estimation in a personalized federated learning (PFL) setting in which users aim to learn different personal models.
Comparing the Run-time Behavior of Modern PDES Engines on Alternative Hardware ArchitecturesRomolo Marotta, Francesco Quaglia2025-03-25下载The current trend of technology has brought parallel machines equipped with multiple processors and multiple memory sockets to be available off-the-shelf -- or via renting through Iaas Clouds -- at re...
AIGC-assisted Federated Learning for Vehicular Edge Intelligence: Vehicle Selection, Resource Allocation and Model AugmentationXianke Qiang, Zheng Chang, Geyong Min2025-03-25下载To leverage the vast amounts of onboard data while ensuring privacy and security, federated learning (FL) is emerging as a promising technology for supporting a wide range of vehicular applications.
A Tight Meta-theorem for LOCAL Certification of MSO2_2 Properties within Bounded Treewidth GraphsLinda Cook, Eun Jung Kim, Tomáš Masařík2025-03-25下载Distributed networks are prone to errors so verifying their output is critical. Hence, we develop LOCAL certification protocols for graph properties in which nodes are given certificates that allow th...
Hierarchical Prediction-based Management for LMaaS SystemsZhihan Jiang, Yujie Huang, Guangba Yu, Junjie Huang, Jiazhen Gu, Michael R. Lyu2025-03-25下载Large Language Models (LLMs) have revolutionized numerous domains, driving the rise of Language-Model-as-a-Service (LMaaS) platforms that process millions of queries daily.
Fairness in Proof of Team Sprint (PoTS): Evaluating Reward Distribution Across Performance LevelsNaoki Yonezawa2025-03-25下载Blockchain consensus mechanisms must balance security, decentralization, and efficiency while ensuring fair participation. Proof of Team Sprint (PoTS) is a cooperative consensus mechanism designed to ...
ADApt: Edge Device Anomaly Detection and Microservice Replica PredictionNarges Mehran, Nikolay Nikolov, Radu Prodan, Dumitru Roman, Dragi Kimovski, Frank Pallas, Peter Dorfinger2025-03-25下载The increased usage of Internet of Things devices at the network edge and the proliferation of microservice-based applications create new orchestration challenges in Edge computing.
Robustness of Proof of Team Sprint (PoTS) Against Attacks: A Simulation-Based AnalysisNaoki Yonezawa2025-03-25下载This study evaluates the robustness of Proof of Team Sprint (PoTS) against adversarial attacks through simulations, focusing on the attacker win rate and computational efficiency under varying team si...
Empirical Evaluation and Scalability Analysis of Proof of Team Sprint (PoTS): Reward Fairness, Energy Efficiency, and System StabilityNaoki Yonezawa2025-03-25下载This paper presents an empirical evaluation of the Proof of Team Sprint (PoTS) consensus algorithm, focusing on reward fairness, energy efficiency, system stability, and scalability.
LOCO: Rethinking Objects for Network MemoryGeorge Hodgkins, Mark Madler, Joseph Izraelevitz2025-03-25下载In this work, we explore an object-based programming model for filling the space between shared memory and distributed systems programming. We argue that the natural representation for resources distr...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
SoK: Decoding the Enigma of Encrypted Network Traffic ClassifiersNimesha Wickramasinghe, Arash Shaghaghi, Gene Tsudik, Sanjay Jha2025-03-25下载The adoption of modern encryption protocols such as TLS 1.3 has significantly challenged traditional network traffic classification (NTC) methods.
Deploying an Aerial Reconfigurable Intelligent Surface for Vehicle-to-Vehicle Communications (PL: Wykorzystanie powietrznych przełączalnych inteligentnych powierzchni do komunikacji międzypojazdowej)Salim Janji2025-03-25下载This paper addresses the deployment of a drone equipped with a reconfigurable intelligent surface (RIS), creating a drone relay station (DRS) to enhance the connectivity of vehicle-to-vehicle (V2V) pa...
RCC-PFL: Robust Client Clustering under Noisy Labels in Personalized Federated LearningAbdulmoneam Ali, Ahmed Arafa2025-03-25下载We address the problem of cluster identity estimation in a personalized federated learning (PFL) setting in which users aim to learn different personal models.
OPC UA for IO-Link Wireless in a Cyber Physical Finite Element Sensor Network for Shape MeasurementHenry Beuster, Lars-Michel Bretthauer, Gerd Scholl2025-03-25下载This paper presents the integration of OPC UA as a communication protocol in a wireless sensor network and the associated companion specifications as a semantic template for an information model.
Energy-aware Joint Orchestration of 5G and Robots: Experimental Testbed and Field ValidationMilan Groshev, Lanfranco Zanzi, Carmen Delgado, Xi Li, Antonio de la Oliva, Xavier Costa-Perez2025-03-25下载5G mobile networks introduce a new dimension for connecting and operating mobile robots in outdoor environments, leveraging cloud-native and offloading features of 5G networks to enable fully flexible...
Empirical Analysis of the Impact of 5G Jitter on Time-Aware Shaper Scheduling in a 5G-TSN NetworkPablo Rodriguez-Martin, Oscar Adamuz-Hinojosa, Pablo Muñoz, Julia Caleya-Sanchez, Jorge Navarro-Ortiz, Pablo Ameigeiras2025-03-25下载Deterministic communications are essential for industrial automation, ensuring strict latency requirements and minimal jitter in packet transmission.
A Reliable and Efficient 5G Vehicular MEC: Guaranteed Task Completion with Minimal LatencyMahsa Paknejad, Parisa Fard Moshiri, Murat Simsek, Burak Kantarci, Hussein T. Mouftah2025-03-25下载This paper explores the advancement of Vehicular Edge Computing (VEC) as a tailored application of Mobile Edge Computing (MEC) for the automotive industry, addressing the rising demand for real-time p...
Partitioned Task Offloading for Low-Latency and Reliable Task Completion in 5G MECParisa Fard Moshiri, Murat Simsek, Burak Kantarci2025-03-25下载The demand for MEC has increased with the rise of data-intensive applications and 5G networks, while conventional cloud models struggle to satisfy low-latency requirements.

cs.PF - Performance

标题作者发布日期PDF摘要
Adaptive Orchestration for Large-Scale Inference on Heterogeneous Accelerator Systems Balancing Cost, Performance, and ResilienceYahav Biran, Imry Kissos2025-03-25下载The surge in generative AI workloads has created a need for scalable inference systems that can flexibly harness both GPUs and specialized accelerators while containing operational costs.
Versatile Cross-platform Compilation Toolchain for Schrödinger-style Quantum Circuit SimulationYuncheng Lu, Shuang Liang, Hongxiang Fan, Ce Guo, Wayne Luk, Paul H. J. Kelly2025-03-25下载While existing quantum hardware resources have limited availability and reliability, there is a growing demand for exploring and verifying quantum algorithms.
VecTrans: Enhancing Compiler Auto-Vectorization through LLM-Assisted Code TransformationsZhongchun Zheng, Kan Wu, Long Cheng, Lu Li, Rodrigo C. O. Rocha, Tianyi Liu, Wei Wei, Jianjiang Zeng, Xianwei Zhang, Yaoqing Gao2025-03-25下载Auto-vectorization is a fundamental optimization for modern compilers to exploit SIMD parallelism. However, state-of-the-art approaches still struggle to handle intricate code patterns, often requirin...

基于 VitePress 构建