Skip to content

2025-09-29

cs.AR - Architecture

标题作者发布日期PDF摘要
EEsizer: LLM-Based AI Agent for Sizing of Analog and Mixed Signal CircuitChang Liu, Danial Chitnis2025-09-29下载The design of Analog and Mixed-Signal (AMS) integrated circuits (ICs) often involves significant manual effort, especially during the transistor sizing process.
smallNet: Implementation of a convolutional layer in tiny FPGAsFernanda Zapata Bascuñán, Alan Ezequiel Fuster2025-09-29下载Since current neural network development systems in Xilinx and VLSI require codevelopment with Python libraries, the first stage of a convolutional network has been implemented by developing a convolu...
On the Shape of Latent Variables in a Denoising VAE-MoG: A Posterior Sampling-Based StudyFernanda Zapata Bascuñán2025-09-29下载In this work, we explore the latent space of a denoising variational autoencoder with a mixture-of-Gaussians prior (VAE-MoG), trained on gravitational wave data from event GW150914.
Fault Injection in On-Chip Interconnects: A Comparative Study of Wishbone, AXI-Lite, and AXIHongwei Zhao, Vianney Lapotre, Guy Gogniat2025-09-29下载Fault injection attacks exploit physical disturbances to compromise the functionality and security of integrated circuits. As System on Chip (SoC) architectures grow in complexity, the vulnerability o...
Intent-Driven Storage Systems: From Low-Level Tuning to High-Level UnderstandingShai Bergman, Won Wook Song, Lukas Cavigelli, Konstantin Berestizshevsky, Ke Zhou, Ji Zhang2025-09-29下载Existing storage systems lack visibility into workload intent, limiting their ability to adapt to the semantics of modern, large-scale data-intensive applications.
BiHDTrans: binary hyperdimensional transformer for efficient multivariate time series classificationJingtao Zhang, Yi Liu, Qi Shen, Changhong Wang2025-09-29下载The proliferation of Internet-of-Things (IoT) devices has led to an unprecedented volume of multivariate time series (MTS) data, requiring efficient and accurate processing for timely decision-making ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Enhancing Split Learning with Sharded and Blockchain-Enabled SplitFed ApproachesAmirreza Sokhankhosh, Khalid Hassan, Sara Rouhani2025-09-29下载Collaborative and distributed learning techniques, such as Federated Learning (FL) and Split Learning (SL), hold significant promise for leveraging sensitive data in privacy-critical domains.
CAFL-L: Constraint-Aware Federated Learning with Lagrangian Dual Optimization for On-Device Language ModelsDongqi Zheng, Wenjin Fu2025-09-29下载We introduce Constraint-Aware Federated Learning with Lagrangian Dual Optimization (CAFL-L), a principled extension of FedAvg that explicitly incorporates device-level resource constraints including e...
Permuting Transactions in Ethereum Blocks: An Empirical StudyJan Droll2025-09-29下载Several recent proposals implicitly or explicitly suggest making use of randomized transaction ordering within a block to mitigate centralization effects and to improve fairness in the Ethereum ecosys...
Context-Driven Performance Modeling for Causal Inference Operators on Neural Processing UnitsNeelesh Gupta, Rakshith Jayanth, Dhruv Parikh, Viktor Prasanna2025-09-29下载The proliferation of large language models has driven demand for long-context inference on resource-constrained edge platforms. However, deploying these models on Neural Processing Units (NPUs) presen...
Accelerating Dynamic Image Graph Construction on FPGA for Vision GNNsAnvitha Ramachandran, Dhruv Parikh, Viktor Prasanna2025-09-29下载Vision Graph Neural Networks (Vision GNNs, or ViGs) represent images as unstructured graphs, achieving state of the art performance in computer vision tasks such as image classification, object detect...
A Scalable Distributed Framework for Multimodal GigaVoxel Image RegistrationRohit Jena, Vedant Zope, Pratik Chaudhari, James C. Gee2025-09-29下载In this work, we propose FFDP, a set of IO-aware non-GEMM fused kernels supplemented with a distributed framework for image registration at unprecedented scales.
GRACE-MoE: Grouping and Replication with Locality-Aware Routing for Efficient Distributed MoE InferenceYu Han, Lehan Pan, Jie Peng, Ziyang Tao, Wuyang Zhang, Yanyong Zhang2025-09-29下载Sparse Mixture of Experts (SMoE) performs conditional computation by selectively activating a subset of experts, thereby enabling scalable parameter growth in large language models (LLMs).
From Score Distributions to Balance: Plug-and-Play Mixture-of-Experts RoutingRana Shahout, Colin Cai, Yilun Du, Minlan Yu, Michael Mitzenmacher2025-09-29下载Mixture-of-Experts (MoE) models can scale parameter capacity by routing each token to a subset of experts through a learned gate function. While conditional routing reduces training costs, it shifts t...
A Management Framework for Vehicular Cloudtoward Economic and Environmental EfficiencyRosario Patanè, Andrea Araldo, Nadjib Achir, Lila Boukhatem2025-09-29下载Vehicular Cloud Computing (VCC) leverages the idle computing capacity of vehicles to execute end-users' offloaded tasks without requiring new computation infrastructure.
Lumos: Performance Characterization of WebAssembly as a Serverless Runtime in the Edge-Cloud ContinuumCynthia Marcelino, Noah Krennmair, Thomas Pusztai, Stefan Nastic2025-09-29下载WebAssembly has emerged as a lightweight and portable runtime to execute serverless functions, particularly in heterogeneous and resource-constrained environments such as the Edge Cloud Continuum.
Graph Theory Meets Federated Learning over Satellite Constellations: Spanning Aggregations, Network Formation, and Performance OptimizationFardis Nadimi, Payam Abdisarabshali, Jacob Chakareski, Nicholas Mastronarde, Seyyedali Hosseinalipour2025-09-29下载In this work, we introduce Fed-Span: \textit{\underline{fed}erated learning with \underline{span}ning aggregation over low Earth orbit (LEO) satellite constellations}.
HAPT: Heterogeneity-Aware Automated Parallel Training on Heterogeneous ClustersAntian Liang, Zhigang Zhao, Kai Zhang, Xuri Shi, Chuantao Li, Chunxiao Wang, Zhenying He, Yinan Jing, X. Sean Wang2025-09-29下载With the rapid evolution of GPU architectures, the heterogeneity of model training infrastructures is steadily increasing. In such environments, effectively utilizing all available heterogeneous accel...
Intent-Driven Storage Systems: From Low-Level Tuning to High-Level UnderstandingShai Bergman, Won Wook Song, Lukas Cavigelli, Konstantin Berestizshevsky, Ke Zhou, Ji Zhang2025-09-29下载Existing storage systems lack visibility into workload intent, limiting their ability to adapt to the semantics of modern, large-scale data-intensive applications.
SparseServe: Unlocking Parallelism for Dynamic Sparse Attention in Long-Context LLM ServingQihui Zhou, Peiqi Yin, Pengfei Zuo, James Cheng2025-09-29下载Serving long-context LLMs is costly because attention computation grows linearly with context length. Dynamic sparse attention algorithms (DSAs) mitigate this by attending only to the key-value (KV) c...
ActorDB: A Unified Database Model Integrating Single-Writer Actors, Incremental View Maintenance, and Zero-Trust MessagingJun Kawasaki2025-09-29下载This paper presents ActorDB ( Dekigoto ) , a novel database architecture that tightly integrates a single-writer actor model for writes, Incremental View Maintenance (IVM), and a zero-trust security m...
Federated Spatiotemporal Graph Learning for Passive Attack Detection in Smart GridsBochra Al Agha, Razane Tajeddine2025-09-29下载Smart grids are exposed to passive eavesdropping, where attackers listen silently to communication links. Although no data is actively altered, such reconnaissance can reveal grid topology, consumptio...
BugMagnifier: TON Transaction Simulator for Revealing Smart Contract VulnerabilitiesYury Yanovich, Victoria Kovalevskaya, Maksim Egorov, Elizaveta Smirnova, Matvey Mishuris, Yash Madhwal, Kirill Ziborov, Vladimir Gorgadze, Subodh Sharma2025-09-29下载The Open Network (TON) blockchain employs an asynchronous execution model that introduces unique security challenges for smart contracts, particularly race conditions arising from unpredictable messag...
RServe: Overlapping Encoding and Prefill for Efficient LMM InferenceTianyu Guo, Tianming Xu, Xianjie Chen, Junru Chen, Nong Xiao, Xianwei Zhang2025-09-29下载Large multimodal models (LMMs) typically employ an encoding module to transform multimodal data inputs into embeddings, which are then fed to language models for further processing.
LogAction: Consistent Cross-system Anomaly Detection through Logs via Active Domain AdaptationChiming Duan, Minghua He, Pei Xiao, Tong Jia, Xin Zhang, Zhewei Zhong, Xiang Luo, Yan Niu, Lingzhe Zhang, Yifan Wu, Siyu Yu, Weijie Hong, Ying Li, Gang Huang2025-09-29下载Log-based anomaly detection is a essential task for ensuring the reliability and performance of software systems. However, the performance of existing anomaly detection methods heavily relies on label...
Asynchronous Policy Gradient Aggregation for Efficient Distributed Reinforcement LearningAlexander Tyurin, Andrei Spiridonov, Varvara Rudenko2025-09-29下载We study distributed reinforcement learning (RL) with policy gradient methods under asynchronous and parallel computations and communications.
RL in the Wild: Characterizing RLVR Training in LLM DeploymentJiecheng Zhou, Qinghao Hu, Yuyang Jin, Zerui Wang, Peng Sun, Yuzhe Gu, Wenwei Zhang, Mingshu Zhai, Xingcheng Zhang, Weiming Zhang2025-09-29下载Large Language Models (LLMs) are now widely used across many domains. With their rapid development, Reinforcement Learning with Verifiable Rewards (RLVR) has surged in recent months to enhance their r...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Multi-Layer Secret Sharing for Cross-Layer Attack Defense in 5G Networks: a COTS UE DemonstrationWai Ming Chan, Remi Chou, Taejoon Kim2025-09-29下载This demo presents the first implementation of multi-layer secret sharing on commercial-off-the-shelf (COTS) 5G user equipment (UE), operating without infrastructure modifications or pre-shared keys.
Experimental Study of Magnetic Near-Field Microstrip Electronic Probe for PCB EMC Emission MeasurementHongchuan Jia, Fayu Wan, Vladimir Mordachev, Jérôme Rossignol, Glauco Fontagalland, Nour Murad, Blaise Ravelo2025-09-29下载An experimental study on magnetic near-field (NF) scanning of printed circuit board (PCB) emission radiation is developed in this paper. The design and installation of the electromagnetic (EM) NF scan...
Graph Theory Meets Federated Learning over Satellite Constellations: Spanning Aggregations, Network Formation, and Performance OptimizationFardis Nadimi, Payam Abdisarabshali, Jacob Chakareski, Nicholas Mastronarde, Seyyedali Hosseinalipour2025-09-29下载In this work, we introduce Fed-Span: \textit{\underline{fed}erated learning with \underline{span}ning aggregation over low Earth orbit (LEO) satellite constellations}.
Blockchain-Driven Federation for Distributed Edge Systems: Design and Experimental ValidationAdam Zahir, Milan Groshev, Carlos J. Bernardos, Antonio de la Oliva2025-09-29下载Edge computing brings computation near end users, enabling the provisioning of novel use cases. To satisfy end-user requirements, the concept of edge federation has recently emerged as a key mechanism...
Markov Decision Processing NetworksSanidhay Bhambay, Thirupathaiah Vasantam, Neil Walton2025-09-29下载We introduce Markov Decision Processing Networks (MDPNs) as a multiclass queueing network model where service is a controlled, finite-state Markov process.
Optimisation of Resource Allocation in Heterogeneous Wireless Networks Using Deep Reinforcement LearningOluwaseyi Giwa, Jonathan Shock, Jaco Du Toit, Tobi Awodumila2025-09-29下载Dynamic resource allocation in open radio access network (O-RAN) heterogeneous networks (HetNets) presents a complex optimisation challenge under varying user loads.
Contrastive Learning for Correlating Network IncidentsJeremias Dötterl2025-09-29下载Internet service providers monitor their networks to detect, triage, and remediate service impairments. When an incident is detected, it is important to determine whether similar incidents have occurr...
Flexible and High-Performance Radio Access Networks for upcoming Sixth-Generation (6G) SystemsPeter Schefczik, Umar Toseef, Paolo Baracca, Ralf Klotsche, Torsten Dudda, Mai-Anh Phan, Lorenzo Miretti, David Ginthoer, Bin Han2025-09-29下载The collaborative research project 6G-ANNA develops concepts for the 6G radio access network (RAN) architecture and technology components. Previous RAN generations have become inherently more complex ...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Joyride: Rethinking Linux's network stack design for better performance, security, and reliabilityYanlin Du, Ruslan Nikolaev2025-09-29下载Contemporary distributed computing workloads, including scientific computation, data mining, and machine learning, increasingly demand OS networking with minimal latency as well as high throughput, se...

cs.PF - Performance

标题作者发布日期PDF摘要
FlashOmni: A Unified Sparse Attention Engine for Diffusion TransformersLiang Qiao, Yue Dai, Yeqi Huang, Hongyu Kan, Jun Shi, Hong An2025-09-29下载Multi-Modal Diffusion Transformers (DiTs) demonstrate exceptional capabilities in visual synthesis, yet their deployment remains constrained by substantial computational demands.
DarwinGame: Playing Tournaments for Tuning Applications in Noisy Cloud EnvironmentsRohan Basu Roy, Vijay Gadepally, Devesh Tiwari2025-09-29下载This work introduces a new subarea of performance tuning -- performance tuning in a shared interference-prone computing environment. We demonstrate that existing tuners are significantly suboptimal by...

基于 VitePress 构建