
2025-07-21

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| VeriRAG: A Retrieval-Augmented Framework for Automated RTL Testability Repair | Haomin Qi, Yuyang Du, Lihao Zhang, Soung Chang Liew, Kexin Chen, Yining Du | 2025-07-21 | Download | Large language models (LLMs) have demonstrated immense potential in computer-aided design (CAD), particularly for automated debugging and verification within electronic design automation (EDA) tools. |
| When Pipelined In-Memory Accelerators Meet Spiking Direct Feedback Alignment: A Co-Design for Neuromorphic Edge Computing | Haoxiong Ren, Yangu He, Kwunhang Wong, Rui Bao, Ning Lin, Zhongrui Wang, Dashan Shang | 2025-07-21 | Download | Spiking Neural Networks (SNNs) are increasingly favored for deployment on resource-constrained edge devices due to their energy-efficient and event-driven processing capabilities. |
| Rethinking LLM Inference Bottlenecks: Insights from Latent Attention and Mixture-of-Experts | Sungmin Yun, Seonyong Park, Hwayong Nam, Younjoo Lee, Gunjun Lee, Kwanhee Kyung, Sangpyo Kim, Nam Sung Kim, Jongmin Kim, Hyungyo Kim, Juhwan Cho, Seungmin Baek, Jung Ho Ahn | 2025-07-21 | Download | Computational workloads composing traditional transformer models are starkly bifurcated. Multi-Head Attention (MHA) and Grouped-Query Attention are memory-bound due to low arithmetic intensity, while ... |
| GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing | Minnan Pei, Gang Li, Junwen Si, Zeyu Zhu, Zitao Mo, Peisong Wang, Zhuoran Song, Xiaoyao Liang, Jian Cheng | 2025-07-21 | Download | 3D Gaussian Splatting (3DGS) has emerged as a leading neural rendering technique for high-fidelity view synthesis, prompting the development of dedicated 3DGS accelerators for resource-constrained pla... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Resilience Evaluation of Kubernetes in Cloud-Edge Environments via Failure Injection | Zihao Chen, Mohammad Goudarzi, Adel Nadjaran Toosi | 2025-07-21 | Download | Kubernetes has emerged as an essential platform for deploying containerised applications across cloud and edge infrastructures. As Kubernetes gains increasing adoption for mission-critical microservic... |
| Entanglement-Efficient Distribution of Quantum Circuits over Large-Scale Quantum Networks | Felix Burt, Kuan-Cheng Chen, Kin K. Leung | 2025-07-21 | Download | Quantum computers face inherent scaling challenges, a fact that necessitates investigation of distributed quantum computing systems, whereby scaling is achieved through interconnection of smaller quan... |
| Byzantine-Resilient Distributed Computation via Task Replication and Local Computations | Aayush Rajesh, Nikhil Karamchandani, Vinod M. Prabhakaran | 2025-07-21 | Download | We study a distributed computation problem in the presence of Byzantine workers where a central node wishes to solve a task that is divided into independent sub-tasks, each of which needs to be solved... |
| Asynchronous Collective Tree Exploration: a Distributed Algorithm, and a new Lower Bound | Romain Cosson, Laurent Massoulié | 2025-07-21 | Download | We study the problem of collective tree exploration in which a team of k mobile agents must collectively visit all nodes of an unknown tree in as few moves as possible. |
| Efficient Routing of Inference Requests across LLM Instances in Cloud-Edge Computing | Shibo Yu, Mohammad Goudarzi, Adel Nadjaran Toosi | 2025-07-21 | Download | The rising demand for Large Language Model (LLM) inference services has intensified pressure on computational resources, resulting in latency and cost challenges. |
| Scaling Decentralized Learning with FLock | Zehua Cheng, Rui Sun, Jiahao Sun, Yike Guo | 2025-07-21 | Download | Fine-tuning the large language models (LLMs) are prevented by the deficiency of centralized control and the massive computing and communication overhead on the decentralized schemes. |
| A Multi-Armed Bandit-Based Participant Selection Method for Federated Recommendation Systems | Jintao Liu, Mohammad Goudarzi, Adel Nadjaran Toosi | 2025-07-21 | Download | Federated Recommendation Systems (FRS) enable privacy-preserving model training by keeping user data on edge devices. However, the practical deployment of FRS in Edge-Cloud environments faces signific... |
| GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis | Guoxi Liu, Thomas Randall, Rong Ge, Federico Iuricich | 2025-07-21 | Download | Unstructured meshes present challenges in scientific data analysis due to irregular distribution and complex connectivity. Computing and storing connectivity information is a major bottleneck for visu... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| AI-driven Orchestration at Scale: Estimating Service Metrics on National-Wide Testbeds | Rodrigo Moreira, Rafael Pasquini, Joberto S. B. Martins, Tereza C. Carvalho, Flávio de Oliveira Silva | 2025-07-21 | Download | Network Slicing (NS) realization requires AI-native orchestration architectures to efficiently and intelligently handle heterogeneous user requirements. |
| The Capacity of Semantic Private Information Retrieval with Colluding Servers | Mohamed Nomeir, Alptug Aytekin, Sennur Ulukus | 2025-07-21 | Download | We study the problem of semantic private information retrieval (Sem-PIR) with T colluding servers (Sem-TPIR), i.e., servers that collectively share user queries. |
| Federated Split Learning with Improved Communication and Storage Efficiency | Yujia Mu, Cong Shen | 2025-07-21 | Download | Federated learning (FL) is one of the popular distributed machine learning (ML) solutions but incurs significant communication and computation costs at edge devices. |
| Point Cloud Streaming with Latency-Driven Implicit Adaptation using MoQ | Andrew Freeman, Michael Rudolph, Amr Rizk | 2025-07-21 | Download | Point clouds are a promising video representation for virtual and augmented reality. Their high-bitrate, however, has so far limited the practicality of live streaming systems. |
| Vehicular Cloud Computing: A cost-effective alternative to Edge Computing in 5G networks | Rosario Patanè, Nadjib Achir, Andrea Araldo, Lila Boukhatem | 2025-07-21 | Download | Edge Computing (EC) is a computational paradigm that involves deploying resources such as CPUs and GPUs near end-users, enabling low-latency applications like augmented reality and real-time gaming. |
| SENSOR: A Cost-Efficient Open-Source Flow Monitoring Platform | Gabriel Paradzik, Benjamin Steinert, Heinrich Abele, Michael Menth | 2025-07-21 | Download | This paper presents a cost-effective and distributed flow monitoring platform for collecting unsampled IPFIX data exclusively using open-source tools, which is implemented at the University of Tübinge... |
| AoI-Aware Resource Allocation with Deep Reinforcement Learning for HAPS-V2X Networks | Ahmet Melih Ince, Ayse Elif Canbilen, Halim Yanikomeroglu | 2025-07-21 | Download | Sixth-generation (6G) networks are designed to meet the hyper-reliable and low-latency communication (HRLLC) requirements of safety-critical applications such as autonomous driving. |
| Assessing the Benefits of Ground Vehicles as Moving Urban Base Stations | Laura Finarelli, Falko Dressler, Marco Ajmone Marsan, Gianluca Rizzo | 2025-07-21 | Download | In the evolution towards 6G user-centric networking, the moving network (MN) paradigm can play an important role. In a MN, some small cell base stations (BS) are installed on top of vehicles, and enab... |
| Stack Management for MPLS Network Actions: Integration of Nodes with Limited Hardware Capabilities | Fabian Ihle, Michael Menth | 2025-07-21 | Download | The MPLS Network Actions (MNA) framework enhances MPLS forwarding with a generalized encoding for manifold extensions such as network slicing and in-situ OAM (IOAM). |
| Enhancements to P4TG: Histogram-Based RTT Monitoring in the Data Plane | Fabian Ihle, Etienne Zink, Michael Menth | 2025-07-21 | Download | Modern traffic generators are essential tools for evaluating the performance of network environments. P4TG is a P4-based traffic generator implemented for Intel Tofino switches that offers high-speed ... |
| Low-Power and Accurate IoT Monitoring Under Radio Resource Constraint | Takaho Shimokasa, Hiroyuki Yomo, Federico Chiariotti, Junya Shiraishi, Petar Popovski | 2025-07-21 | Download | This paper investigates how to achieve both low-power operations of sensor nodes and accurate state estimation using Kalman filter for internet of things (IoT) monitoring employing wireless sensor net... |
| Non-Terrestrial Network Models Using Stochastic Geometry: Planar or Spherical? | Ruibo Wang, Baha Eddine Youcef Belmekki, Howard H. Yang, Mohamed Slim Alouini | 2025-07-21 | Download | With the explosive deployment of non-terrestrial networks (NTNs), the computational complexity of network performance analysis is rapidly escalating. |
| Enabling Immersive XR Collaborations over FTTR Networks (Invited) | Sourav Mondal, Elaine Wong | 2025-07-21 | Download | Fiber-To-The-Room is a potential solution to achieve in-premise extended reality collaborations. This paper explores predictive bandwidth allocation and seamless handover schemes over FTTR, showing hi... |
| User Head Movement-Predictive XR in Immersive H2M Collaborations over Future Enterprise Networks | Sourav Mondal, Elaine Wong | 2025-07-21 | Download | The evolution towards future generation of mobile systems and fixed wireless networks is primarily driven by the urgency to support high-bandwidth and low-latency services across various vertical sect... |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Enhancements to P4TG: Histogram-Based RTT Monitoring in the Data Plane | Fabian Ihle, Etienne Zink, Michael Menth | 2025-07-21 | Download | Modern traffic generators are essential tools for evaluating the performance of network environments. P4TG is a P4-based traffic generator implemented for Intel Tofino switches that offers high-speed ... |
