Skip to content

2025-11-08

cs.AR - Architecture

标题作者发布日期PDF摘要
Bespoke Co-processor for Energy-Efficient Health Monitoring on RISC-V-based Flexible WearablesTheofanis Vergos, Polykarpos Vergos, Mehdi B. Tahoori, Georgios Zervakis2025-11-08下载Flexible electronics offer unique advantages for conformable, lightweight, and disposable healthcare wearables. However, their limited gate count, large feature sizes, and high static power consumptio...
AiEDA: An Open-Source AI-Aided Design Library for Design-to-VectorYihang Qiu, Zengrong Huang, Simin Tao, Hongda Zhang, Weiguo Li, Xinhua Lai, Rui Wang, Weiqiang Wang, Xingquan Li2025-11-08下载Recent research has demonstrated that artificial intelligence (AI) can assist electronic design automation (EDA) in improving both the quality and efficiency of chip design.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Elastic Data Transfer Optimization with Hybrid Reinforcement LearningRasman Mubtasim Swargo, Md Arifuzzaman2025-11-08下载Modern scientific data acquisition generates petabytes of data that must be transferred to geographically distant computing clusters. Conventional tools either rely on preconfigured sessions, which ar...
Reliablocks: Developing Reliability Scores for Optimistic RollupsSouradeep Das, Ethan Lam, Varun Vaidya, Sanjay Amirthraj2025-11-08下载Introducing Reliablocks, an on-chain reliability index for non-finalized blocks in Optimistic Rollups. This was built during the EigenLayer Infinite Hackathon at the Infinite Hacker House at DevCon 20...
Secure Autonomous Agent Payments: Verifying Authenticity and Intent in a Trustless EnvironmentVivek Acharya2025-11-08下载Artificial intelligence (AI) agents are increasingly capable of initiating financial transactions on behalf of users or other agents. This evolution introduces a fundamental challenge: verifying both ...
Inductive Loop Analysis for Practical HPC Application OptimizationPhilipp Schaad, Tal Ben-Nun, Patrick Iff, Torsten Hoefler2025-11-08下载Scientific computing applications heavily rely on multi-level loop nests operating on multidimensional arrays. This presents multiple optimization opportunities from exploiting parallelism to reducing...
MoSKA: Mixture of Shared KV Attention for Efficient Long-Sequence LLM InferenceMyunghyun Rhee, Sookyung Choi, Euiseok Kim, Joonseop Sim, Youngpyo Joo, Hoshik Kim2025-11-08下载The escalating context length in Large Language Models (LLMs) creates a severe performance bottleneck around the Key-Value (KV) cache, whose memory-bound nature leads to significant GPU under-utilizat...
Distributed Deep Learning for Medical Image Denoising with Data ObfuscationSulaimon Oyeniyi Adebayo, Ayaz H. Khan2025-11-08下载Medical image denoising is essential for improving image quality while minimizing the exposure of sensitive information, particularly when working with large-scale clinical datasets.
Kunlun Anomaly Troubleshooter: Enabling Kernel-Level Anomaly Detection and Causal Reasoning for Large Model Distributed InferenceYuyang Liu, Jingjing Cai, Jiayi Ren, Peng Zhou, Danyang Zhang, Yin Du, Shijian Li2025-11-08下载Anomaly troubleshooting for large model distributed inference (LMDI) remains a critical challenge. Resolving anomalies such as inference performance degradation or latency jitter in distributed system...
DWM-RO: Decentralized World Models with Reasoning Offloading for SWIPT-enabled Satellite-Terrestrial HetNetsGuangyuan Liu, Yinqiu Liu, Ruichen Zhang, Dusit Niyato, Jiawen Kang, Sumei Sun, Abbas Jamalipour, Ping Zhang2025-11-08下载Wireless networks are undergoing a paradigm shift toward massive connectivity with energy-efficient operation, driving the integration of satellite-terrestrial architectures with simultaneous wireless...
MT4G: A Tool for Reliable Auto-Discovery of NVIDIA and AMD GPU Compute and Memory TopologiesStepan Vanecek, Manuel Walter Mussbacher, Dominik Groessler, Urvij Saroliya, Martin Schulz2025-11-08下载Understanding GPU topology is essential for performance-related tasks in HPC or AI. Yet, unlike for CPUs with tools like hwloc, GPU information is hard to come by, incomplete, and vendor-specific.
CoEdge-RAG: Optimizing Hierarchical Scheduling for Retrieval-Augmented LLMs in Collaborative Edge ComputingGuihang Hong, Tao Ouyang, Kongyange Zhao, Zhi Zhou, Xu Chen2025-11-08下载Motivated by the imperative for real-time responsiveness and data privacy preservation, large language models (LLMs) are increasingly deployed on resource-constrained edge devices to enable localized ...
Efficient Dynamic MaxFlow Computation on GPUsShruthi Kannappan, Ashwina Kumar, Rupesh Nasre2025-11-08下载Maxflow is a fundamental problem in graph theory and combinatorial optimisation, used to determine the maximum flow from a source node to a sink node in a flow network.
HYDRA: Breaking the Global Ordering Barrier in Multi-BFT ConsensusHanzheng Lyu, Shaokang Xie, Jianyu Niu, Mohammad Sadoghi, Yinqian Zhang, Cong Wang, Ivan Beschastnikh, Chen Feng2025-11-08下载Multi-Byzantine Fault Tolerant (Multi-BFT) consensus, which runs multiple BFT instances in parallel, has recently emerged as a promising approach to overcome the leader bottleneck in classical BFT pro...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Learning a Decentralized Medium Access Control Protocol for Shared Message TransmissionLorenzo Mario Amorosa, Zhan Gao, Roberto Verdone, Petar Popovski, Deniz Gündüz2025-11-08下载In large-scale Internet of things networks, efficient medium access control (MAC) is critical due to the growing number of devices competing for limited communication resources.
Enabling Data-Driven Policymaking Using Broadband-Plan Querying Tool (BQT+)Laasya Koduru, Sylee Beltiukov, Jaber Daneshamooz, Eugene Vuong, Arpit Gupta, Elizabeth Belding, Tejas N. Narechania2025-11-08下载Poor broadband access undermines civic and economic life, a challenge exacerbated by the fact that millions of Americans still lack reliable high-speed connectivity.
Digital Twin-Assisted Task Offloading and Resource Allocation in ISAC-Enabled Internet of VehiclesShanhao Zhan, Zhang Liu, Lianfen Huang, Shaowei Shen, Ziyang Bai, Zhibin Gao, Dusit Niyato2025-11-08下载The convergence of the Internet of vehicles (IoV) and 6G networks is driving the evolution of next-generation intelligent transportation systems.

cs.PF - Performance

标题作者发布日期PDF摘要
SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads?Jeffrey Jian Ma, Milad Hashemi, Amir Yazdanbakhsh, Kevin Swersky, Ofir Press, Enhui Li, Vijay Janapa Reddi, Parthasarathy Ranganathan2025-11-08下载Optimizing the performance of large-scale software repositories demands expertise in code reasoning and software engineering (SWE) to reduce runtime while preserving program correctness.
Inductive Loop Analysis for Practical HPC Application OptimizationPhilipp Schaad, Tal Ben-Nun, Patrick Iff, Torsten Hoefler2025-11-08下载Scientific computing applications heavily rely on multi-level loop nests operating on multidimensional arrays. This presents multiple optimization opportunities from exploiting parallelism to reducing...
Kunlun Anomaly Troubleshooter: Enabling Kernel-Level Anomaly Detection and Causal Reasoning for Large Model Distributed InferenceYuyang Liu, Jingjing Cai, Jiayi Ren, Peng Zhou, Danyang Zhang, Yin Du, Shijian Li2025-11-08下载Anomaly troubleshooting for large model distributed inference (LMDI) remains a critical challenge. Resolving anomalies such as inference performance degradation or latency jitter in distributed system...

基于 VitePress 构建