Appearance
2025-09-30
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Compact, Low Power Transprecision ALU for Smart Edge Devices | Ayushi Dube, Gian Singh, Sarma Vrudhula | 2025-09-30 | 下载 | Transprecision computing (TC) is a promising approach for energy-efficient machine learning (ML) computation on resource-constrained platforms. |
| Rearchitecting Datacenter Lifecycle for AI: A TCO-Driven Framework | Jovan Stojkovic, Chaojie Zhang, Íñigo Goiri, Ricardo Bianchini | 2025-09-30 | 下载 | The rapid rise of large language models (LLMs) has been driving an enormous demand for AI inference infrastructure, mainly powered by high-end GPUs. |
| Stab-QRAM: An All-Clifford Quantum Random Access Memory for Special Data | Guangyi Li, Yu Gan, Zeguan Wu, Xueyue Zhang, Zheshen Zhang, Junyu Liu | 2025-09-30 | 下载 | Quantum random access memories (QRAMs) are pivotal for data-intensive quantum algorithms, but existing general-purpose and domain-specific architectures are hampered by a critical bottleneck: a heavy ... |
| Robust NbN on Si-SiGe hybrid superconducting-semiconducting microwave quantum circuit | Paniz Foshat, Samane Kalhor, Shima Poorgholam-khanjari, Douglas Paul, Martin Weides, Kaveh Delfanazari | 2025-09-30 | 下载 | Advancing large-scale quantum computing requires superconducting circuits that combine long coherence times with compatibility with semiconductor technology. |
| TrackCore-F: Deploying Transformer-Based Subatomic Particle Tracking on FPGAs | Arjan Blankestijn, Uraz Odyurt, Amirreza Yousefzadeh | 2025-09-30 | 下载 | The Transformer Machine Learning (ML) architecture has been gaining considerable momentum in recent years. In particular, computational High-Energy Physics tasks such as jet tagging and particle track... |
| Benchmarking Deep Learning Convolutions on Energy-constrained CPUs | Enrique Galvez, Adrien Cassagne, Alix Munier, Manuel Bouyer | 2025-09-30 | 下载 | This work evaluates State-of-the-Art convolution algorithms for CPU-based CNN inference. Although most prior studies focus on GPUs or NPUs, CPU implementations remain comparatively under-optimized. |
| Runtime Energy Monitoring for RISC-V Soft-Cores | Alberto Scionti, Paolo Savio, Francesco Lubrano, Olivier Terzo, Marco Ferretti, Florin Apopei, Juri Bellucci, Ennio Spano, Luca Carriere | 2025-09-30 | 下载 | Energy efficiency is one of the major concern in designing advanced computing infrastructures. From single nodes to large-scale systems (data centers), monitoring the energy consumption of the computi... |
| Enabling Time-Aware Priority Traffic Management over Distributed FPGA Nodes | Alberto Scionti, Paolo Savio, Francesco Lubrano, Federico Stirano, Antonino Nespola, Olivier Terzo, Corrado De Sio, Luca Sterpone | 2025-09-30 | 下载 | Network Interface Cards (NICs) greatly evolved from simple basic devices moving traffic in and out of the network to complex heterogeneous systems offloading host CPUs from performing complex tasks on... |
| MUSS-TI: Multi-level Shuttle Scheduling for Large-Scale Entanglement Module Linked Trapped-Ion | Xian Wu, Chenghong Zhu, Jingbo Wang, Xin Wang | 2025-09-30 | 下载 | Trapped-ion computing is a leading architecture in the pursuit of scalable and high fidelity quantum systems. Modular quantum architectures based on photonic interconnects offer a promising path for s... |
| CIMNAS: A Joint Framework for Compute-In-Memory-Aware Neural Architecture Search | Olga Krestinskaya, Mohammed E. Fouda, Ahmed Eltawil, Khaled N. Salama | 2025-09-30 | 下载 | To maximize hardware efficiency and performance accuracy in Compute-In-Memory (CIM)-based neural network accelerators for Artificial Intelligence (AI) applications, co-optimizing both software and har... |
| SAIL: SRAM-Accelerated LLM Inference System with Lookup-Table-based GEMV | Jingyao Zhang, Jaewoo Park, Jongeun Lee, Elaheh Sadredini | 2025-09-30 | 下载 | Large Language Model (LLM) inference requires substantial computational resources, yet CPU-based inference remains essential for democratizing AI due to the widespread availability of CPUs compared to... |
| LLM-Powered Code Analysis and Optimization for Gaussian Splatting Kernels | Yi Hu, Huiyang Zhou | 2025-09-30 | 下载 | 3D Gaussian splatting (3DGS) is a transformative technique with profound implications on novel view synthesis and real-time rendering. Given its importance, there have been many attempts to improve it... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| BlockSDN-VC: A SDN-Based Virtual Coordinate-Enhanced Transaction Broadcast Framework for High-Performance Blockchains | Wenyang Jia, Jingjing Wang, Kai Lei | 2025-09-30 | 下载 | Modern blockchains need fast, reliable propagation to balance security and throughput. Virtual-coordinate methods speed dissemination but rely on slow iterative updates, leaving nodes out of sync. |
| Artificial Intelligence for Cost-Aware Resource Prediction in Big Data Pipelines | Harshit Goyal | 2025-09-30 | 下载 | Efficient resource allocation is a key challenge in modern cloud computing. Over-provisioning leads to unnecessary costs, while under-provisioning risks performance degradation and SLA violations. |
| FlowMoE: A Scalable Pipeline Scheduling Framework for Distributed Mixture-of-Experts Training | Yunqi Gao, Bing Hu, Mahdi Boloursaz Mashhadi, A-Long Jin, Yanfeng Zhang, Pei Xiao, Rahim Tafazolli, Merouane Debbah | 2025-09-30 | 下载 | The parameter size of modern large language models (LLMs) can be scaled up via the sparsely-activated Mixture-of-Experts (MoE) technique to avoid excessive increase of the computational costs. |
| LoRAFusion: Efficient LoRA Fine-Tuning for LLMs | Zhanda Zhu, Qidong Su, Yaoyao Ding, Kevin Song, Shang Wang, Gennady Pekhimenko | 2025-09-30 | 下载 | Low-Rank Adaptation (LoRA) has become the leading Parameter-Efficient Fine-Tuning (PEFT) method for Large Language Models (LLMs), as it significantly reduces GPU memory usage while maintaining competi... |
| Lattica: A Decentralized Cross-NAT Communication Framework for Scalable AI Inference and Training | Ween Yang, Jason Liu, Suli Wang, Xinyuan Song, Lynn Ai, Eric Yang, Bill Shi | 2025-09-30 | 下载 | The rapid expansion of distributed Artificial Intelligence (AI) workloads beyond centralized data centers creates a demand for new communication substrates. |
| TASP: Topology-aware Sequence Parallelism | Yida Wang, Ke Hong, Xiuhong Li, Yuanchao Xu, Wenxun Wang, Guohao Dai, Yu Wang | 2025-09-30 | 下载 | Long-context large language models (LLMs) face constraints due to the quadratic complexity of the self-attention mechanism. The mainstream sequence parallelism (SP) method, Ring Attention, attempts to... |
| Rearchitecting Datacenter Lifecycle for AI: A TCO-Driven Framework | Jovan Stojkovic, Chaojie Zhang, Íñigo Goiri, Ricardo Bianchini | 2025-09-30 | 下载 | The rapid rise of large language models (LLMs) has been driving an enormous demand for AI inference infrastructure, mainly powered by high-end GPUs. |
| CSnake: Detecting Self-Sustaining Cascading Failure via Causal Stitching of Fault Propagations | Shangshu Qian, Lin Tan, Yongle Zhang | 2025-09-30 | 下载 | Recent studies have revealed that self-sustaining cascading failures in distributed systems frequently lead to widespread outages, which are challenging to contain and recover from. |
| Tuning the Tuner: Introducing Hyperparameter Optimization for Auto-Tuning | Floris-Jan Willemsen, Rob V. van Nieuwpoort, Ben van Werkhoven | 2025-09-30 | 下载 | Automatic performance tuning (auto-tuning) is widely used to optimize performance-critical applications across many scientific domains by finding the best program variant among many choices. |
| Efficient Construction of Large Search Spaces for Auto-Tuning | Floris-Jan Willemsen, Rob V. van Nieuwpoort, Ben van Werkhoven | 2025-09-30 | 下载 | Automatic performance tuning, or auto-tuning, accelerates high-performance codes by exploring vast spaces of code variants. However, due to the large number of possible combinations and complex constr... |
| I Like To Move It -- Computation Instead of Data in the Brain | Fabian Czappa, Marvin Kaster, Felix Wolf | 2025-09-30 | 下载 | The detailed functioning of the human brain is still poorly understood. Brain simulations are a well-established way to complement experimental research, but must contend with the computational demand... |
| Parallax: Efficient LLM Inference Service over Decentralized Environment | Chris Tong, Youhe Jiang, Gufeng Chen, Tianyi Zhao, Sibian Lu, Wenjie Qu, Eric Yang, Lynn Ai, Binhang Yuan | 2025-09-30 | 下载 | Deploying a large language model (LLM) inference service remains costly because centralized serving depends on specialized GPU clusters and high-bandwidth interconnects in datacenters. |
| AGOCS -- Accurate Google Cloud Simulator Framework | Leszek Sliwko, Vladimir Getov | 2025-09-30 | 下载 | This paper presents the Accurate Google Cloud Simulator (AGOCS) - a novel high-fidelity Cloud workload simulator based on parsing real workload traces, which can be conveniently used on a desktop mach... |
| Hybrid Dual-Batch and Cyclic Progressive Learning for Efficient Distributed Training | Kuan-Wei Lu, Ding-Yong Hong, Pangfeng Liu, Jan-Jan Wu | 2025-09-30 | 下载 | Distributed machine learning is critical for training deep learning models on large datasets with numerous parameters. Current research primarily focuses on leveraging additional hardware resources an... |
| Enabling Time-Aware Priority Traffic Management over Distributed FPGA Nodes | Alberto Scionti, Paolo Savio, Francesco Lubrano, Federico Stirano, Antonino Nespola, Olivier Terzo, Corrado De Sio, Luca Sterpone | 2025-09-30 | 下载 | Network Interface Cards (NICs) greatly evolved from simple basic devices moving traffic in and out of the network to complex heterogeneous systems offloading host CPUs from performing complex tasks on... |
| Accelerating LLM Inference with Precomputed Query Storage | Jay H. Park, Youngju Cho, Choungsol Lee, Moonwook Oh, Euiseong Seo | 2025-09-30 | 下载 | Large language model (LLM) inference often suffers from high latency, particularly in resource-constrained environments such as on-device or edge deployments. |
| PAST: Pilot and Adaptive Orchestration for Timely and Resilient Service Delivery in Edge-Assisted UAV Networks under Spatio-Temporal Dynamics | Houyi Qi, Minghui Liwang, Liqun Fu, Sai Zou, Xinlei Yi, Wei Ni, Huaiyu Dai | 2025-09-30 | 下载 | Incentive-driven resource trading is essential for UAV applications with intensive, time-sensitive computing demands. Traditional spot trading suffers from negotiation delays and high energy costs, wh... |
| Layerwise Federated Learning for Heterogeneous Quantum Clients using Quorus | Jason Han, Nicholas S. DiBrita, Daniel Leeds, Jianqiang Li, Jason Ludmir, Tirthak Patel | 2025-09-30 | 下载 | Quantum machine learning (QML) holds the promise to solve classically intractable problems, but, as critical data can be fragmented across private clients, there is a need for distributed QML in a qua... |
| Adaptive and Resource-efficient Agentic AI Systems for Mobile and Embedded Devices: A Survey | Sicong Liu, Weiye Wu, Xiangrui Xu, Teng Li, Bowen Pang, Bin Guo, Zhiwen Yu | 2025-09-30 | 下载 | Foundation models have reshaped AI by unifying fragmented architectures into scalable backbones with multimodal reasoning and contextual adaptation. |
| LAPIS: A Performance Portable, High Productivity Compiler Framework | Brian Kelley, Sivasankaran Rajamanickam | 2025-09-30 | 下载 | Portability, performance, and productivity are three critical dimensions for evaluating a programming model or compiler infrastructure. Several modern programming models for computational science focu... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Review of Software for Designing and Operating Quantum Networks | Robert J. Hayek, Joaquin Chung, Rajkumar Kettimuthu | 2025-09-30 | 下载 | Quantum network protocol development is crucial to realizing a production-grade network that can support distributed sensing, secure communication, and utility-scale quantum computation. |
| Indoor/Outdoor Spectrum Sharing Enabled by GNSS-based Classifiers | Hossein Nasiri, Muhammad Iqbal Rochman, Monisha Ghosh | 2025-09-30 | 下载 | The desirability of the mid-band frequency range (1 - 10 GHz) for federal and commercial applications, combined with the growing applications for commercial indoor use-cases, such as factory automatio... |
| Introducing Large Language Models into the Design Flow of Time Sensitive Networking | Rubi Debnath, Luxi Zhao, Mohammadreza Barzegaran, Sebastian Steinhorst | 2025-09-30 | 下载 | The growing demand for real-time, safety-critical systems has significantly increased both the adoption and complexity of Time Sensitive Networking (TSN). |
| Target Wake Time Scheduling for Time-sensitive and Energy-efficient Wi-Fi Networks | Fabio Busacca, Corrado Puligheddu, Francesco Raviglione, Riccardo Rusca, Claudio Casetti, Carla Fabiana Chiasserini, Sergio Palazzo | 2025-09-30 | 下载 | Time Sensitive Networking (TSN) is fundamental for the reliable, low-latency networks that will enable the Industrial Internet of Things (IIoT). |
| Toward an Unbiased Collective Memory for Efficient LLM-Based Agentic 6G Cross-Domain Management | Hatim Chergui, Miguel Catalan Cid, Pouria Sayyad Khodashenas, Daniel Camps Mur, Christos Verikoukis | 2025-09-30 | 下载 | This paper introduces a novel framework for proactive cross-domain resource orchestration in 6G RAN-Edge networks, featuring large language model (LLM)-augmented agents. |
| On Computing Top- Simple Shortest Paths from a Single Source | Mattia D'Emidio, Gabriele Di Stefano | 2025-09-30 | 下载 | We investigate the problem of computing the top- simple shortest paths in weighted digraphs. While the single-pair variant -- finding the top- simple shortest paths between two specified vertice... |
| Flexible-Sector 6DMA Base Station: Modeling and Design | Yunli Li, Xiaoming Shi, Xiaodan Shao, Jie Xu, Rui Zhang | 2025-09-30 | 下载 | Six-dimensional movable antenna (6DMA) has emerged as a promising new technology for future wireless networks, which can adaptively adjust the three-dimensional (3D) positions and 3D rotations of ante... |
| Knowledge Defined Networking for 6G: A Reinforcement Learning Example for Resource Management | Erol Koçoğlu, Mehmet Ozdem, Tuğçe Bilen | 2025-09-30 | 下载 | 6G networks are expected to revolutionize connectivity, offering significant improvements in speed, capacity, and smart automation. However, existing network designs will struggle to handle the demand... |
| User-Centric Comparison of 5G NTN and DVB-S2/RCS2 Using OpenAirInterface and OpenSAND | Sumit Kumar, Juan Carlos Estrada-Jimenez, Ion Turcanu | 2025-09-30 | 下载 | The integration of satellite networks into next-generation mobile communication systems has gained considerable momentum with the advent of 5G Non-Terrestrial Networks (5G-NTN). |
| OpenID Connect for Agents (OIDC-A) 1.0: A Standard Extension for LLM-Based Agent Identity and Authorization | Subramanya Nagabhushanaradhya | 2025-09-30 | 下载 | OpenID Connect for Agents (OIDC-A) 1.0 is an extension to OpenID Connect Core 1.0 that provides a comprehensive framework for representing, authenticating, and authorizing LLM-based agents within the ... |
| User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality | Conghao Zhou, Jie Gao, Shisheng Hu, Nan Cheng, Weihua Zhuang, Xuemin Shen | 2025-09-30 | 下载 | Future 6G networks are envisioned to facilitate edge-assisted mobile augmented reality (MAR) via strengthening the collaboration between MAR devices and edge servers. |
| Intelligent Multi-link EDCA Optimization for Delay-Bounded QoS in Wi-Fi 7 | Peini Yi, Wenchi Cheng, Jingqing Wang, Jinzhe Pan, Yuehui Ouyang, Wei Zhang | 2025-09-30 | 下载 | IEEE 802.11be (Wi-Fi 7) introduces Multi-Link Operation (MLO) as a While MLO offers significant parallelism and capacity, realizing its full potential in guaranteeing strict delay bounds and optimizin... |
| From Literature to Insights: Methodological Guidelines for Survey Writing in Communications Research | Dusit Niyato, Octavia A. Dobre, Trung Q. Duong, George K. Karagiannidis, Robert Schober | 2025-09-30 | 下载 | The rapid growth of communications and networking research has created an unprecedented demand for high-quality survey and tutorial papers that can synthesize vast bodies of literature into coherent u... |
| CardioForest: An Explainable Ensemble Learning Model for Automatic Wide QRS Complex Tachycardia Diagnosis from ECG | Vaskar Chakma, Ju Xiaolin, Heling Cao, Xue Feng, Ji Xiaodong, Pan Haiyan, Gao Zhan | 2025-09-30 | 下载 | This study aims to develop and evaluate an ensemble machine learning-based framework for the automatic detection of Wide QRS Complex Tachycardia (WCT) from ECG signals, emphasizing diagnostic accuracy... |
| Think Less, Label Better: Multi-Stage Domain-Grounded Synthetic Data Generation for Fine-Tuning Large Language Models in Telecommunications | Chenhua Shi, Gregor Macdonald, Bhavika Jalli, Wanlu Lei, John Zou, Mridul Jain, Joji Philip | 2025-09-30 | 下载 | The success of large language models (LLMs) depends heavily on large-scale, high-quality instruction-following and reinforcement datasets. However, generating such data through human annotation is pro... |
| PAST: Pilot and Adaptive Orchestration for Timely and Resilient Service Delivery in Edge-Assisted UAV Networks under Spatio-Temporal Dynamics | Houyi Qi, Minghui Liwang, Liqun Fu, Sai Zou, Xinlei Yi, Wei Ni, Huaiyu Dai | 2025-09-30 | 下载 | Incentive-driven resource trading is essential for UAV applications with intensive, time-sensitive computing demands. Traditional spot trading suffers from negotiation delays and high energy costs, wh... |
| Oh-Trust: Overbooking and Hybrid Trading-empowered Resource Scheduling with Smart Reputation Update over Dynamic Edge Networks | Houyi Qi, Minghui Liwang, Liqun Fu, Xianbin Wang, Huaiyu Dai, Xiaoyu Xia | 2025-09-30 | 下载 | Incentive-driven computing resource sharing is crucial for meeting the ever-growing demands of emerging mobile applications. Although conventional spot trading offers a solution, it frequently leads t... |
| Deep Reinforcement Learning-Based Precoding for Multi-RIS-Aided Multiuser Downlink Systems with Practical Phase Shift | Po-Heng Chou, Bo-Ren Zheng, Wan-Jen Huang, Walid Saad, Yu Tsao, Ronald Y. Chang | 2025-09-30 | 下载 | This study considers multiple reconfigurable intelligent surfaces (RISs)-aided multiuser downlink systems with the goal of jointly optimizing the transmitter precoding and RIS phase shift matrix to ma... |
| Capacity-Net-Based RIS Precoding Design without Channel Estimation for mmWave MIMO System | Chun-Yuan Huang, Po-Heng Chou, Wan-Jen Huang, Ying-Ren Chien, Yu Tsao | 2025-09-30 | 下载 | In this paper, we propose Capacity-Net, a novel unsupervised learning approach aimed at maximizing the achievable rate in reflecting intelligent surface (RIS)-aided millimeter-wave (mmWave) multiple i... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Regression Language Models for Code | Yash Akhauri, Xingyou Song, Arissa Wongpanich, Bryan Lewandowski, Mohamed S. Abdelfattah | 2025-09-30 | 下载 | We study code-to-metric regression: predicting numeric outcomes of code executions, a challenging task due to the open-ended nature of programming languages. |
| Tuning the Tuner: Introducing Hyperparameter Optimization for Auto-Tuning | Floris-Jan Willemsen, Rob V. van Nieuwpoort, Ben van Werkhoven | 2025-09-30 | 下载 | Automatic performance tuning (auto-tuning) is widely used to optimize performance-critical applications across many scientific domains by finding the best program variant among many choices. |
| Efficient Construction of Large Search Spaces for Auto-Tuning | Floris-Jan Willemsen, Rob V. van Nieuwpoort, Ben van Werkhoven | 2025-09-30 | 下载 | Automatic performance tuning, or auto-tuning, accelerates high-performance codes by exploring vast spaces of code variants. However, due to the large number of possible combinations and complex constr... |
| AGOCS -- Accurate Google Cloud Simulator Framework | Leszek Sliwko, Vladimir Getov | 2025-09-30 | 下载 | This paper presents the Accurate Google Cloud Simulator (AGOCS) - a novel high-fidelity Cloud workload simulator based on parsing real workload traces, which can be conveniently used on a desktop mach... |
| Hybrid Schemes of NIST Post-Quantum Cryptography Standard Algorithms and Quantum Key Distribution for Key Exchange and Digital Signature | Abel C. H. Chen | 2025-09-30 | 下载 | Since the security of post-quantum cryptography (PQC) algorithms is based on the hardness of mathematical problems, while the security of quantum key distribution (QKD) relies on the fundamental princ... |