2025-07-01

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
CarbonClarity: Understanding and Addressing Uncertainty in Embodied Carbon for Sustainable Computing	Xuesi Chen, Leo Han, Anvita Bhagavathula, Udit Gupta	2025-07-01	下载	Embodied carbon footprint modeling has become an area of growing interest due to its significant contribution to carbon emissions in computing.
How Fast Can Graph Computations Go on Fine-grained Parallel Architectures	Yuqing Wang, Charles Colley, Brian Wheatman, Jiya Su, David F. Gleich, Andrew A. Chien	2025-07-01	下载	Large-scale graph problems are of critical and growing importance and historically parallel architectures have provided little support. In the spirit of co-design, we explore the question, How fast ca...
RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles	David Hunt, Shaocheng Luo, Spencer Hallyburton, Shafii Nillongo, Yi Li, Tingjun Chen, Miroslav Pajic	2025-07-01	下载	Low-cost indoor mobile robots have gained popularity with the increasing adoption of automation in homes and commercial spaces. However, existing lidar and camera-based solutions have limitations such...
A New Family of Thread to Core Allocation Policies for an SMT ARM Processor	Marta Navarro, Josué Feliu, Salvador Petit, María E. Gómez, Julio Sahuquillo	2025-07-01	下载	Modern high-performance servers commonly integrate Simultaneous Multithreading (SMT) processors, which efficiently boosts throughput over single-threaded cores.
VEDA: Efficient LLM Generation Through Voting-based KV Cache Eviction and Dataflow-flexible Accelerator	Zhican Wang, Hongxiang Fan, Haroon Waris, Gang Wang, Zhenyu Li, Jianfei Jiang, Yanan Sun, Guanghui He	2025-07-01	下载	Large Language Models (LLMs) excel in natural language processing tasks but pose significant computational and memory challenges for edge deployment due to their intensive resource demands.
ChatHLS: Towards Systematic Design Automation and Optimization for High-Level Synthesis	Runkai Li, Jia Xiong, Xiuyuan He, Jiaqi Lv, Jieru Zhao, Xi Wang	2025-07-01	下载	The increasing complexity of computational demands has spurred the adoption of domain-specific accelerators, yet traditional hardware design methodologies remain constrained by prolonged development a...
Presto: Hardware Acceleration of Ciphers for Hybrid Homomorphic Encryption	Yeonsoo Jeon, Mattan Erez, Michael Orshansky	2025-07-01	下载	Hybrid Homomorphic Encryption (HHE) combines symmetric key and homomorphic encryption to reduce ciphertext expansion crucial in client-server deployments of HE.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Capacity Planning and Scheduling for Jobs with Uncertainty in Resource Usage and Duration	Sunandita Patra, Mehtab Pathan, Mahmoud Mahfouz, Parisa Zehtabi, Wided Ouaja, Daniele Magazzeni, Manuela Veloso	2025-07-01	下载	Organizations around the world schedule jobs (programs) regularly to perform various tasks dictated by their end users. With the major movement towards using a cloud computing infrastructure, our orga...
FLARE: A Dataflow-Aware and Scalable Hardware Architecture for Neural-Hybrid Scientific Lossy Compression	Wenqi Jia, Ying Huang, Jian Xu, Zhewen Hu, Sian Jin, Jiannan Tian, Yuede Ji, Miao Yin	2025-07-01	下载	Scientific simulation leveraging high-performance computing (HPC) systems is crucial for modeling complex systems and phenomena in fields such as astrophysics, climate science, and fluid dynamics, gen...
Stannic: Systolic STochAstic ONliNe SchedulIng AcCelerator	Adam H. Ross, Vairavan Palaniappan, Debjit Pal	2025-07-01	下载	Efficient workload scheduling is a critical challenge in modern heterogeneous computing environments, particularly in high-performance computing (HPC) systems.
Efficient Gate Reordering for Distributed Quantum Compiling in Data Centers	Riccardo Mengoni, Walter Nadalin, Mathys Rennela, Jimmy Rotureau, Tom Darras, Julien Laurat, Eleni Diamanti, Ioannis Lavdas	2025-07-01	下载	Just as classical computing relies on distributed systems, the quantum computing era requires new kinds of infrastructure and software tools. Quantum networks will become the backbone of hybrid, quant...
How Fast Can Graph Computations Go on Fine-grained Parallel Architectures	Yuqing Wang, Charles Colley, Brian Wheatman, Jiya Su, David F. Gleich, Andrew A. Chien	2025-07-01	下载	Large-scale graph problems are of critical and growing importance and historically parallel architectures have provided little support. In the spirit of co-design, we explore the question, How fast ca...
Turning AI Data Centers into Grid-Interactive Assets: Results from a Field Demonstration in Phoenix, Arizona	Philip Colangelo, Ayse K. Coskun, Jack Megrue, Ciaran Roberts, Shayan Sengupta, Varun Sivaram, Ethan Tiao, Aroon Vijaykar, Chris Williams, Daniel C. Wilson, Zack MacFarland, Daniel Dreiling, Nathan Morey, Anuja Ratnayake, Baskar Vairamohan	2025-07-01	下载	Artificial intelligence (AI) is fueling exponential electricity demand growth, threatening grid reliability, raising prices for communities paying for new energy infrastructure, and stunting AI innova...
A New Family of Thread to Core Allocation Policies for an SMT ARM Processor	Marta Navarro, Josué Feliu, Salvador Petit, María E. Gómez, Julio Sahuquillo	2025-07-01	下载	Modern high-performance servers commonly integrate Simultaneous Multithreading (SMT) processors, which efficiently boosts throughput over single-threaded cores.
yProv4ML: Effortless Provenance Tracking for Machine Learning Systems	Gabriele Padovani, Valentine Anantharaj, Sandro Fiore	2025-07-01	下载	The rapid growth of interest in large language models (LLMs) reflects their potential for flexibility and generalization, and attracted the attention of a diverse range of researchers.
PANDAS: Peer-to-peer, Adaptive Networking for Data Availability Sampling within Ethereum Consensus Timebounds	Matthieu Pigaglio, Onur Ascigil, Michał Król, Sergi Rene, Felix Lange, Kaleem Peeroo, Ramin Sadre, Vladimir Stankovic, Etienne Rivière	2025-07-01	下载	Layer-2 protocols can assist Ethereum's limited throughput, but globally broadcasting layer-2 data limits their scalability. The Danksharding evolution of Ethereum aims to support the selective distri...
Provenance Tracking in Large-Scale Machine Learning Systems	Gabriele Padovani, Valentine Anantharaj, Sandro Fiore	2025-07-01	下载	As the demand for large scale AI models continues to grow, the optimization of their training to balance computational efficiency, execution time, accuracy and energy consumption represents a critical...
Safe Low Bandwidth SPV: A Formal Treatment of Simplified Payment Verification Protocols and Security Bounds	Craig S Wright	2025-07-01	下载	This paper presents a complete formal specification, protocol description, and mathematical proof structure for Simplified Payment Verification (SPV) as originally defined in the Bitcoin whitepaper \c...
Accelerating Loading WebGraphs in ParaGrapher	Mohsen Koohi Esfahani	2025-07-01	下载	ParaGrapher is a graph loading API and library that enables graph processing frameworks to load large-scale compressed graphs with minimal overhead.
Toward Edge General Intelligence with Multiple-Large Language Model (Multi-LLM): Architecture, Trust, and Orchestration	Haoxiang Luo, Yinqiu Liu, Ruichen Zhang, Jiacheng Wang, Gang Sun, Dusit Niyato, Hongfang Yu, Zehui Xiong, Xianbin Wang, Xuemin Shen	2025-07-01	下载	Edge computing enables real-time data processing closer to its source, thus improving the latency and performance of edge-enabled AI applications.
DynoStore: A wide-area distribution system for the management of data over heterogeneous storage	Dante D. Sanchez-Gallegos, J. L. Gonzalez-Compean, Maxime Gonthier, Valerie Hayot-Sasson, J. Gregory Pauloski, Haochen Pan, Kyle Chard, Jesus Carretero, Ian Foster	2025-07-01	下载	Data distribution across different facilities offers benefits such as enhanced resource utilization, increased resilience through replication, and improved performance by processing data near its sour...
Collaborative Multi-Agent Reinforcement Learning Approach for Elastic Cloud Resource Scaling	Bruce Fang, Danyi Gao	2025-07-01	下载	This paper addresses the challenges of rapid resource variation and highly uncertain task loads in cloud computing environments. It proposes an optimization method for elastic cloud resource scaling b...
Edge Computing and its Application in Robotics: A Survey	Nazish Tahir, Ramviyas Parasuraman	2025-07-01	下载	The Edge computing paradigm has gained prominence in both academic and industry circles in recent years. By implementing edge computing facilities and services in robotics, it becomes a key enabler in...
Towards Resource-Efficient Serverless LLM Inference with SLINFER	Chuhao Xu, Zijun Li, Quan Chen, Han Zhao, Xueyan Tang, Minyi Guo	2025-07-01	下载	The rise of LLMs has driven demand for private serverless deployments, characterized by moderate-sized models and infrequent requests. While existing serverless solutions follow exclusive GPU allocati...
Real-Time In-Network Machine Learning on P4-Programmable FPGA SmartNICs with Fixed-Point Arithmetic and Taylor	Mohammad Firas Sada, John J. Graham, Mahidhar Tatineni, Dmitry Mishin, Thomas A. DeFanti, Frank Würthwein	2025-07-01	下载	As machine learning (ML) applications become integral to modern network operations, there is an increasing demand for network programmability that enables low-latency ML inference for tasks such as Qu...
Find a Scapegoat: Poisoning Membership Inference Attack and Defense to Federated Learning	Wenjin Mo, Zhiyuan Li, Minghong Fang, Mingwei Fang	2025-07-01	下载	Federated learning (FL) allows multiple clients to collaboratively train a global machine learning model with coordination from a central server, without needing to share their raw data.
Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and NVIDIA Data Center GPUs	Mohammad Firas Sada, John J. Graham, Elham E Khoda, Mahidhar Tatineni, Dmitry Mishin, Rajesh K. Gupta, Rick Wagner, Larry Smarr, Thomas A. DeFanti, Frank Würthwein	2025-07-01	下载	This study presents a benchmarking analysis of the Qualcomm Cloud AI 100 Ultra (QAic) accelerator for large language model (LLM) inference, evaluating its energy efficiency (throughput per watt), perf...
HelixPipe: Efficient Distributed Training of Long Sequence Transformers with Attention Parallel Pipeline Parallelism	Geng Zhang, Shenggan Cheng, Xuanlei Zhao, Ziming Liu, Yang You	2025-07-01	下载	As transformer sequence lengths grow, existing pipeline parallelisms incur suboptimal performance due to the quadratic attention computation and the substantial memory overhead.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
A Full-Stack Platform Architecture for Self-Organised Social Coordination	Matthew Scott, Jeremy Pitt	2025-07-01	下载	To mitigate the restrictive centralising and monopolistic tendencies of platformisation, we aim to empower local communities by democratising platforms for self-organised social coordination.
QUIC Delay Control: an implementation of congestion and delay control	Saverio Mascolo, Andrea Vittorio Balillo, Gioacchino Manfredi, Davide D'Agostino, Luca De Cicco	2025-07-01	下载	A new congestion and delay control algorithm named QUIC Delay Control (QUIC-DC) is proposed for controlling not only congestion but also the queueing delay encountered along the forward communication ...
Enhancing Vehicular Platooning with Wireless Federated Learning: A Resource-Aware Control Framework	Beining Wu, Jun Huang, Qiang Duan, Liang Dong, Zhipeng Cai	2025-07-01	下载	This paper aims to enhance the performance of Vehicular Platooning (VP) systems integrated with Wireless Federated Learning (WFL). In highly dynamic environments, vehicular platoons experience frequen...
Stealtooth: Breaking Bluetooth Security Abusing Silent Automatic Pairing	Keiichiro Kimura, Hiroki Kuzuno, Yoshiaki Shiraishi, Masakatu Morii	2025-07-01	下载	Bluetooth is a pervasive wireless communication technology used by billions of devices for short-range connectivity. The security of Bluetooth relies on the pairing process, where devices establish sh...
PANDAS: Peer-to-peer, Adaptive Networking for Data Availability Sampling within Ethereum Consensus Timebounds	Matthieu Pigaglio, Onur Ascigil, Michał Król, Sergi Rene, Felix Lange, Kaleem Peeroo, Ramin Sadre, Vladimir Stankovic, Etienne Rivière	2025-07-01	下载	Layer-2 protocols can assist Ethereum's limited throughput, but globally broadcasting layer-2 data limits their scalability. The Danksharding evolution of Ethereum aims to support the selective distri...
Toward Edge General Intelligence with Multiple-Large Language Model (Multi-LLM): Architecture, Trust, and Orchestration	Haoxiang Luo, Yinqiu Liu, Ruichen Zhang, Jiacheng Wang, Gang Sun, Dusit Niyato, Hongfang Yu, Zehui Xiong, Xianbin Wang, Xuemin Shen	2025-07-01	下载	Edge computing enables real-time data processing closer to its source, thus improving the latency and performance of edge-enabled AI applications.
Remote Rendering for Virtual Reality: performance comparison of multimedia frameworks and protocols	Daniel Mejías, Inhar Yeregui, Roberto Viola, Miguel Fernández, Mario Montagud	2025-07-01	下载	The increasing complexity of Extended Reality (XR) applications demands substantial processing power and high bandwidth communications, often unavailable on lightweight devices.
Towards a Playground to Democratize Experimentation and Benchmarking of AI Agents for Network Troubleshooting	Zhihao Wang, Alessandro Cornacchia, Franco Galante, Carlo Centofanti, Alessio Sacco, Dingde Jiang	2025-07-01	下载	Recent research has demonstrated the effectiveness of Artificial Intelligence (AI), and more specifically, Large Language Models (LLMs), in supporting network configuration synthesis and automating ne...
Edge Computing and its Application in Robotics: A Survey	Nazish Tahir, Ramviyas Parasuraman	2025-07-01	下载	The Edge computing paradigm has gained prominence in both academic and industry circles in recent years. By implementing edge computing facilities and services in robotics, it becomes a key enabler in...
Real-Time In-Network Machine Learning on P4-Programmable FPGA SmartNICs with Fixed-Point Arithmetic and Taylor	Mohammad Firas Sada, John J. Graham, Mahidhar Tatineni, Dmitry Mishin, Thomas A. DeFanti, Frank Würthwein	2025-07-01	下载	As machine learning (ML) applications become integral to modern network operations, there is an increasing demand for network programmability that enables low-latency ML inference for tasks such as Qu...
Seeing Through the Fog: Empowering Mobile Devices to Expose and Mitigate RAN Buffer Effects on Delay-Sensitive Protocols	Yuxin Liu, Tianyang Zhang, Kyle Jamieson, Yaxiong Xie	2025-07-01	下载	Delay-based protocols rely on end-to-end delay measurements to detect network congestion. However, in cellular networks, Radio Access Network (RAN) buffers introduce significant delays unrelated to co...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Development and Comparative Evaluation of Three Artificial Intelligence Models (NLP, LLM, JEPA) for Predicting Triage in Emergency Departments: A 7-Month Retrospective Proof-of-Concept	Edouard Lansiaux, Ramy Azzouz, Emmanuel Chazard, Amélie Vromant, Eric Wiel	2025-07-01	下载	Emergency departments struggle with persistent triage errors, especially undertriage and overtriage, which are aggravated by growing patient volumes and staff shortages.
Turning AI Data Centers into Grid-Interactive Assets: Results from a Field Demonstration in Phoenix, Arizona	Philip Colangelo, Ayse K. Coskun, Jack Megrue, Ciaran Roberts, Shayan Sengupta, Varun Sivaram, Ethan Tiao, Aroon Vijaykar, Chris Williams, Daniel C. Wilson, Zack MacFarland, Daniel Dreiling, Nathan Morey, Anuja Ratnayake, Baskar Vairamohan	2025-07-01	下载	Artificial intelligence (AI) is fueling exponential electricity demand growth, threatening grid reliability, raising prices for communities paying for new energy infrastructure, and stunting AI innova...
PANDAS: Peer-to-peer, Adaptive Networking for Data Availability Sampling within Ethereum Consensus Timebounds	Matthieu Pigaglio, Onur Ascigil, Michał Król, Sergi Rene, Felix Lange, Kaleem Peeroo, Ramin Sadre, Vladimir Stankovic, Etienne Rivière	2025-07-01	下载	Layer-2 protocols can assist Ethereum's limited throughput, but globally broadcasting layer-2 data limits their scalability. The Danksharding evolution of Ethereum aims to support the selective distri...
Empirical Analysis Of Heuristic and Approximation Algorithms for the The Mutual-Visibility Problem	Vanja Stojanović, Bor Pangeršič	2025-07-01	下载	The NP-complete mutual-visibility (MV) problem currently lacks empirical analysis on its practical behaviour despite theoretical studies. This paper addresses this gap by implementing and evaluating t...
Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms	Zain Taufique, Aman Vyas, Antonio Miele, Pasi Liljeberg, Anil Kanduri	2025-07-01	下载	Compound AI (cAI) systems chain multiple AI models to solve complex problems. cAI systems are typically composed of deep neural networks (DNNs), transformers, and large language models (LLMs), exhibit...

2025-07-01 ​

cs.AR - Architecture ​

cs.DC - Distributed, Parallel, and Cluster Computing ​

cs.NI - Networking and Internet Architecture ​

cs.PF - Performance ​

2025-07-01

cs.AR - Architecture

cs.DC - Distributed, Parallel, and Cluster Computing

cs.NI - Networking and Internet Architecture

cs.PF - Performance