Skip to content

2025-04-30

cs.AR - Architecture

标题作者发布日期PDF摘要
Exploration of Cryptocurrency Mining-Specific GPUs in AI Applications: A Case Study of CMP 170HXXing Kangwei2025-04-30下载This study systematically tests a computational power reuse scheme proposed by the open source community disabling specific instruction sets (Fused Multiply Add instructions) through CUDA source code ...
GPU Performance Portability needs AutotuningBurkhard Ringlein, Thomas Parnell, Radu Stoica2025-04-30下载As LLMs grow in complexity, achieving state-of-the-art performance requires tight co-design across algorithms, software, and hardware. Today's reliance on a single dominant platform limits portability...
Coyote v2: Raising the Level of Abstraction for Data Center FPGAsBenjamin Ramhorst, Dario Korolija, Maximilian Jakob Heer, Jonas Dann, Luhao Liu, Gustavo Alonso2025-04-30下载In the trend towards hardware specialization, FPGAs play a dual role as accelerators for offloading, e.g., network virtualization, and as a vehicle for prototyping and exploring hardware designs.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Audo-Sight: Enabling Ambient Interaction For Blind And Visually Impaired IndividualsBhanuja Ainary2025-04-30下载Visually impaired people face significant challenges when attempting to interact with and understand complex environments, and traditional assistive technologies often struggle to quickly provide nece...
GPRat: Gaussian Process Regression with Asynchronous TasksMaksim Helmann, Alexander Strack, Dirk Pflüger2025-04-30下载Python is the de-facto language for software development in artificial intelligence (AI). Commonly used libraries, such as PyTorch and TensorFlow, rely on parallelization built into their BLAS backend...
Message Optimality and Message-Time Trade-offs for APSP and BeyondFabien Dufoulon, Shreyas Pai, Gopal Pandurangan, Sriram Pemmaraju, Peter Robinson2025-04-30下载Round complexity is an extensively studied metric of distributed algorithms. In contrast, our knowledge of the \emph{message complexity} of distributed computing problems and its relationship (if any)...
Near-Optimal Distributed Ruling Sets for Trees and High-Girth GraphsMalte Baumecker, Yannic Maus, Jara Uitto2025-04-30下载Given a graph G=(V,E)G=(V,E), a β-ruling set is a subset SVS\subseteq V that is i) independent, and ii) every node vVv\in V has a node of SS within distance β.
Exploration of Cryptocurrency Mining-Specific GPUs in AI Applications: A Case Study of CMP 170HXXing Kangwei2025-04-30下载This study systematically tests a computational power reuse scheme proposed by the open source community disabling specific instruction sets (Fused Multiply Add instructions) through CUDA source code ...
Deterministic Distributed DFS and Other Problems via Cycle Separators in Planar GraphsBenjamin Jauregui, Pedro Montealegre, Ivan Rapaport2025-04-30下载One of the most basic techniques in algorithm design consists of breaking a problem into subproblems and then proceeding recursively. In the case of graph algorithms, one way to implement this approac...
Scientific Workflow Scheduling in Cloud Considering Cold Start and Variable Pricing ModelSuvarthi Sarkar, Sparsh Mittal, Shivam Garg, Aryabartta Sahu2025-04-30下载Cloud computing has become a pivotal platform for executing scientific workflows due to its scalable and cost-effective infrastructure. Scientific Cloud Service Providers (SCSPs) act as intermediaries...
CWASI: A WebAssembly Runtime Shim for Inter-function Communication in the Serverless Edge-Cloud ContinuumCynthia Marcelino, Stefan Nastic2025-04-30下载Serverless Computing brings advantages to the Edge-Cloud continuum, like simplified programming and infrastructure management. In composed workflows, where serverless functions need to exchange data c...
UAV Marketplace Simulation Tool for BVLOS OperationsKıvanç Şerefoğlu, Önder Gürcan, Reyhan Aydoğan2025-04-30下载We present a simulation tool for evaluating team formation in autonomous multi-UAV (Unmanned Aerial Vehicle) missions that operate Beyond Visual Line of Sight (BVLOS).
Galvatron: An Automatic Distributed System for Efficient Foundation Model TrainingXinyi Liu, Yujie Wang, Shenhan Zhu, Fangcheng Fu, Qingshuo Liu, Guangming Lin, Bin Cui2025-04-30下载Galvatron is a distributed system for efficiently training large-scale Foundation Models. It overcomes the complexities of selecting optimal parallelism strategies by automatically identifying the mos...
Tolerating Disasters with Hierarchical ConsensusWassim Yahyaoui, Joachim Bruneau-Queyreix, Jérémie Decouchant, Marcus Völp2025-04-30下载Geo-replication provides disaster recovery after catastrophic accidental failures or attacks, such as fires, blackouts or denial-of-service attacks to a data center or region.
Robust and Scalable Renaming with Subquadratic BitsSirui Bai, Xinyu Fu, Yuheng Wang, Yuyi Wang, Chaodong Zheng2025-04-30下载In the renaming problem, a set of nn nodes, each with a unique identity from a large namespace [N][N], needs to obtain new unique identities in a smaller namespace [M][M].
ZipLLM: Efficient LLM Storage via Model-Aware Synergistic Data Deduplication and CompressionZirui Wang, Tingfeng Lan, Zhaoyuan Su, Juncheng Yang, Yue Cheng2025-04-30下载Modern model hubs, such as Hugging Face, store tens of petabytes of LLMs, with fine-tuned variants vastly outnumbering base models and dominating storage consumption.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Q Cells in Wireless NetworksMartin Haenggi2025-04-30下载For a given set of transmitters such as cellular base stations or WiFi access points, is it possible to analytically characterize the set of locations that are "covered" in the sense that users at the...
Generalizing Biased Backpressure Routing and Scheduling to Wireless Multi-hop Networks with Advanced Air-interfacesZhongyuan Zhao, Yujun Ming, Ananthram Swami, Kevin Chan, Fikadu Dagefu, Santiago Segarra2025-04-30下载Backpressure (BP) routing and scheduling is a well-established resource allocation method for wireless multi-hop networks, known for its fully distributed operations and proven maximum queue stability...
DBSCAN-based Vehicle Clustering and UAV Placement for NOMA-based Resource Management in Cellular V2X CommunicationsHossein Davoudi, Behrouz Shahgholi Ghahfarokhi, Neda Moghim, Sachin Shetty2025-04-30下载In the future wireless networks, terrestrial, aerial, space, and maritime wireless networks are integrated into a unified network to meet the needs of a fully connected global network.
Toward Realization of Low-Altitude Economy Networks: Core Architecture, Integrated Technologies, and Future DirectionsYixian Wang, Geng Sun, Zemin Sun, Jiacheng Wang, Jiahui Li, Changyuan Zhao, Jing Wu, Shuang Liang, Minghao Yin, Pengfei Wang, Dusit Niyato, Sumei Sun, Dong In Kim2025-04-30下载The rise of the low-altitude economy (LAE) is propelling urban development and emerging industries by integrating advanced technologies to enhance efficiency, safety, and sustainability in low-altitud...
FreeBeacon: Efficient Communication and Data Aggregation in Battery-Free IoTGaosheng Liu, Kasım Sinan Yıldırım, Lin Wang2025-04-30下载To improve sustainability, Internet-of-Things (IoT) is increasingly adopting battery-free devices powered by ambient energy scavenged from the environment.
Optimal Online Probe Allocation for Classical and Quantum Network TomographyXuchuang Wang, Yu-Zhen Janice Chen, Matheus Guedes de Andrade, Mohammad Hajiesmaili, John C. S. Lui, Ting He, Don Towsley2025-04-30下载How to efficiently perform network tomography is a fundamental problem in network management and monitoring. A network tomography task usually consists of applying multiple probing experiments, e.g.
A Novel Compound AI Model for 6G Networks in 3D ContinuumMilos Gravara, Andrija Stanisic, Stefan Nastic2025-04-30下载The 3D continuum presents a complex environment that spans the terrestrial, aerial and space domains, with 6Gnetworks serving as a key enabling technology.
A Unified QoS-Aware Multiplexing Framework for Next Generation Immersive Communication with Legacy Wireless ApplicationsJihong Li, Shunqing Zhang, Tao Yu, Guangjin Pan, Kaixuan Huang, Xiaojing Chen, Yanzan Sun, Junyu Liu, Jiandong Li, Derrick Wing Kwan Ng2025-04-30下载Immersive communication, including emerging augmented reality, virtual reality, and holographic telepresence, has been identified as a key service for enabling next-generation wireless applications.
Generative QoE Modeling: A Lightweight Approach for Telecom NetworksVinti Nayar, Kanica Sachdev, Brejesh Lall2025-04-30下载Quality of Experience (QoE) prediction plays a crucial role in optimizing resource management and enhancing user satisfaction across both telecommunication and OTT services.
Low latency FPGA implementation of twisted Edward curve cryptography hardware accelerator over prime fieldMd Rownak Hossain, Md Sazedur Rahman, Kh Shahriya Zaman, Walid El Fezzani, Mohammad Arif Sobhan Bhuiyan, Chia Chao Kang, Teh Jia Yew, Mahdi H. Miraz2025-04-30下载The performance of any elliptic curve cryptography hardware accelerator significantly relies on the efficiency of the underlying point multiplication (PM) architecture.
Covert Prompt Transmission for Secure Large Language Model ServicesRuichen Zhang, Yinqiu Liu, Shunpu Tang, Jiacheng Wang, Dusit Niyato, Geng Sun, Yonghui Li, Sumei Sun2025-04-30下载This paper investigates covert prompt transmission for secure and efficient large language model (LLM) services over wireless networks. We formulate a latency minimization problem under fidelity and d...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Confidential Serverless ComputingPatrick Sabanic, Masanori Misono, Teofil Bodea, Julian Pritzi, Michael Hackl, Dimitrios Stavrakakis, Pramod Bhatotia2025-04-30下载Although serverless computing offers compelling cost and deployment simplicity advantages, a significant challenge remains in securely managing sensitive data as it flows through the network of epheme...
Concurrency Testing in the Linux Kernel via eBPFJiacheng Xu, Dylan Wolff, Xing Yi Han, Jialin Li, Abhik Roychoudhury2025-04-30下载Concurrency is vital for our critical software to meet modern performance requirements, yet concurrency bugs are notoriously difficult to detect and reproduce.

基于 VitePress 构建