2025-05-07

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Veritas: Deterministic Verilog Code Synthesis from LLM-Generated Conjunctive Normal Form | Prithwish Basu Roy, Akashdeep Saha, Manaar Alam, Johann Knechtel, Michail Maniatakos, Ozgur Sinanoglu, Ramesh Karri | 2025-05-07 | Download | Automated Verilog code synthesis poses significant challenges and typically demands expert oversight. Traditional high-level synthesis (HLS) methods often fail to scale for real-world designs. |
| Flexing RISC-V Instruction Subset Processors to Extreme Edge | Alireza Raisiardali, Konstantinos Iordanou, Jedrzej Kufel, Kowshik Gudimetla, Kris Myny, Emre Ozer | 2025-05-07 | Download | This paper presents an automated approach for designing processors that support a subset of the RISC-V instruction set architecture (ISA) for a new class of applications at Extreme Edge. |
| Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration | Asma Baobaid, Mahmoud Meribout | 2025-05-07 | Download | Cost-effective machine vision systems dedicated to real-time and accurate face detection and recognition in public places are crucial for many modern applications. |
| Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition | Asma Baobaid, Mahmoud Meribout | 2025-05-07 | Download | Video face detection and recognition in public places at the edge is required in several applications, such as security reinforcement and contactless access to authorized venues. |
| Accelerating Triangle Counting with Real Processing-in-Memory Systems | Lorenzo Asquini, Manos Frouzakis, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu, Francesco Silvestri | 2025-05-07 | Download | Triangle Counting (TC) is a procedure that involves enumerating the number of triangles within a graph. It has important applications in numerous fields, such as social or biological network analysis ... |
| Onboard Optimization and Learning: A Survey | Monirul Islam Pavel, Siyi Hu, Mahardhika Pratama, Ryszard Kowalczyk | 2025-05-07 | Download | Onboard learning is a transformative approach in edge AI, enabling real-time data processing, decision-making, and adaptive model training directly on resource-constrained devices without relying on c... |
| SpeedLLM: An FPGA Co-design of Large Language Model Inference Accelerator | Peipei Wang, Wu Guan, Liping Liang, Zhijun Wang, Hanqing Luo, Zhibin Zhang | 2025-05-07 | Download | This paper introduces SpeedLLM, a neural network accelerator designed on the Xilinx Alveo U280 platform and optimized for the Tinyllama framework to enhance edge computing performance. |
| In-Situ Hardware Error Detection Using Specification-Derived Petri Net Models and Behavior-Derived State Sequences | Tomonari Tanaka, Takumi Uezono, Kohei Suenaga, Masanori Hashimoto | 2025-05-07 | Download | In hardware accelerators used in data centers and safety-critical applications, soft errors and resultant silent data corruption significantly compromise reliability, particularly when upsets occur in... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights | Ozan Gokdemir, Carlo Siebenschuh, Alexander Brace, Azton Wells, Brian Hsu, Kyle Hippe, Priyanka V. Setty, Aswathy Ajith, J. Gregory Pauloski, Varuni Sastry, Sam Foreman, Huihuo Zheng, Heng Ma, Bharat Kale, Nicholas Chia, Thomas Gibbs, Michael E. Papka, Thomas Brettin, Francis J. Alexander, Anima Anandkumar, Ian Foster, Rick Stevens, Venkatram Vishwanath, Arvind Ramanathan | 2025-05-07 | Download | The volume of scientific literature is growing exponentially, leading to underutilized discoveries, duplicated efforts, and limited cross-disciplinary collaboration. |
| ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling | Xiao Wang, Jong-Youl Choi, Takuya Kurihaya, Isaac Lyngaas, Hong-Jun Yoon, Xi Xiao, David Pugmire, Ming Fan, Nasik M. Nafi, Aristeidis Tsaris, Ashwin M. Aji, Maliha Hossain, Mohamed Wahib, Dali Wang, Peter Thornton, Prasanna Balaprakash, Moetasim Ashfaq, Dan Lu | 2025-05-07 | Download | Sparse observations and coarse-resolution climate models limit effective regional decision-making, underscoring the need for robust downscaling. |
| Big Data Architecture for Large Organizations | Fathima Nuzla Ismail, Abira Sengupta, Shanika Amarasoma | 2025-05-07 | Download | The exponential growth of big data has transformed how large organisations leverage information to drive innovation, optimise processes, and maintain competitive advantages. |
| Exploring Influence Factors on LLM Suitability for No-Code Development of End User IoT Applications | Minghe Wang, Alexandra Kapp, Trever Schirmer, Tobias Pfandzelter, David Bermbach | 2025-05-07 | Download | No-Code Development Platforms (NCDPs) empower non-technical end users to build applications tailored to their specific demands without writing code. |
| Optimal Deterministic Rendezvous in Labeled Lines | Yann Bourreau, Ananth Narayanan, Alexandre Nolin | 2025-05-07 | Download | In a rendezvous task, some mobile agents dispersed in a network have to gather at an arbitrary common site. We consider the rendezvous problem on the infinite labeled line, with 2 agents, without co... |
| Comparing CPU and GPU compute of PERMANOVA on MI300A | Igor Sfiligoi | 2025-05-07 | Download | Comparing the tradeoffs of CPU and GPU compute for memory-heavy algorithms is often challenging, due to the drastically different memory subsystems on host CPUs and discrete GPUs. |
| FDA-Opt: Communication-Efficient Federated Fine-Tuning of Language Models | Michail Theologitis, Vasilis Samoladas, Antonios Deligiannakis | 2025-05-07 | Download | Federated Learning (FL) enables the utilization of vast, previously inaccessible data sources. At the same time, pre-trained Language Models (LMs) have taken the world by storm and for good reason. |
| An Asynchronous Distributed-Memory Parallel Algorithm for k-mer Counting | Souvadra Hati, Akihiro Hayashi, Richard Vuduc | 2025-05-07 | Download | This paper describes a new asynchronous algorithm and implementation for the problem of k-mer counting (KC), which concerns quantifying the frequency of length k substrings in a DNA sequence. |
| Accelerating Triangle Counting with Real Processing-in-Memory Systems | Lorenzo Asquini, Manos Frouzakis, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu, Francesco Silvestri | 2025-05-07 | Download | Triangle Counting (TC) is a procedure that involves enumerating the number of triangles within a graph. It has important applications in numerous fields, such as social or biological network analysis ... |
| Learning-Based Approaches for Job Shop Scheduling Problems: A Review | Karima Rihane, Adel Dabah, Abdelhakim AitZai | 2025-05-07 | Download | Job Shop Scheduling (JSS) is one of the most studied combinatorial optimization problems. It involves scheduling a set of jobs with predefined processing constraints on a set of machines to achieve a ... |
| FRAIN to Train: A Fast-and-Reliable Solution for Decentralized Federated Learning | Sanghyeon Park, Soo-Mook Moon | 2025-05-07 | Download | Federated learning (FL) enables collaborative model training across distributed clients while preserving data locality. Although FedAvg pioneered synchronous rounds for global model averaging, slower ... |
| Mazzaroth: A High-Throughput DAG Consensus with State Root | Haohan Li | 2025-05-07 | Download | Nakamoto Consensus achieves a decentralized ledger through a single-chain blockchain, assuming a maximum network delay, which limits block generation speed, resulting in low throughput. |
| Maxing Out the SVM: Performance Impact of Memory and Program Cache Sizes in the Agave Validator | Turan Vural, Yuki Yuminaga, Alex Petrosyan, Ben Livshits | 2025-05-07 | Download | In this paper we analyze some of the bottlenecks in the execution pipeline of Solana's Agave validator client, focusing on RAM and program cache usage under mainnet conditions. |
| Plexus: Taming Billion-edge Graphs with 3D Parallel Full-graph GNN Training | Aditya K. Ranjan, Siddharth Singh, Cunyang Wei, Abhinav Bhatele | 2025-05-07 | Download | Graph neural networks (GNNs) leverage the connectivity and structure of real-world graphs to learn intricate properties and relationships between nodes. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Impact of Weather on Satellite Communication: Evaluating Starlink Resilience | Muhammad Asad Ullah, Antti Heikkinen, Mikko Uitto, Antti Anttonen, Konstantin Mikhaylov | 2025-05-07 | Download | Satellite communications have emerged as one of the most feasible solutions to provide global wireless coverage and connect the unconnected. Starlink dominates the market with over 7,000 operational s... |
| Pipelining Split Learning in Multi-hop Edge Networks | Wei Wei, Zheng Lin, Tao Li, Xuanheng Li, Xianhao Chen | 2025-05-07 | Download | To support large-scale model training, split learning (SL) enables multiple edge devices/servers to share the intensive training workload. However, most existing works on SL focus solely on two-tier m... |
| Design and Evaluation of an NDN-Based Network for Distributed Digital Twins | Chen Chen, Zihan Jia, Ze Wang, Lin Cui, Fung Po Tso | 2025-05-07 | Download | Digital twins (DT) have received significant attention due to their numerous benefits, such as real-time data analytics and cost reduction in production. |
| Joint Task Offloading and Channel Allocation in Spatial-Temporal Dynamic for MEC Networks | Tianyi Shi, Tiankui Zhang, Jonathan Loo, Rong Huang, Yapeng Wang | 2025-05-07 | Download | Computation offloading and resource allocation are critical in mobile edge computing (MEC) systems to handle the massive and complex requirements of applications restricted by limited resources. |
| On-Device LLM for Context-Aware Wi-Fi Roaming | Ju-Hyung Lee, Yanqing Lu, Klaus Doppler | 2025-05-07 | Download | Roaming in Wireless LAN (Wi-Fi) is a critical yet challenging task for maintaining seamless connectivity in dynamic mobile environments. Conventional threshold-based or heuristic schemes often fail, l... |
| LLMs' Suitability for Network Security: A Case Study of STRIDE Threat Modeling | AbdulAziz AbdulGhaffar, Ashraf Matrawy | 2025-05-07 | Download | Artificial Intelligence (AI) is expected to be an integral part of next-generation AI-native 6G networks. With the prevalence of AI, researchers have identified numerous use cases of AI in network sec... |
| Satellite-Assisted Low-Altitude Economy Networking: Concepts, Applications, and Opportunities | Shizhao He, Jiacheng Wang, Ying-Chang Liang, Geng Sun, Dusit Niyato | 2025-05-07 | Download | The low-altitude economy (LAE) is a new economic paradigm that leverages low-altitude vehicles (LAVs) to perform diverse missions across diverse areas. |
| Shadow Wireless Intelligence: Large Language Model-Driven Reasoning in Covert Communications | Yuanai Xie, Zhaozhi Liu, Xiao Zhang, Shihua Zhang, Rui Hou, Minrui Xu, Ruichen Zhang, Dusit Niyato | 2025-05-07 | Download | Covert Communications (CC) can secure sensitive transmissions in industrial, military, and mission-critical applications within 6G wireless networks. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Multiserver-job Response Time under Multilevel Scaling | Isaac Grosof, Hayriye Ayhan | 2025-05-07 | Download | We study the multiserver-job setting in the load-focused multilevel scaling limit, where system load approaches capacity much faster than the growth of the number of servers n. |
| Comparing CPU and GPU compute of PERMANOVA on MI300A | Igor Sfiligoi | 2025-05-07 | Download | Comparing the tradeoffs of CPU and GPU compute for memory-heavy algorithms is often challenging, due to the drastically different memory subsystems on host CPUs and discrete GPUs. |
