
2025-03-15

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| VeriMind: Agentic LLM for Automated Verilog Generation with a Novel Evaluation Metric | Bardia Nadimi, Ghali Omar Boutaib, Hao Zheng | 2025-03-15 | Download | Designing Verilog modules requires meticulous attention to correctness, efficiency, and adherence to design specifications. However, manually writing Verilog code remains a complex and time-consuming ... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LLM & HPC: Benchmarking DeepSeek's Performance in High-Performance Computing Tasks | Noujoud Nader, Patrick Diehl, Steve Brandt, Hartmut Kaiser | 2025-03-15 | Download | Large Language Models (LLMs), such as GPT-4 and DeepSeek, have been applied to a wide range of domains in software engineering. However, their potential in the context of High-Performance Computing (H... |
| FedTilt: Towards Multi-Level Fairness-Preserving and Robust Federated Learning | Binghui Zhang, Luis Mares De La Cruz, Binghui Wang | 2025-03-15 | Download | Federated Learning (FL) is an emerging decentralized learning paradigm that can partly address the privacy concern that cannot be handled by traditional centralized and distributed learning. |
| Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments | Yihong Jin, Ze Yang, Xinhe Xu, Yihan Zhang, Shuyang Ji | 2025-03-15 | Download | With the rapid evolution of Large Language Models (LLMs) and their large-scale experimentation in cloud-computing spaces, the challenge of guaranteeing their security and efficiency in a failure scena... |
| FAILS: A Framework for Automated Collection and Analysis of LLM Service Incidents | Sándor Battaglini-Fischer, Nishanthi Srinivasan, Bálint László Szarvas, Xiaoyu Chu, Alexandru Iosup | 2025-03-15 | Download | Large Language Model (LLM) services such as ChatGPT, DALLE, and Cursor have quickly become essential for society, businesses, and individuals, empowering applications such as chatbots, image generatio... |
| PIPO: Pipelined Offloading for Efficient Inference on Consumer Devices | Yangyijian Liu, Jun Li, Wu-Jun Li | 2025-03-15 | Download | The high memory and computation demand of large language models (LLMs) makes them challenging to be deployed on consumer devices due to limited GPU memory. |
| A Survey on Federated Fine-tuning of Large Language Models | Yebo Wu, Chunlin Tian, Jingguang Li, He Sun, Kahou Tam, Zhanting Zhou, Haicheng Liao, Jing Xiong, Zhijiang Guo, Li Li, Chengzhong Xu | 2025-03-15 | Download | Large Language Models (LLMs) have demonstrated impressive success across various tasks. Integrating LLMs with Federated Learning (FL), a paradigm known as FedLLM, offers a promising avenue for collabo... |
| MoDM: Efficient Serving for Image Generation via Mixture-of-Diffusion Models | Yuchen Xia, Divyam Sharma, Yichao Yuan, Souvik Kundu, Nishil Talati | 2025-03-15 | Download | Diffusion-based text-to-image generation models trade latency for quality: small models are fast but generate lower-quality images, while large models produce better images but are slow. |
| CCRSat: A Collaborative Computation Reuse Framework for Satellite Edge Computing Networks | Ye Zhang, Zhishu Shen, Dawen Jiang, Xiangrui Liu, Qiushi Zheng, Jiong Jin | 2025-03-15 | Download | In satellite computing applications, such as remote sensing, tasks often involve similar or identical input data, leading to the same processing results. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Agentic Search Engine for Real-Time IoT Data | Abdelrahman Elewah, Khalid Elgazzar | 2025-03-15 | Download | The Internet of Things (IoT) has enabled diverse devices to communicate over the Internet, yet the fragmentation of IoT systems limits seamless data sharing and coordinated management. |
| MODRIC: A Cost Effective MODular Data Center Network Architecture with Rich InterConnections | Nabajyoti Medhi, Kumarjit Ray, Rajdeep Ghosh, Dilip Kumar Saikia | 2025-03-15 | Download | Shipping container based modular architectures provide design flexibility in data centers with building blocks to expand the network as and when needed. |
| Open Wireless Digital Twin: End-to-End 5G Mobility Emulation with OpenAirInterface and Ray Tracing | Tetsuya Iye, Masaya Sakamoto, Shohei Takaya, Eisaku Sato, Yuki Susukida, Yu Nagaoka, Kazuki Maruta, Jin Nakazato | 2025-03-15 | Download | This study presents an end-to-end wireless digital twin platform constructed using open-source software and open data to enhance the evaluation of mobile communication systems. |
| Hierarchical Evolutionary Optimization with Predictive Modeling for Stable Delay-Constrained Routing in Vehicular Networks | Zhang Zhiou, Guo Weian, Zhang Qin, Lin Haibin, Li Dongyang | 2025-03-15 | Download | Vehicular Ad Hoc Networks (VANETs) are a cornerstone of intelligent transportation systems, facilitating real-time communication between vehicles and infrastructure. |
| End-to-End Edge AI Service Provisioning Framework in 6G ORAN | Yun Tang, Udhaya Chandhar Srinivasan, Benjamin James Scott, Obumneme Umealor, Dennis Kevogo, Weisi Guo | 2025-03-15 | Download | With the advent of 6G, Open Radio Access Network (O-RAN) architectures are evolving to support intelligent, adaptive, and automated network orchestration. |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| FAILS: A Framework for Automated Collection and Analysis of LLM Service Incidents | Sándor Battaglini-Fischer, Nishanthi Srinivasan, Bálint László Szarvas, Xiaoyu Chu, Alexandru Iosup | 2025-03-15 | Download | Large Language Model (LLM) services such as ChatGPT, DALLE, and Cursor have quickly become essential for society, businesses, and individuals, empowering applications such as chatbots, image generatio... |
