Skip to content

2025-02-12

cs.AR - Architecture

标题作者发布日期PDF摘要
InTAR: Inter-Task Auto-Reconfigurable Accelerator Design for High Data Volume Variation in DNNsZifan He, Anderson Truong, Yingqi Cao, Jason Cong2025-02-12下载The rise of deep neural networks (DNNs) has driven an increased demand for computing power and memory. Modern DNNs exhibit high data volume variation (HDV) across tasks, which poses challenges for FPG...
LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 BitsZikai Zhou, Qizheng Zhang, Hermann Kumbong, Kunle Olukotun2025-02-12下载Fine-tuning large language models (LLMs) is increasingly costly as models scale to hundreds of billions of parameters, and even parameter-efficient fine-tuning (PEFT) methods like LoRA remain resource...
DEMOTIC: A Differentiable Sampler for Multi-Level Digital CircuitsArash Ardakani, Minwoo Kang, Kevin He, Qijing Huang, Vighnesh Iyer, Suhong Moon, John Wawrzynek2025-02-12下载Efficient sampling of satisfying formulas for circuit satisfiability (CircuitSAT), a well-known NP-complete problem, is essential in modern front-end applications for thorough testing and verification...
Neuromorphic Principles for Efficient Large Language Models on Intel Loihi 2Steven Abreu, Sumit Bam Shrestha, Rui-Jie Zhu, Jason Eshraghian2025-02-12下载Large language models (LLMs) deliver impressive performance but require large amounts of energy. In this work, we present a MatMul-free LLM architecture adapted for Intel's neuromorphic processor, Loi...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Optimal Resource Utilization in Hyperledger Fabric: A Comprehensive SPN-Based Performance Evaluation ParadigmCarlos Melo, Glauber Gonçalves, Francisco A. Silva, Leonel Feitosa, Iure Fé, André Soares, Eunmi Choi, Tuan Anh Nguyen, Dugki Min2025-02-12下载Hyperledger Fabric stands as a leading framework for permissioned blockchain systems, ensuring data security and auditability for enterprise applications.
Performance Modeling and Evaluation of Hyperledger Fabric: An Analysis Based on Transaction Flow and Endorsement PoliciesCarlos Melo, Glauber Gonçalves, Francisco A. Silva, André Soares2025-02-12下载Blockchain is a paradigm derived from distributed systems, protocols, and security concepts. However, can blockchain applications provide services in industrial environments, especially concerning per...
FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge DevicesDezhong Yao, Yuexin Shi, Tongtong Liu, Zhiqiang Xu2025-02-12下载Federated Learning (FL) is increasingly adopted in edge computing scenarios, where a large number of heterogeneous clients operate under constrained or sufficient resources.
Efficient Split Learning LSTM Models for FPGA-based Edge IoT DevicesRomina Soledad Molina, Vukan Ninkovic, Dejan Vukobratovic, Maria Liz Crespo, Marco Zennaro2025-02-12下载Split Learning (SL) recently emerged as an efficient paradigm for distributed Machine Learning (ML) suitable for the Internet Of Things (IoT)-Cloud systems.
Morpheus Consensus: Excelling on trails and autobahnsAndrew Lewis-Pye, Ehud Shapiro2025-02-12下载Recent research in consensus has often focussed on protocols for State-Machine-Replication (SMR) that can handle high throughputs. Such state-of-the-art protocols (generally DAG-based) induce undue ov...
Accelerating Stable Matching between Workers and Spatial-Temporal Tasks for Dynamic MCS: A Stagewise Service Trading ApproachHouyi Qi, Minghui Liwang, Xianbin Wang, Liqun Fu, Yiguang Hong, Li Li, Zhipeng Cheng2025-02-12下载Designing effective incentive mechanisms in mobile crowdsensing (MCS) networks is crucial for engaging distributed mobile users (workers) to contribute heterogeneous data for various applications (tas...
Memory Offloading for Large Language Model Inference with Latency SLO GuaranteesChenxiang Ma, Zhisheng Ye, Hanyu Zhao, Zehua Yang, Tianhao Fu, Jiaxun Han, Jie Zhang, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Yong Li, Diyu Zhou2025-02-12下载Offloading large language models (LLMs) state to host memory during inference promises to reduce operational costs by supporting larger models, longer inputs, and larger batch sizes.
Democratizing AI: Open-source Scalable LLM Training on GPU-based SupercomputersSiddharth Singh, Prajwal Singhania, Aditya Ranjan, John Kirchenbauer, Jonas Geiping, Yuxin Wen, Neel Jain, Abhimanyu Hans, Manli Shu, Aditya Tomar, Tom Goldstein, Abhinav Bhatele2025-02-12下载Training and fine-tuning large language models (LLMs) with hundreds of billions to trillions of parameters requires tens of thousands of GPUs, and a highly scalable software stack.
Provably Robust Federated Reinforcement LearningMinghong Fang, Xilong Wang, Neil Zhenqiang Gong2025-02-12下载Federated reinforcement learning (FRL) allows agents to jointly learn a global decision-making policy under the guidance of a central server. While FRL has advantages, its decentralized design makes i...
Future Resource Bank for ISAC: Achieving Fast and Stable Win-Win Matching for Both Individuals and CoalitionsHouyi Qi, Minghui Liwang, Seyyedali Hosseinalipour, Liqun Fu, Sai Zou, Wei Ni2025-02-12下载Future wireless networks must support emerging applications where environmental awareness is as critical as data transmission. Integrated Sensing and Communication (ISAC) enables this vision by allowi...
General Coded Computing: Adversarial SettingsParsa Moradi, Hanzaleh Akbarinodehi, Mohammad Ali Maddah-Ali2025-02-12下载Conventional coded computing frameworks are predominantly tailored for structured computations, such as matrix multiplication and polynomial evaluation.
Parallel kk-Core Decomposition: Theory and PracticeYouzhe Liu, Xiaojun Dong, Yan Gu, Yihan Sun2025-02-12下载This paper proposes efficient solutions for kk-core decomposition with high parallelism. The problem of kk-core decomposition is fundamental in graph analysis and has applications across various dom...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Geofeed Adoption and AuthenticationDipsy Desai, Kicho Yu, Sulyab Thottungal Valapu2025-02-12下载IP Geofeed is a recently proposed informational standard that allows network operators to publish the geographical location of deployed IPv4 and IPv6 prefixes.
Investigation of Advanced Persistent Threats Network-based Tactics, Techniques and ProceduresAlmuthanna Alageel, Sergio Maffeis, Imperial College London2025-02-12下载The scarcity of data and the high complexity of Advanced Persistent Threats (APTs) attacks have created challenges in comprehending their behavior and hindered the exploration of effective detection t...
A Framework to Develop and Validate RL-Based Obstacle-Aware UAV Positioning AlgorithmsKamran Shafafi, Manuel Ricardo, Rui Campos2025-02-12下载Unmanned Aerial Vehicles (UAVs) increasingly enhance the Quality of Service (QoS) in wireless networks due to their flexibility and cost-effectiveness.
Mapping the Landscape of Generative AI in Network Monitoring and ManagementGiampaolo Bovenzi, Francesco Cerasuolo, Domenico Ciuonzo, Davide Di Monda, Idio Guarino, Antonio Montieri, Valerio Persico, Antonio Pescapè2025-02-12下载Generative Artificial Intelligence (GenAI) models such as LLMs, GPTs, and Diffusion Models have recently gained widespread attention from both the research and the industrial communities.
The Forest Behind the Tree: Revealing Hidden Smart Home Communication PatternsFrançois De Keersmaeker, Rémi Van Boxem, Cristel Pelsser, Ramin Sadre2025-02-12下载The widespread use of Smart Home devices has attracted significant research interest in understanding their behavior within home networks. Unlike general-purpose computers, these devices exhibit relat...
Testbed Development: An Intelligent O-RAN based Cell-Free MIMO NetworkYi Chu, Mostafa Rahmani, Josh Shackleton, David Grace, Kanapathippillai Cumanan, Hamed Ahmadi, Alister Burr2025-02-12下载Cell-free multiple input multiple output (CF-MIMO) systems improve spectral and energy efficiencies using distributed access points (APs) to provide reliable service across an area equivalent to multi...
Accelerating Stable Matching between Workers and Spatial-Temporal Tasks for Dynamic MCS: A Stagewise Service Trading ApproachHouyi Qi, Minghui Liwang, Xianbin Wang, Liqun Fu, Yiguang Hong, Li Li, Zhipeng Cheng2025-02-12下载Designing effective incentive mechanisms in mobile crowdsensing (MCS) networks is crucial for engaging distributed mobile users (workers) to contribute heterogeneous data for various applications (tas...
The MoE-Empowered Edge LLMs Deployment: Architecture, Challenges, and OpportunitiesNing Li, Song Guo, Tuo Zhang, Muqing Li, Zicong Hong, Qihua Zhou, Xin Yuan, Haijun Zhang2025-02-12下载The powerfulness of LLMs indicates that deploying various LLMs with different scales and architectures on end, edge, and cloud to satisfy different requirements and adaptive heterogeneous hardware is ...
Take What You Need: Flexible Multi-Task Semantic Communications with Channel AdaptationXiang Chen, Shuying Gan, Chenyuan Feng, Xijun Wang, Tony Q. S. Quek2025-02-12下载The growing demand for efficient semantic communication systems capable of managing diverse tasks and adapting to fluctuating channel conditions has driven the development of robust, resource-efficien...
From Layers to States: A State Space Model Perspective to Deep Neural Network Layer DynamicsQinshuo Liu, Weiqin Zhao, Wei Huang, Yanwen Fang, Lequan Yu, Guodong Li2025-02-12下载The depth of neural networks is a critical factor for their capability, with deeper models often demonstrating superior performance. Motivated by this, significant efforts have been made to enhance la...
Future Resource Bank for ISAC: Achieving Fast and Stable Win-Win Matching for Both Individuals and CoalitionsHouyi Qi, Minghui Liwang, Seyyedali Hosseinalipour, Liqun Fu, Sai Zou, Wei Ni2025-02-12下载Future wireless networks must support emerging applications where environmental awareness is as critical as data transmission. Integrated Sensing and Communication (ISAC) enables this vision by allowi...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Investigation of Advanced Persistent Threats Network-based Tactics, Techniques and ProceduresAlmuthanna Alageel, Sergio Maffeis, Imperial College London2025-02-12下载The scarcity of data and the high complexity of Advanced Persistent Threats (APTs) attacks have created challenges in comprehending their behavior and hindered the exploration of effective detection t...

cs.PF - Performance

标题作者发布日期PDF摘要
Novel Lower Bounds on M/G/k SchedulingZiyuan Wang, Izzy Grosof2025-02-12下载In queueing systems, effective scheduling algorithms are essential for optimizing performance. Optimal scheduling for the M/G/k queue has been explored in the heavy traffic limit, but much remains unk...
Optimizing Asynchronous Federated Learning: A Delicate Trade-Off Between Model-Parameter Staleness and Update FrequencyAbdelkrim Alahyane, Céline Comte, Matthieu Jonckheere, Éric Moulines2025-02-12下载Synchronous federated learning (FL) scales poorly with the number of clients due to the straggler effect. Algorithms like FedAsync and GeneralizedFedAsync address this limitation by enabling asynchron...
LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 BitsZikai Zhou, Qizheng Zhang, Hermann Kumbong, Kunle Olukotun2025-02-12下载Fine-tuning large language models (LLMs) is increasingly costly as models scale to hundreds of billions of parameters, and even parameter-efficient fine-tuning (PEFT) methods like LoRA remain resource...

基于 VitePress 构建