2025-07-11

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Hybrid Systolic Array Accelerator with Optimized Dataflow for Edge Large Language Model Inference | Chun-Ting Chen, HanGyeol Mun, Jian Meng, Mohamed S. Abdelfattah, Jae-sun Seo | 2025-07-11 | Download | Edge inference for large language models (LLM) offers secure, low-latency, and cost-effective inference solutions. We emphasize that an edge accelerator should achieve high area efficiency and minimiz... |
| CEO-DC: Driving Decarbonization in HPC Data Centers with Actionable Insights | Rubén Rodríguez Álvarez, Denisa-Andreea Constantinescu, Miguel Peón-Quirós, David Atienza | 2025-07-11 | Download | The rapid growth of data centers is increasing energy demand and widening the carbon gap in the ICT sector, as fossil fuels still dominate global energy production. |
| Fast and Efficient Merge of Sorted Input Lists in Hardware Using List Offset Merge Sorters | Robert B. Kent, Marios S. Pattichis | 2025-07-11 | Download | A new set of hardware merge sort devices are introduced here, which merge multiple sorted input lists into a single sorted output list in a fast and efficient manner. |
| CCSS: Hardware-Accelerated RTL Simulation with Fast Combinational Logic Computing and Sequential Logic Synchronization | Weigang Feng, Yijia Zhang, Zekun Wang, Zhengyang Wang, Yi Wang, Peijun Ma, Ningyi Xu | 2025-07-11 | Download | As transistor counts in a single chip exceed tens of billions, the complexity of RTL-level simulation and verification has grown exponentially, often extending simulation campaigns to several months. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| On Evaluating Performance of LLM Inference Serving Systems | Amey Agrawal, Nitin Kedia, Anmol Agarwal, Jayashree Mohan, Nipun Kwatra, Souvik Kundu, Ramachandran Ramjee, Alexey Tumanov | 2025-07-11 | Download | The rapid evolution of Large Language Model (LLM) inference systems has yielded significant efficiency improvements. However, our systematic analysis reveals that current evaluation methodologies freq... |
| A Sparsity Predicting Approach for Large Language Models via Activation Pattern Clustering | Nobel Dhar, Bobin Deng, Md Romyull Islam, Xinyue Zhang, Kazi Fahim Ahmad Nasif, Kun Suo | 2025-07-11 | Download | Large Language Models (LLMs) exhibit significant activation sparsity, where only a subset of neurons are active for a given input. Although this sparsity presents opportunities to reduce computational... |
| MQFQ-Sticky: Fair Queueing For Serverless GPU Functions | Alexander Fuerst, Siddharth Anil, Vishakha Dixit, Purushottam Kulkarni, Prateek Sharma | 2025-07-11 | Download | Hardware accelerators like GPUs are now ubiquitous in data centers, but are not fully supported by common cloud abstractions such as Functions as a Service (FaaS). |
| Carbon-Aware Workflow Scheduling with Fixed Mapping and Deadline Constraint | Dominik Schweisgut, Anne Benoit, Yves Robert, Henning Meyerhenke | 2025-07-11 | Download | Large data and computing centers consume a significant share of the world's energy consumption. A prominent subset of the workloads in such centers are workflows with interdependent tasks, usually rep... |
| CCSS: Hardware-Accelerated RTL Simulation with Fast Combinational Logic Computing and Sequential Logic Synchronization | Weigang Feng, Yijia Zhang, Zekun Wang, Zhengyang Wang, Yi Wang, Peijun Ma, Ningyi Xu | 2025-07-11 | Download | As transistor counts in a single chip exceed tens of billions, the complexity of RTL-level simulation and verification has grown exponentially, often extending simulation campaigns to several months. |
| Towards AI-Native RAN: An Operator's Perspective of 6G Day 1 Standardization | Nan Li, Qi Sun, Lehan Wang, Xiaofei Xu, Jinri Huang, Chunhui Liu, Jing Gao, Yuhong Huang, Chih-Lin I | 2025-07-11 | Download | Artificial Intelligence/Machine Learning (AI/ML) has become the most certain and prominent feature of 6G mobile networks. Unlike 5G, where AI/ML was not natively integrated but rather an add-on featur... |
| Content-Oblivious Leader Election in 2-Edge-Connected Networks | Jérémie Chalopin, Yi-Jun Chang, Lyuting Chen, Giuseppe A. Di Luna, Haoran Zhou | 2025-07-11 | Download | Censor-Hillel, Cohen, Gelles, and Sela (PODC 2022 & Distributed Computing 2023) studied fully-defective asynchronous networks, where communication channels may suffer an extreme form of alteration err... |
| Fast and Interactive Byzantine Fault-tolerant Web Services via Session-Based Consensus Decoupling | Ahmad Zaki Akmal, Azkario Rizky Pratama, Guntur Dharma Putra | 2025-07-11 | Download | Byzantine fault-tolerant (BFT) web services provide critical integrity guarantees for distributed applications but face significant latency challenges that hinder interactive user experiences. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Knowledge Graph-Based approach for Sustainable 6G End-to-End System Design | Akshay Jain, Sylvaine Kerboeuf, Sokratis Barmpounakis, Cristóbal Vinagre Z., Stefan Wendt, Dinh Thai Bui, Pol Alemany, Riccardo Nicolicchia, José María Jorquera Valero, Dani Korpi, Mohammad Hossein Moghaddam, Mikko A. Uusitalo, Patrik Rugeland, Abdelkader Outtagarts, Karthik Upadhya, Panagiotis Demestichas, Raul Muñoz, Manuel Gil Pérez, Daniel Adanza, Ricard Vilalta | 2025-07-11 | Download | Previous generations of cellular communication, such as 5G, have been designed with the objective of improving key performance indicators (KPIs) such as throughput, latency, etc. |
| Qualitative Assessment of Low Power Wide Area Network Protocols and their Security Aspect | Wesley dos Reis Bezerra, Lais Machado Bezerra, Carlos Becker Westphall | 2025-07-11 | Download | There are currently many communication options in the Internet of Things, even in particular areas such as constrained and battery-powered devices, such as Low Power Wide Area Networks. |
| Stabilizing and Optimizing Inter-Shell Routing in LEO Networks with Integrated Routing Cost | Yaojia Wang, Qi Zhang, Kun Qiu, Yue Gao | 2025-07-11 | Download | The low Earth orbit (LEO) mega-constellation network (LMCN), which uses thousands of satellites across multi-shell architectures to deliver different services, is facing challenges in inter-shell rout... |
| Recovery of UAV Swarm-enabled Collaborative Beamforming in Low-altitude Wireless Networks under Wind Field Disturbances | Geng Sun, Chenbang Liu, Jiahui Li, Guannan Qu, Shuang Liang, Jiacheng Wang, Changyuan Zhao, Dusit Niyato | 2025-07-11 | Download | Unmanned aerial vehicle (UAV) swarms utilizing collaborative beamforming (CB) in low-altitude wireless networks (LAWN) demonstrate significant potential for enhanced communication range, energy effici... |
| Age of Information Optimization in Laser-charged UAV-assisted IoT Networks: A Multi-agent Deep Reinforcement Learning Method | Geng Sun, Likun Zhang, Jiahui Li, Jing Wu, Jiacheng Wang, Zemin Sun, Changyuan Zhao, Victor C. M. Leung | 2025-07-11 | Download | The integration of unmanned aerial vehicles (UAVs) with Internet of Things (IoT) networks offers promising solutions for efficient data collection. |
| Towards AI-Native RAN: An Operator's Perspective of 6G Day 1 Standardization | Nan Li, Qi Sun, Lehan Wang, Xiaofei Xu, Jinri Huang, Chunhui Liu, Jing Gao, Yuhong Huang, Chih-Lin I | 2025-07-11 | Download | Artificial Intelligence/Machine Learning (AI/ML) has become the most certain and prominent feature of 6G mobile networks. Unlike 5G, where AI/ML was not natively integrated but rather an add-on featur... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| MH-FSF: A Unified Framework for Overcoming Benchmarking and Reproducibility Limitations in Feature Selection Evaluation | Vanderson Rocha, Diego Kreutz, Gabriel Canto, Hendrio Bragança, Eduardo Feitosa | 2025-07-11 | Download | Feature selection is vital for building effective predictive models, as it reduces dimensionality and emphasizes key features. However, current research often suffers from limited benchmarking and rel... |
| CEO-DC: Driving Decarbonization in HPC Data Centers with Actionable Insights | Rubén Rodríguez Álvarez, Denisa-Andreea Constantinescu, Miguel Peón-Quirós, David Atienza | 2025-07-11 | Download | The rapid growth of data centers is increasing energy demand and widening the carbon gap in the ICT sector, as fossil fuels still dominate global energy production. |
