2025-08-21
cs.AR - Architecture
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation | Ahmed Allam, Youssef Mansour, Mohamed Shalan | 2025-08-21 | Download | Large Language Models (LLMs) have demonstrated remarkable capabilities in Register Transfer Level (RTL) design, enabling high-quality code generation from natural language descriptions. |
| Putting the Context back into Memory | David A. Roberts | 2025-08-21 | Download | Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving... |
| Row-Column Hybrid Grouping for Fault-Resilient Multi-Bit Weight Representation on IMC Arrays | Kang Eun Jeon, Sangheum Yeon, Jinhee Kim, Hyeonsu Bang, Johnny Rhe, Jong Hwan Ko | 2025-08-21 | Download | This paper addresses two critical challenges in analog In-Memory Computing (IMC) systems that limit their scalability and deployability: the computational unreliability caused by stuck-at faults (SAFs... |
| JEDI-linear: Fast and Efficient Graph Neural Networks for Jet Tagging on FPGAs | Zhiqiang Que, Chang Sun, Sudarshan Paramesvaran, Emyr Clement, Katerina Karakoulaki, Christopher Brown, Lauri Laatu, Arianna Cox, Alexander Tapper, Wayne Luk, Maria Spiropulu | 2025-08-21 | Download | Graph Neural Networks (GNNs), particularly Interaction Networks (INs), have shown exceptional performance for jet tagging at the CERN High-Luminosity Large Hadron Collider (HL-LHC). |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| Multi-IaC-Eval: Benchmarking Cloud Infrastructure as Code Across Multiple Formats | Sam Davidson, Li Sun, Bhavana Bhasker, Laurent Callot, Anoop Deoras | 2025-08-21 | Download | Infrastructure as Code (IaC) is fundamental to modern cloud computing, enabling teams to define and manage infrastructure through machine-readable configuration files. |
| ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation | Ahmed Allam, Youssef Mansour, Mohamed Shalan | 2025-08-21 | Download | Large Language Models (LLMs) have demonstrated remarkable capabilities in Register Transfer Level (RTL) design, enabling high-quality code generation from natural language descriptions. |
| HyperFlexis: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling | Zahra Yousefijamarani, Xinglu Wang, Qian Wang, Morgan Lindsay Heisler, Taha Shabani, Niloofar Gholipour, Parham Yassini, Hong Chang, Kan Chen, Qiantao Zhang, Xiaolong Bai, Jiannan Wang, Ying Xiong, Yong Zhang, Zhenan Fan | 2025-08-21 | Download | Modern large language model (LLM) serving systems face challenges from highly variable requests with diverse lengths, priorities, and stage-specific service-level objectives (SLOs). |
| Mitigating context switching in densely packed Linux clusters with Latency-Aware Group Scheduling | Al Amjad Tawfiq Isstaif, Evangelia Kalyvianaki, Richard Mortier | 2025-08-21 | Download | Cluster orchestrators such as Kubernetes depend on accurate estimates of node capacity and job requirements. Inaccuracies in either lead to poor placement decisions and degraded cluster performance. |
| CausalMesh: A Formally Verified Causal Cache for Stateful Serverless Computing | Haoran Zhang, Zihao Zhang, Shuai Mu, Sebastian Angel, Vincent Liu | 2025-08-21 | Download | Stateful serverless workflows consist of multiple serverless functions that access state on a remote database. Developers sometimes add a cache layer between the serverless runtime and the database to... |
| Efficient Mixed-Precision Large Language Model Inference with TurboMind | Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen | 2025-08-21 | Download | Mixed-precision inference techniques reduce the memory and computational demands of Large Language Models (LLMs) by applying hybrid precision formats to model weights, activations, and KV caches. |
| Lower Bounds for k-Set Agreement in Fault-Prone Networks | Pierre Fraigniaud, Minh Hang Nguyen, Ami Paz, Ulrich Schmid, Hugo Rincon Galeana | 2025-08-21 | Download | We develop a new lower bound for k-set agreement in synchronous message-passing systems connected by an arbitrary directed communication network, where up to t processes may crash. |
| Universal Dancing by Luminous Robots under Sequential Schedulers | Caterina Feletti, Paola Flocchini, Debasish Pattanayak, Giuseppe Prencipe, Nicola Santoro | 2025-08-21 | Download | The Dancing problem requires a swarm of autonomous mobile robots to form a sequence of patterns, aka perform a choreography. Existing work has proven that some crucial restrictions on choreographi... |
| On the Effectiveness of Graph Reordering for Accelerating Approximate Nearest Neighbor Search on GPU | Yutaro Oguri, Mai Nishimura, Yusuke Matsui | 2025-08-21 | Download | We present the first systematic investigation of graph reordering effects for graph-based Approximate Nearest Neighbor Search (ANNS) on a GPU. |
| Databelt: A Continuous Data Path for Serverless Workflows in the 3D Compute Continuum | Cynthia Marcelino, Leonard Guelmino, Thomas Pusztai, Stefan Nastic | 2025-08-21 | Download | Typically, serverless functions rely on remote storage services for managing state, which can result in increased latency and network communication overhead. |
| Optimizing Compilation for Distributed Quantum Computing via Clustering and Annealing | Ruilin Zhou, Jinglei Cheng, Yuhang Gan, Junyu Liu, Chen Qian | 2025-08-21 | Download | Efficiently mapping quantum programs onto distributed quantum computing (DQC) is challenging, particularly when considering the heterogeneous quantum processing units (QPUs) with different structures... |
| Reliable Multi-view 3D Reconstruction for 'Just-in-time' Edge Environments | Md. Nurul Absur, Abhinav Kumar, Swastik Brahma, Saptarshi Debroy | 2025-08-21 | Download | Multi-view 3D reconstruction applications are revolutionizing critical use cases that require rapid situational-awareness, such as emergency response, tactical scenarios, and public safety. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| Interface on demand: Towards AI native Control interfaces for 6G | Abhishek Dandekar, Prashiddha D. Thapa, Ashrafur Rahman, Julius Schulz-Zander | 2025-08-21 | Download | Traditional standardized network interfaces face significant limitations, including vendor-specific incompatibilities, rigid design assumptions, and lack of adaptability for new functionalities. |
| Unlocking the Performance Potential of Mega-Constellation Networks: An Exploration of Structure-Building Paradigms | Xiangtong Wang, Wei Li, Menglong Yang, Songchen Han | 2025-08-21 | Download | Mega-constellation networks (MCNs) are transforming global internet access by providing ubiquitous connectivity to millions of users worldwide. |
| Toward Autonomous Digital Populations for Communication-Sensing-Computation Ecosystem | Gaosheng Zhao, Dong In Kim | 2025-08-21 | Download | Future communication networks are expected to achieve deep integration of communication, sensing, and computation, forming a tightly coupled and autonomously operating infrastructure system. |
cs.OS - Operating Systems
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| CXLAimPod: CXL Memory is all you need in AI era | Yiwei Yang, Yusheng Zheng, Yiqi Chen, Zheng Liang, Kexin Chu, Zhe Zhou, Andi Quinn, Wei Zhang | 2025-08-21 | Download | The proliferation of data-intensive applications, ranging from large language models to key-value stores, increasingly stresses memory systems with mixed read-write access patterns. |
| Iridescent: A Framework Enabling Online System Implementation Specialization | Vaastav Anand, Deepak Garg, Antoine Kaufmann | 2025-08-21 | Download | Specializing systems to specifics of the workload they serve and platform they are running on often significantly improves performance. However, specializing systems is difficult in practice because o... |
| Mitigating context switching in densely packed Linux clusters with Latency-Aware Group Scheduling | Al Amjad Tawfiq Isstaif, Evangelia Kalyvianaki, Richard Mortier | 2025-08-21 | Download | Cluster orchestrators such as Kubernetes depend on accurate estimates of node capacity and job requirements. Inaccuracies in either lead to poor placement decisions and degraded cluster performance. |
| Putting the Context back into Memory | David A. Roberts | 2025-08-21 | Download | Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving... |
cs.PF - Performance
| Title | Authors | Published | Link | Abstract |
|---|---|---|---|---|
| Putting the Context back into Memory | David A. Roberts | 2025-08-21 | Download | Requests arriving at main memory are often different from what programmers can observe or estimate by using CPU-based monitoring. Hardware cache prefetching, memory request scheduling and interleaving... |
| Efficient Mixed-Precision Large Language Model Inference with TurboMind | Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen | 2025-08-21 | Download | Mixed-precision inference techniques reduce the memory and computational demands of Large Language Models (LLMs) by applying hybrid precision formats to model weights, activations, and KV caches. |
| SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts--Extended Version | Nghiem Thanh Pham, Tung Kieu, Duc-Manh Nguyen, Son Ha Xuan, Nghia Duong-Trung, Danh Le-Phuoc | 2025-08-21 | Download | Small Language Models (SLMs) offer computational efficiency and accessibility, yet a systematic evaluation of their performance and environmental impact remains lacking. |
| KG-EDAS: A Meta-Metric Framework for Evaluating Knowledge Graph Completion Models | Haji Gul, Abul Ghani Naim, Ajaz Ahmad Bhat | 2025-08-21 | Download | Knowledge Graphs (KGs) enable applications in various domains such as semantic search, recommendation systems, and natural language processing. |