Appearance
2025-04-02
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| PIMDAL: Mitigating the Memory Bottleneck in Data Analytics using a Real Processing-in-Memory System | Manos Frouzakis, Juan Gómez-Luna, Geraldo F. Oliveira, Mohammad Sadrosadati, Onur Mutlu | 2025-04-02 | 下载 | Database Management Systems (DBMSs) are crucial for efficient data management and analytics, and are used in several different application domains. |
| A flexible framework for early power and timing comparison of time-multiplexed CGRA kernel executions | Maxime Henri Aspros, Juan Sapriza, Giovanni Ansaloni, David Atienza | 2025-04-02 | 下载 | At the intersection between traditional CPU architectures and more specialized options such as FPGAs or ASICs lies the family of reconfigurable hardware architectures, termed Coarse-Grained Reconfigur... |
| Efficient Calibration for RRAM-based In-Memory Computing using DoRA | Weirong Dong, Kai Zhou, Zhen Kong, Quan Cheng, Junkai Huang, Zhengke Yang, Masanori Hashimoto, Longyang Lin | 2025-04-02 | 下载 | Resistive In-Memory Computing (RIMC) offers ultra-efficient computation for edge AI but faces accuracy degradation due to RRAM conductance drift over time. |
| MERE: Hardware-Software Co-Design for Masking Cache Miss Latency in Embedded Processors | Dean You, Jieyu Jiang, Xiaoxuan Wang, Yushu Du, Zhihang Tan, Wenbo Xu, Hui Wang, Jiapeng Guan, Zhenyuan Wang, Ran Wei, Shuai Zhao, Zhe Jiang | 2025-04-02 | 下载 | Runahead execution is a technique to mask memory latency caused by irregular memory accesses. By pre-executing the application code during occurrences of long-latency operations and prefetching antici... |
| HH-PIM: Dynamic Optimization of Power and Performance with Heterogeneous-Hybrid PIM for Edge AI Devices | Sangmin Jeon, Kangju Lee, Kyeongwon Lee, Woojoo Lee | 2025-04-02 | 下载 | Processing-in-Memory (PIM) architectures offer promising solutions for efficiently handling AI applications in energy-constrained edge environments. |
| Versatile silicon integrated photonic processor: a reconfigurable solution for next-generation AI clusters | Ying Zhu, Yifan Liu, Xinyu Yang, Kailai Liu, Xin Hua, Ming Luo, Jia Liu, Siyao Chang, Shengxiang Zhang, Miao Wu, Zhicheng Wang, Hongguang Zhang, Daigao Chen, Xi Xiao, Shaohua Yu | 2025-04-02 | 下载 | The Artificial Intelligence models pose serious challenges in intensive computing and high-bandwidth communication for conventional electronic circuit-based computing clusters. |
| FireGuard: A Generalized Microarchitecture for Fine-Grained Security Analysis on OoO Superscalar Cores | Zhe Jiang, Sam Ainsworth, Timothy Jones | 2025-04-02 | 下载 | High-performance security guarantees rely on hardware support. Generic programmable support for fine-grained instruction analysis has gained broad interest in the literature as a fundamental building ... |
| MEEK: Re-thinking Heterogeneous Parallel Error Detection Architecture for Real-World OoO Superscalar Processors | Zhe Jiang, Minli Liao, Sam Ainsworth, Dean You, Timothy Jones | 2025-04-02 | 下载 | Heterogeneous parallel error detection is an approach to achieving fault-tolerant processors, leveraging multiple power-efficient cores to re-execute software originally run on a high-performance core... |
| GigaAPI for GPU Parallelization | M. Suvarna, O. Tehrani | 2025-04-02 | 下载 | GigaAPI is a user-space API that simplifies multi-GPU programming, bridging the gap between the capabilities of parallel GPU systems and the ability of developers to harness their full potential. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| PIMDAL: Mitigating the Memory Bottleneck in Data Analytics using a Real Processing-in-Memory System | Manos Frouzakis, Juan Gómez-Luna, Geraldo F. Oliveira, Mohammad Sadrosadati, Onur Mutlu | 2025-04-02 | 下载 | Database Management Systems (DBMSs) are crucial for efficient data management and analytics, and are used in several different application domains. |
| Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction | Daniel Becking, Ingo Friese, Karsten Müller, Thomas Buchholz, Mandy Galkow-Schneider, Wojciech Samek, Detlev Marpe | 2025-04-02 | 下载 | In telecommunications, Autonomous Networks (ANs) automatically adjust configurations based on specific requirements (e.g., bandwidth) and available resources. |
| Improved Bounds for Coin Flipping, Leader Election, and Random Selection | Eshan Chattopadhyay, Mohit Gurumukhani, Noam Ringach, Rocco Servedio | 2025-04-02 | 下载 | Random selection, leader election, and collective coin flipping are fundamental tasks in fault-tolerant distributed computing. We study these problems in the full-information model where despite decad... |
| Distributed Triangle Detection is Hard in Few Rounds | Sepehr Assadi, Janani Sundaresan | 2025-04-02 | 下载 | In the distributed triangle detection problem, we have an -vertex network with one player for each vertex of the graph who sees the edges incident on the vertex. |
| Shared-Memory Hierarchical Process Mapping | Christian Schulz, Henning Woydt | 2025-04-02 | 下载 | Modern large-scale scientific applications consist of thousands to millions of individual tasks. These tasks involve not only computation but also communication with one another. |
| Satellite Edge Artificial Intelligence with Large Models: Architectures and Technologies | Yuanming Shi, Jingyang Zhu, Chunxiao Jiang, Linling Kuang, Khaled B. Letaief | 2025-04-02 | 下载 | Driven by the growing demand for intelligent remote sensing applications, large artificial intelligence (AI) models pre-trained on large-scale unlabeled datasets and fine-tuned for downstream tasks ha... |
| Approximate Agreement Algorithms for Byzantine Collaborative Learning | Mélanie Cambus, Darya Melnyk, Tijana Milentijević, Stefan Schmid | 2025-04-02 | 下载 | In Byzantine collaborative learning, clients in a peer-to-peer network collectively learn a model without sharing their data by exchanging and aggregating stochastic gradient estimates. |
| Split Federated Learning for Low-Altitude Wireless Networks: Joint Sensing, Communication, Computation, and Control Co-design | Xiangwang Hou, Xianghe Wang, Jiacheng Wang, Zekai Zhang, Jun Du, Jingjing Wang, Yong Ren | 2025-04-02 | 下载 | Unmanned aerial vehicles (UAVs) with integrated sensing, communication, computation and control (ISC3) capabilities have become key enablers of next-generation wireless networks. |
| Exploiting the Uncertainty of the Longest Paths: Response Time Analysis for Probabilistic DAG Tasks | Yiyang Gao, Shuai Zhao, Boyang Li, Xinwei Fang, Zhiyang Lin, Zhe Jiang, Nan Guan | 2025-04-02 | 下载 | Parallel real-time systems (e.g., autonomous driving systems) often contain functionalities with complex dependencies and execution uncertainties, leading to significant timing variability which can b... |
| Accelerating Blockchain Scalability: New Models for Parallel Transaction Execution in the EVM | Souradeep Das, Konpat Preechakul, Jonas Bäumer, Riddhi Patel, Jefferson Jinchuan Li | 2025-04-02 | 下载 | As the number of decentralized applications and users on Ethereum grows, the ability of the blockchain to efficiently handle a growing number of transactions becomes increasingly strained. |
| Age-Aware Partial Gradient Update Strategy for Federated Learning Over the Air | Ruihao Du, Jiaqi Zhu, Zeshen Li, Howard H. Yang | 2025-04-02 | 下载 | Frequent parameter exchanges between clients and the edge server incur substantial communication overhead, posing a critical bottleneck in federated learning (FL). |
| Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design | Mohan Zhang, Pingzhi Li, Jie Peng, Mufan Qiu, Tianlong Chen | 2025-04-02 | 下载 | Mixture-of-Experts (MoE) has successfully scaled up models while maintaining nearly constant computing costs. By employing a gating network to route input tokens, it selectively activates a subset of ... |
| GigaAPI for GPU Parallelization | M. Suvarna, O. Tehrani | 2025-04-02 | 下载 | GigaAPI is a user-space API that simplifies multi-GPU programming, bridging the gap between the capabilities of parallel GPU systems and the ability of developers to harness their full potential. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| FastFlow: Early Yet Robust Network Flow Classification using the Minimal Number of Time-Series Packets | Rushi Jayeshkumar Babaria, Minzhao Lyu, Gustavo Batista, Vijay Sivaraman | 2025-04-02 | 下载 | Network traffic classification is of great importance for network operators in their daily routines, such as analyzing the usage patterns of multimedia applications and optimizing network configuratio... |
| Toward a Sustainable Low-Altitude Economy: A Survey of Energy-Efficient RIS-UAV Networks | Manzoor Ahmed, Aized Amin Soofi, Feroz Khan, Salman Raza, Wali Ullah Khan, Lina Su, Fang Xu, Zhu Han | 2025-04-02 | 下载 | The integration of RIS into UAV networks presents a transformative solution for achieving energy-efficient and reliable communication, particularly within the rapidly expanding low-altitude economy (L... |
| Asynchronous Traffic Shaping and Redundancy: Avoiding Unbounded Latencies in In-Car Networks | Teresa Lübeck, Philipp Meyer, Timo Häckel, Franz Korf, Thomas C. Schmidt | 2025-04-02 | 下载 | Time-Sensitive Networking enhances Ethernet-based In-Vehicle Networks (IVNs) with real-time capabilities. Different traffic shaping algorithms have been proposed for time-critical communication, of wh... |
| Strengthening Multi-Robot Systems for SAR: Co-Designing Robotics and Communication Towards 6G | Juan Bravo-Arrabal, Ricardo Vázquez-Martín, J. J. Fernández-Lozano, Alfonso García-Cerezo | 2025-04-02 | 下载 | This paper presents field-tested use cases from Search and Rescue (SAR) missions, highlighting the co-design of mobile robots and communication systems to support Edge-Cloud architectures based on 5G ... |
| A Deep Incremental Framework for Multi-Service Multi-Modal Devices in NextG AI-RAN Systems | Mrityunjoy Gain, Kitae Kim, Avi Deb Raha, Apurba Adhikary, Walid Saad, Zhu Han, Choong Seon Hong | 2025-04-02 | 下载 | In this paper, we propose a deep incremental framework for efficient RAN management, introducing the Multi-Service-Modal UE (MSMU) system, which enables a single UE to handle eMBB and uRLLC services s... |
| Satellite Edge Artificial Intelligence with Large Models: Architectures and Technologies | Yuanming Shi, Jingyang Zhu, Chunxiao Jiang, Linling Kuang, Khaled B. Letaief | 2025-04-02 | 下载 | Driven by the growing demand for intelligent remote sensing applications, large artificial intelligence (AI) models pre-trained on large-scale unlabeled datasets and fine-tuned for downstream tasks ha... |
| Optimization of BLE Broadcast Mode in Offline Finding Network | L Zhang, C Feng, T Xia | 2025-04-02 | 下载 | In the Offline Finding Network(OFN), offline Bluetooth tags broadcast to the surrounding area, the finder devices receiving the broadcast signal and upload location information to the IoT(Internet of ... |
| Balancing Subjectivity and Objectivity in Network Selection: A Decision-Making Framework Towards Digital Twins | Brahim Mefgouda, Hanen Idoudi, Mohammad Al-Quraan, Ismail Lotfi, Omar Alhussein, Lina Mohjazi, Sami Muhaidat | 2025-04-02 | 下载 | Selecting the optimal radio access technology (RAT) during vertical handovers (VHO) in heterogeneous wireless networks (HWNs) is critical. Multi-attribute decision-making (MADM) is the most common app... |
| The Multifractal IP Address Structure: Physical Explanation and Implications | Chris Misa, Ram Durairajan, Arpit Gupta, Reza Rejaie, Walter Willinger | 2025-04-02 | 下载 | The structure of IP addresses observed in Internet traffic plays a critical role for a wide range of networking problems of current interest. For example, modern network telemetry systems that take ad... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Enhancing Traffic Sign Recognition On The Performance Based On Yolov8 | Baba Ibrahim, Zhou Kui | 2025-04-02 | 下载 | This paper Traffic sign recognition plays a crucial role in the development of autonomous vehicles and advanced driver-assistance systems (ADAS). |
| GigaAPI for GPU Parallelization | M. Suvarna, O. Tehrani | 2025-04-02 | 下载 | GigaAPI is a user-space API that simplifies multi-GPU programming, bridging the gap between the capabilities of parallel GPU systems and the ability of developers to harness their full potential. |