2024-12-06

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Branch Target Buffer Reverse Engineering on Arm | Junpeng Wan | 2024-12-06 | Download | The Branch Target Buffer (BTB) plays a critical role in efficient CPU branch prediction. Understanding the design and implementation of the BTB provides valuable insights for both compiler design and ... |
| HiVeGen -- Hierarchical LLM-based Verilog Generation for Scalable Chip Design | Jinwei Tang, Jiayin Qin, Kiran Thorat, Chen Zhu-Tian, Yu Cao, Yang, Zhao, Caiwen Ding | 2024-12-06 | Download | With Large Language Models (LLMs) recently demonstrating impressive proficiency in code generation, it is promising to extend their abilities to Hardware Description Language (HDL). |
| Gaze into the Pattern: Characterizing Spatial Patterns with Internal Temporal Correlations for Hardware Prefetching | Zixiao Chen, Chentao Wu, Yunfei Gu, Ranhao Jia, Jie Li, Minyi Guo | 2024-12-06 | Download | Hardware prefetching is one of the most widely-used techniques for hiding long data access latency. To address the challenges faced by hardware prefetching, architects have proposed to detect and expl... |
| Hard Math -- Easy UVM: Pragmatic solutions for verifying hardware algorithms using UVM | Mark Litterick, Aleksandar Ivankovic, Bojan Arsov, Aman Kumar | 2024-12-06 | Download | This paper presents pragmatic solutions for verifying complex mathematical algorithms implemented in hardware in an efficient and effective manner. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| FogROS2-FT: Fault Tolerant Cloud Robotics | Kaiyuan Chen, Kush Hari, Trinity Chung, Michael Wang, Nan Tian, Christian Juette, Jeffrey Ichnowski, Liu Ren, John Kubiatowicz, Ion Stoica, Ken Goldberg | 2024-12-06 | Download | Cloud robotics enables robots to offload complex computational tasks to cloud servers for performance and ease of management. However, cloud compute can be costly, cloud services can suffer occasional... |
| An Experimental Framework for Implementing Decentralized Autonomous Database Systems in Rust | Prakash Aryan, Radhika Khatri, Vijayakumar Balakrishnan | 2024-12-06 | Download | This paper presents an experimental framework for implementing Decentralized Autonomous Database Systems (DADBS) using the Rust programming language. |
| From Theory to Practice: Demonstrators of FAIR Data Spaces Across Different Sectors | Nikolaus Glombiewski, Zeyd Boukhers, Christian Beilschmidt, Johannes Drönner, Michael Mattig, Artur Piet, Robert Pietrzynski, Mehrshad Jaberansary, Macedo Maia, Sebastian Beyvers, Yeliz Üçer Yediel, Muhammad Hamza Akhtar, Heiner Oberkampf, Jonathan Hartman, Bernhard Seeger, Christoph Lange | 2024-12-06 | Download | The principles of data spaces for sovereign data exchange across trusted organizations have so far mainly been adopted in business-to-business settings, and recently scaled to cloud environments. |
| NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing | Fei Gao, Ming Hu, Zhiyu Xie, Peichang Shi, Xiaofei Xie, Guodong Yi, Huaimin Wang | 2024-12-06 | Download | With advancements in AI infrastructure and Trusted Execution Environment (TEE) technology, Federated Learning as a Service (FLaaS) through JointCloud Computing (JCC) is promising to break through the ... |
| Distributed Massive MIMO-Aided Task Offloading in Satellite-Terrestrial Integrated Multi-Tier VEC Networks | Yixin Liu, Shaoling Liang, Kunlun Wang, Wen Chen, Yonghui Li, George K. Karagiannidis | 2024-12-06 | Download | This paper proposes a distributed massive multiple-input multiple-output (DM-MIMO) aided multi-tier vehicular edge computing (VEC) system. In particular, each vehicle terminal (VT) offloads its comput... |
| Overlay Network Construction: Improved Overall and Node-Wise Message Complexity | Yi-Jun Chang, Yanyu Chen, Gopinath Mishra | 2024-12-06 | Download | We consider the problem of constructing distributed overlay networks, where nodes in a reconfigurable system can create or sever connections with nodes whose identifiers they know. |
| Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs | Kun Wu | 2024-12-06 | Download | As deep learning models scale, their training cost has surged significantly. Due to both hardware advancements and limitations in current software stacks, the need for data efficiency has risen. |
| DRDST: Low-latency DAG Consensus through Robust Dynamic Sharding and Tree-broadcasting for IoV | Runhua Chen, Haoxiang Luo, Gang Sun, Hongfang Yu, Dusit Niyato, Schahram Dustdar | 2024-12-06 | Download | The Internet of Vehicles (IoV) is emerging as a pivotal technology for enhancing traffic management and safety. Its rapid development demands solutions for enhanced communication efficiency and reduce... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| FogROS2-FT: Fault Tolerant Cloud Robotics | Kaiyuan Chen, Kush Hari, Trinity Chung, Michael Wang, Nan Tian, Christian Juette, Jeffrey Ichnowski, Liu Ren, John Kubiatowicz, Ion Stoica, Ken Goldberg | 2024-12-06 | Download | Cloud robotics enables robots to offload complex computational tasks to cloud servers for performance and ease of management. However, cloud compute can be costly, cloud services can suffer occasional... |
| Enhanced 5G/B5G Network Planning/Optimization deploying RIS in Urban/Outdoor Scenarios | Valdemar Farré, Juan C. Estrada-Jiménez, José D. Vega Sánchez, Juan A. Vasquez-Peralvo, Symeon Chatzinotas | 2024-12-06 | Download | In recent years, the fifth-generation (5G) mobile network has been developed worldwide to remarkably improve network performance and spectral efficiency. |
| Location-Driven Programmable Wireless Environments through Light-emitting RIS (LeRIS) | Dimitrios Bozanis, Dimitrios Tyrovolas, Vasilis K. Papanikolaou, Sotiris A. Tegos, Panagiotis D. Diamantoulakis, Christos K. Liaskos, Robert Schober, George K. Karagiannidis | 2024-12-06 | Download | As 6G wireless networks seek to enable robust and dynamic programmable wireless environments (PWEs), reconfigurable intelligent surfaces (RISs) have emerged as a cornerstone for controlling electromag... |
| Self-Organizing Complex Networks with AI-Driven Adaptive Nodes for Optimized Connectivity and Energy Efficiency | Azra Seyyedi, Mahdi Bohlouli, SeyedEhsan Nedaaee Oskoee | 2024-12-06 | Download | High connectivity and robustness are critical requirements in distributed networks, as they ensure resilience, efficient communication, and adaptability in dynamic environments. |
| NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing | Fei Gao, Ming Hu, Zhiyu Xie, Peichang Shi, Xiaofei Xie, Guodong Yi, Huaimin Wang | 2024-12-06 | Download | With advancements in AI infrastructure and Trusted Execution Environment (TEE) technology, Federated Learning as a Service (FLaaS) through JointCloud Computing (JCC) is promising to break through the ... |
| Neural Representation for Wireless Radiation Field Reconstruction: A 3D Gaussian Splatting Approach | Chaozheng Wen, Jingwen Tong, Yingdong Hu, Zehong Lin, Jun Zhang | 2024-12-06 | Download | Wireless channel modeling plays a pivotal role in designing, analyzing, and optimizing wireless communication systems. Nevertheless, developing an effective channel modeling approach has been a long-s... |
| DRDST: Low-latency DAG Consensus through Robust Dynamic Sharding and Tree-broadcasting for IoV | Runhua Chen, Haoxiang Luo, Gang Sun, Hongfang Yu, Dusit Niyato, Schahram Dustdar | 2024-12-06 | Download | The Internet of Vehicles (IoV) is emerging as a pivotal technology for enhancing traffic management and safety. Its rapid development demands solutions for enhanced communication efficiency and reduce... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| APOLLO: SGD-like Memory, AdamW-level Performance | Hanqing Zhu, Zhenyu Zhang, Wenyan Cong, Xi Liu, Sem Park, Vikas Chandra, Bo Long, David Z. Pan, Zhangyang Wang, Jinwon Lee | 2024-12-06 | Download | Large language models (LLMs) are notoriously memory-intensive during training, particularly with the popular AdamW optimizer. This memory burden necessitates using more or higher-end GPUs or reducing ... |
| One-Hop Sub-Query Result Caches for Graph Database Systems | Hieu Nguyen, Jun Li, Shahram Ghandeharizadeh | 2024-12-06 | Download | This paper introduces a novel one-hop sub-query result cache for processing graph read transactions, gR-Txs, in a graph database system. The one-hop navigation is from a vertex using either its in-com... |
