Appearance
2024-12-06
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Branch Target Buffer Reverse Engineering on Arm | Junpeng Wan | 2024-12-06 | 下载 | The Branch Target Buffer (BTB) plays a critical role in efficient CPU branch prediction. Understanding the design and implementation of the BTB provides valuable insights for both compiler design and ... |
| HiVeGen -- Hierarchical LLM-based Verilog Generation for Scalable Chip Design | Jinwei Tang, Jiayin Qin, Kiran Thorat, Chen Zhu-Tian, Yu Cao, Yang, Zhao, Caiwen Ding | 2024-12-06 | 下载 | With Large Language Models (LLMs) recently demonstrating impressive proficiency in code generation, it is promising to extend their abilities to Hardware Description Language (HDL). |
| Gaze into the Pattern: Characterizing Spatial Patterns with Internal Temporal Correlations for Hardware Prefetching | Zixiao Chen, Chentao Wu, Yunfei Gu, Ranhao Jia, Jie Li, Minyi Guo | 2024-12-06 | 下载 | Hardware prefetching is one of the most widely-used techniques for hiding long data access latency. To address the challenges faced by hardware prefetching, architects have proposed to detect and expl... |
| Hard Math -- Easy UVM: Pragmatic solutions for verifying hardware algorithms using UVM | Mark Litterick, Aleksandar Ivankovic, Bojan Arsov, Aman Kumar | 2024-12-06 | 下载 | This paper presents pragmatic solutions for verifying complex mathematical algorithms implemented in hardware in an efficient and effective manner. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| FogROS2-FT: Fault Tolerant Cloud Robotics | Kaiyuan Chen, Kush Hari, Trinity Chung, Michael Wang, Nan Tian, Christian Juette, Jeffrey Ichnowski, Liu Ren, John Kubiatowicz, Ion Stoica, Ken Goldberg | 2024-12-06 | 下载 | Cloud robotics enables robots to offload complex computational tasks to cloud servers for performance and ease of management. However, cloud compute can be costly, cloud services can suffer occasional... |
| An Experimental Framework for Implementing Decentralized Autonomous Database Systems in Rust | Prakash Aryan, Radhika Khatri, Vijayakumar Balakrishnan | 2024-12-06 | 下载 | This paper presents an experimental framework for implementing Decentralized Autonomous Database Systems (DADBS) using the Rust programming language. |
| From Theory to Practice: Demonstrators of FAIR Data Spaces Across Different Sectors | Nikolaus Glombiewski, Zeyd Boukhers, Christian Beilschmidt, Johannes Drönner, Michael Mattig, Artur Piet, Robert Pietrzynski, Mehrshad Jaberansary, Macedo Maia, Sebastian Beyvers, Yeliz Üçer Yediel, Muhammad Hamza Akhtar, Heiner Oberkampf, Jonathan Hartman, Bernhard Seeger, Christoph Lange | 2024-12-06 | 下载 | The principles of data spaces for sovereign data exchange across trusted organizations have so far mainly been adopted in business-to-business settings, and recently scaled to cloud environments. |
| NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing | Fei Gao, Ming Hu, Zhiyu Xie, Peichang Shi, Xiaofei Xie, Guodong Yi, Huaimin Wang | 2024-12-06 | 下载 | With advancements in AI infrastructure and Trusted Execution Environment (TEE) technology, Federated Learning as a Service (FLaaS) through JointCloud Computing (JCC) is promising to break through the ... |
| Distributed Massive MIMO-Aided Task Offloading in Satellite-Terrestrial Integrated Multi-Tier VEC Networks | Yixin Liu, Shaoling Liang, Kunlun Wang, Wen Chen, Yonghui Li, George K. Karagiannidis | 2024-12-06 | 下载 | This paper proposes a distributed massive multiple input multiple-output (DM-MIMO) aided multi-tier vehicular edge computing (VEC) system. In particular, each vehicle terminal (VT) offloads its comput... |
| Overlay Network Construction: Improved Overall and Node-Wise Message Complexity | Yi-Jun Chang, Yanyu Chen, Gopinath Mishra | 2024-12-06 | 下载 | We consider the problem of constructing distributed overlay networks, where nodes in a reconfigurable system can create or sever connections with nodes whose identifiers they know. |
| Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs | Kun Wu | 2024-12-06 | 下载 | As deep learning models scale, their training cost has surged significantly. Due to both hardware advancements and limitations in current software stacks, the need for data efficiency has risen. |
| DRDST: Low-latency DAG Consensus through Robust Dynamic Sharding and Tree-broadcasting for IoV | Runhua Chen, Haoxiang Luo, Gang Sun, Hongfang Yu, Dusit Niyato, Schahram Dustdar | 2024-12-06 | 下载 | The Internet of Vehicles (IoV) is emerging as a pivotal technology for enhancing traffic management and safety. Its rapid development demands solutions for enhanced communication efficiency and reduce... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| FogROS2-FT: Fault Tolerant Cloud Robotics | Kaiyuan Chen, Kush Hari, Trinity Chung, Michael Wang, Nan Tian, Christian Juette, Jeffrey Ichnowski, Liu Ren, John Kubiatowicz, Ion Stoica, Ken Goldberg | 2024-12-06 | 下载 | Cloud robotics enables robots to offload complex computational tasks to cloud servers for performance and ease of management. However, cloud compute can be costly, cloud services can suffer occasional... |
| Enhanced 5G/B5G Network Planning/Optimization deploying RIS in Urban/Outdoor Scenarios | Valdemar Farré, Juan C. Estrada-Jiménez, José D. Vega Sánchez, Juan A. Vasquez-Peralvo, Symeon Chatzinotas | 2024-12-06 | 下载 | In recent years, the fifth-generation (5G) mobile network has been developed worldwide to remarkably improve network performance and spectral efficiency. |
| Location-Driven Programmable Wireless Environments through Light-emitting RIS (LeRIS) | Dimitrios Bozanis, Dimitrios Tyrovolas, Vasilis K. Papanikolaou, Sotiris A. Tegos, Panagiotis D. Diamantoulakis, Christos K. Liaskos, Robert Schober, George K. Karagiannidis | 2024-12-06 | 下载 | As 6G wireless networks seek to enable robust and dynamic programmable wireless environments (PWEs), reconfigurable intelligent surfaces (RISs) have emerged as a cornerstone for controlling electromag... |
| Self-Organizing Complex Networks with AI-Driven Adaptive Nodes for Optimized Connectivity and Energy Efficiency | Azra Seyyedi, Mahdi Bohlouli, SeyedEhsan Nedaaee Oskoee | 2024-12-06 | 下载 | High connectivity and robustness are critical requirements in distributed networks, as they ensure resilience, efficient communication, and adaptability in dynamic environments. |
| NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing | Fei Gao, Ming Hu, Zhiyu Xie, Peichang Shi, Xiaofei Xie, Guodong Yi, Huaimin Wang | 2024-12-06 | 下载 | With advancements in AI infrastructure and Trusted Execution Environment (TEE) technology, Federated Learning as a Service (FLaaS) through JointCloud Computing (JCC) is promising to break through the ... |
| Neural Representation for Wireless Radiation Field Reconstruction: A 3D Gaussian Splatting Approach | Chaozheng Wen, Jingwen Tong, Yingdong Hu, Zehong Lin, Jun Zhang | 2024-12-06 | 下载 | Wireless channel modeling plays a pivotal role in designing, analyzing, and optimizing wireless communication systems. Nevertheless, developing an effective channel modeling approach has been a long-s... |
| DRDST: Low-latency DAG Consensus through Robust Dynamic Sharding and Tree-broadcasting for IoV | Runhua Chen, Haoxiang Luo, Gang Sun, Hongfang Yu, Dusit Niyato, Schahram Dustdar | 2024-12-06 | 下载 | The Internet of Vehicles (IoV) is emerging as a pivotal technology for enhancing traffic management and safety. Its rapid development demands solutions for enhanced communication efficiency and reduce... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| APOLLO: SGD-like Memory, AdamW-level Performance | Hanqing Zhu, Zhenyu Zhang, Wenyan Cong, Xi Liu, Sem Park, Vikas Chandra, Bo Long, David Z. Pan, Zhangyang Wang, Jinwon Lee | 2024-12-06 | 下载 | Large language models (LLMs) are notoriously memory-intensive during training, particularly with the popular AdamW optimizer. This memory burden necessitates using more or higher-end GPUs or reducing ... |
| One-Hop Sub-Query Result Caches for Graph Database Systems | Hieu Nguyen, Jun Li, Shahram Ghandeharizadeh | 2024-12-06 | 下载 | This paper introduces a novel one-hop sub-query result cache for processing graph read transactions, gR-Txs, in a graph database system. The one-hop navigation is from a vertex using either its in-com... |