2024-12-31

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Enabling New HDLs with Agents	Mark Zakharov, Farzaneh Rabiei Kashanaki, Jose Renau	2024-12-31	下载	Large Language Models (LLMs) based agents are transforming the programming language landscape by facilitating learning for beginners, enabling code generation, and optimizing documentation workflows.
12-bit Delta-Sigma ADC operating at a temperature of up to 250C in Standard 0.18 μm SOI CMOS	Christian Sbrana, Alessandro Catania, Tommaso Toschi, Sebastiano Strangio, Giuseppe Iannaccone	2024-12-31	下载	Some applications require electronic systems to operate at extremely high temperature. Extending the operating temperature range of automotive-grade CMOS processes -- through the use of dedicated desi...
Q3DE: A fault-tolerant quantum computer architecture for multi-bit burst errors by cosmic rays	Yasunari Suzuki, Takanori Sugiyama, Tomochika Arai, Wang Liao, Koji Inoue, Teruo Tanimoto	2024-12-31	下载	Demonstrating small error rates by integrating quantum error correction (QEC) into an architecture of quantum computing is the next milestone towards scalable fault-tolerant quantum computing (FTQC).
Debunking the CUDA Myth Towards GPU-based AI Systems	Yunjae Lee, Juntaek Lim, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu	2024-12-31	下载	This paper presents a comprehensive evaluation of Intel Gaudi NPUs as an alternative to NVIDIA GPUs, which is currently the de facto standard in AI system design.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
UPC Sentinel: An Accurate Approach for Detecting Upgradeability Proxy Contracts in Ethereum	Amir M. Ebrahimi, Bram Adams, Gustavo A. Oliva, Ahmed E. Hassan	2024-12-31	下载	Software applications that run on a blockchain platform are known as DApps. DApps are built using smart contracts, which are immutable after deployment.
Impossibility of Self-Organized Aggregation without Computation	Roy Steinberg, Kiril Solovey	2024-12-31	下载	In their seminal work, Gauci et al. (2014) studied the fundamental task of aggregation, wherein multiple robots need to gather without an a priori agreed-upon meeting location, using minimal hardware.
Constant Degree Networks for Almost-Everywhere Reliable Transmission	Mitali Bafna, Dor Minzer	2024-12-31	下载	In the almost-everywhere reliable message transmission problem, introduced by [Dwork, Pippenger, Peleg, Upfal'86], the goal is to design a sparse communication network $G$ that supports efficient, fau...
Performant Automatic BLAS Offloading on Unified Memory Architecture with OpenMP First-Touch Style Data Movement	Junjie Li	2024-12-31	下载	BLAS is a fundamental building block of advanced linear algebra libraries and many modern scientific computing applications. GPUs are known for their strong arithmetic computing capabilities and are h...
Towards Sustainable Large Language Model Serving	Sophia Nguyen, Beihao Zhou, Yi Ding, Sihang Liu	2024-12-31	下载	In this work, we study LLMs from a carbon emission perspective, addressing both operational and embodied emissions, and paving the way for sustainable LLM serving.
FedCod: An Efficient Communication Protocol for Cross-Silo Federated Learning with Coding	Peishen Yan, Jun Li, Hao Wang, Tao Song, Yang Hua, Lu Peng, Haihui Zhou, Haibing Guan	2024-12-31	下载	Federated Learning (FL) is an innovative distributed machine learning paradigm that enables multiple parties to collaboratively train a model without sharing their raw data, thereby preserving data pr...
OciorMVBA: Near-Optimal Error-Free Asynchronous MVBA	Jinyuan Chen	2024-12-31	下载	In this work, we propose an error-free, information-theoretically secure, asynchronous multi-valued validated Byzantine agreement (MVBA) protocol, called OciorMVBA.
Debunking the CUDA Myth Towards GPU-based AI Systems	Yunjae Lee, Juntaek Lim, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu	2024-12-31	下载	This paper presents a comprehensive evaluation of Intel Gaudi NPUs as an alternative to NVIDIA GPUs, which is currently the de facto standard in AI system design.
Parallel I/O Characterization and Optimization on Large-Scale HPC Systems: A 360-Degree Survey	Hammad Ather, Jean Luca Bez, Chen Wang, Hank Childs, Allen D. Malony, Suren Byna	2024-12-31	下载	Driven by artificial intelligence, data science, and high-resolution simulations, I/O workloads and hardware on high-performance computing (HPC) systems have become increasingly complex.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
So Timely, Yet So Stale: The Impact of Clock Drift in Real-Time Systems	Mehrdad Salimnejad, Nikolaos Pappas, Marios Kountouris	2024-12-31	下载	In this paper, we address the problem of timely delivery of status update packets in a real-time communication system, where a transmitter sends status updates generated by a source to a receiver over...
Toward Digital Network Twins: Integrating Sionna RT in ns-3 for 6G Multi-RAT Networks Simulations	Roberto Pegurri, Francesco Linsalata, Eugenio Moro, Jakob Hoydis, Umberto Spagnolini	2024-12-31	下载	The increasing complexity of 6G systems demands innovative tools for network management, simulation, and optimization. This work introduces the integration of ns-3 with Sionna RT, establishing the fou...
The Space above the Sky: Uniting Global-Scale Ground Station as a Service for Efficient Orbital Data Processing	Heng Zhao, Sheng Cen, Yifei Zhu	2024-12-31	下载	Large constellations of Earth Observation Low Earth Orbit satellites collect enormous amounts of image data every day. This amount of data needs to be transferred to data centers for processing via gr...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Combining Type Checking and Formal Verification for Lightweight OS Correctness	Ramla Ijaz, Kevin Boos, Lin Zhong	2024-12-31	下载	This paper reports our experience of providing lightweight correctness guarantees to an open-source Rust OS, Theseus. First, we report new developments in intralingual design that leverage Rust's type...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Performant Automatic BLAS Offloading on Unified Memory Architecture with OpenMP First-Touch Style Data Movement	Junjie Li	2024-12-31	下载	BLAS is a fundamental building block of advanced linear algebra libraries and many modern scientific computing applications. GPUs are known for their strong arithmetic computing capabilities and are h...
Parallel I/O Characterization and Optimization on Large-Scale HPC Systems: A 360-Degree Survey	Hammad Ather, Jean Luca Bez, Chen Wang, Hank Childs, Allen D. Malony, Suren Byna	2024-12-31	下载	Driven by artificial intelligence, data science, and high-resolution simulations, I/O workloads and hardware on high-performance computing (HPC) systems have become increasingly complex.