
2024-12-31

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Enabling New HDLs with Agents | Mark Zakharov, Farzaneh Rabiei Kashanaki, Jose Renau | 2024-12-31 | Download | Large Language Models (LLMs) based agents are transforming the programming language landscape by facilitating learning for beginners, enabling code generation, and optimizing documentation workflows. |
| 12-bit Delta-Sigma ADC operating at a temperature of up to 250C in Standard 0.18 μm SOI CMOS | Christian Sbrana, Alessandro Catania, Tommaso Toschi, Sebastiano Strangio, Giuseppe Iannaccone | 2024-12-31 | Download | Some applications require electronic systems to operate at extremely high temperature. Extending the operating temperature range of automotive-grade CMOS processes -- through the use of dedicated desi... |
| Q3DE: A fault-tolerant quantum computer architecture for multi-bit burst errors by cosmic rays | Yasunari Suzuki, Takanori Sugiyama, Tomochika Arai, Wang Liao, Koji Inoue, Teruo Tanimoto | 2024-12-31 | Download | Demonstrating small error rates by integrating quantum error correction (QEC) into an architecture of quantum computing is the next milestone towards scalable fault-tolerant quantum computing (FTQC). |
| Debunking the CUDA Myth Towards GPU-based AI Systems | Yunjae Lee, Juntaek Lim, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu | 2024-12-31 | Download | This paper presents a comprehensive evaluation of Intel Gaudi NPUs as an alternative to NVIDIA GPUs, which is currently the de facto standard in AI system design. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| UPC Sentinel: An Accurate Approach for Detecting Upgradeability Proxy Contracts in Ethereum | Amir M. Ebrahimi, Bram Adams, Gustavo A. Oliva, Ahmed E. Hassan | 2024-12-31 | Download | Software applications that run on a blockchain platform are known as DApps. DApps are built using smart contracts, which are immutable after deployment. |
| Impossibility of Self-Organized Aggregation without Computation | Roy Steinberg, Kiril Solovey | 2024-12-31 | Download | In their seminal work, Gauci et al. (2014) studied the fundamental task of aggregation, wherein multiple robots need to gather without an a priori agreed-upon meeting location, using minimal hardware. |
| Constant Degree Networks for Almost-Everywhere Reliable Transmission | Mitali Bafna, Dor Minzer | 2024-12-31 | Download | In the almost-everywhere reliable message transmission problem, introduced by [Dwork, Pippenger, Peleg, Upfal'86], the goal is to design a sparse communication network G that supports efficient, fau... |
| Performant Automatic BLAS Offloading on Unified Memory Architecture with OpenMP First-Touch Style Data Movement | Junjie Li | 2024-12-31 | Download | BLAS is a fundamental building block of advanced linear algebra libraries and many modern scientific computing applications. GPUs are known for their strong arithmetic computing capabilities and are h... |
| Towards Sustainable Large Language Model Serving | Sophia Nguyen, Beihao Zhou, Yi Ding, Sihang Liu | 2024-12-31 | Download | In this work, we study LLMs from a carbon emission perspective, addressing both operational and embodied emissions, and paving the way for sustainable LLM serving. |
| FedCod: An Efficient Communication Protocol for Cross-Silo Federated Learning with Coding | Peishen Yan, Jun Li, Hao Wang, Tao Song, Yang Hua, Lu Peng, Haihui Zhou, Haibing Guan | 2024-12-31 | Download | Federated Learning (FL) is an innovative distributed machine learning paradigm that enables multiple parties to collaboratively train a model without sharing their raw data, thereby preserving data pr... |
| OciorMVBA: Near-Optimal Error-Free Asynchronous MVBA | Jinyuan Chen | 2024-12-31 | Download | In this work, we propose an error-free, information-theoretically secure, asynchronous multi-valued validated Byzantine agreement (MVBA) protocol, called OciorMVBA. |
| Debunking the CUDA Myth Towards GPU-based AI Systems | Yunjae Lee, Juntaek Lim, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu | 2024-12-31 | Download | This paper presents a comprehensive evaluation of Intel Gaudi NPUs as an alternative to NVIDIA GPUs, which is currently the de facto standard in AI system design. |
| Parallel I/O Characterization and Optimization on Large-Scale HPC Systems: A 360-Degree Survey | Hammad Ather, Jean Luca Bez, Chen Wang, Hank Childs, Allen D. Malony, Suren Byna | 2024-12-31 | Download | Driven by artificial intelligence, data science, and high-resolution simulations, I/O workloads and hardware on high-performance computing (HPC) systems have become increasingly complex. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| So Timely, Yet So Stale: The Impact of Clock Drift in Real-Time Systems | Mehrdad Salimnejad, Nikolaos Pappas, Marios Kountouris | 2024-12-31 | Download | In this paper, we address the problem of timely delivery of status update packets in a real-time communication system, where a transmitter sends status updates generated by a source to a receiver over... |
| Toward Digital Network Twins: Integrating Sionna RT in ns-3 for 6G Multi-RAT Networks Simulations | Roberto Pegurri, Francesco Linsalata, Eugenio Moro, Jakob Hoydis, Umberto Spagnolini | 2024-12-31 | Download | The increasing complexity of 6G systems demands innovative tools for network management, simulation, and optimization. This work introduces the integration of ns-3 with Sionna RT, establishing the fou... |
| The Space above the Sky: Uniting Global-Scale Ground Station as a Service for Efficient Orbital Data Processing | Heng Zhao, Sheng Cen, Yifei Zhu | 2024-12-31 | Download | Large constellations of Earth Observation Low Earth Orbit satellites collect enormous amounts of image data every day. This amount of data needs to be transferred to data centers for processing via gr... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Combining Type Checking and Formal Verification for Lightweight OS Correctness | Ramla Ijaz, Kevin Boos, Lin Zhong | 2024-12-31 | Download | This paper reports our experience of providing lightweight correctness guarantees to an open-source Rust OS, Theseus. First, we report new developments in intralingual design that leverage Rust's type... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Performant Automatic BLAS Offloading on Unified Memory Architecture with OpenMP First-Touch Style Data Movement | Junjie Li | 2024-12-31 | Download | BLAS is a fundamental building block of advanced linear algebra libraries and many modern scientific computing applications. GPUs are known for their strong arithmetic computing capabilities and are h... |
| Parallel I/O Characterization and Optimization on Large-Scale HPC Systems: A 360-Degree Survey | Hammad Ather, Jean Luca Bez, Chen Wang, Hank Childs, Allen D. Malony, Suren Byna | 2024-12-31 | Download | Driven by artificial intelligence, data science, and high-resolution simulations, I/O workloads and hardware on high-performance computing (HPC) systems have become increasingly complex. |
