Skip to content

2024-09-11

cs.AR - Architecture

标题作者发布日期PDF摘要
Extracting TCPIP Headers at High Speed for the Anonymized Network Traffic Graph ChallengeZhaoyang Han, Andrew Briasco-Stewart, Michael Zink, Miriam Leeser2024-09-11下载Field Programmable Gate Arrays (FPGAs) play a significant role in computationally intensive network processing due to their flexibility and efficiency.
Next-generation Probabilistic Computing Hardware with 3D MOSAICs, Illusion Scale-up, and Co-designTathagata Srimani, Robert Radway, Masoud Mohseni, Kerem Çamsarı, Subhasish Mitra2024-09-11下载The vast majority of 21st century AI workloads are based on gradient-based deterministic algorithms such as backpropagation. One of the key reasons for the dominance of deterministic ML algorithms is ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Scoping Sustainable Collaborative Mixed RealityYasra Chandio, Noman Bashir, Tian Guo, Elsa Olivetti, Fatima Anwar2024-09-11下载Mixed Reality (MR) is becoming ubiquitous as it finds its applications in education, healthcare, and other sectors beyond leisure. While MR end devices, such as headsets, have low energy intensity, th...
HERL: Tiered Federated Learning with Adaptive Homomorphic Encryption using Reinforcement LearningJiaxang Tang, Zeshan Fayyaz, Mohammad A. Salahuddin, Raouf Boutaba, Zhi-Li Zhang, Ali Anwar2024-09-11下载Federated Learning is a well-researched approach for collaboratively training machine learning models across decentralized data while preserving privacy.
MPPI-Generic: A CUDA Library for Stochastic Trajectory OptimizationBogdan Vlahov, Jason Gibson, Manan Gandhi, Evangelos A. Theodorou2024-09-11下载This paper introduces a new C++/CUDA library for GPU-accelerated stochastic optimization called MPPI-Generic. It provides implementations of Model Predictive Path Integral control, Tube-Model Predicti...
Federated Impression for Learning with Distributed Heterogeneous DataAtrin Arya, Sana Ayromlou, Armin Saadat, Purang Abolmaesumi, Xiaoxiao Li2024-09-11下载Standard deep learning-based classification approaches may not always be practical in real-world clinical applications, as they require a centralized collection of all samples.
Next-generation Probabilistic Computing Hardware with 3D MOSAICs, Illusion Scale-up, and Co-designTathagata Srimani, Robert Radway, Masoud Mohseni, Kerem Çamsarı, Subhasish Mitra2024-09-11下载The vast majority of 21st century AI workloads are based on gradient-based deterministic algorithms such as backpropagation. One of the key reasons for the dominance of deterministic ML algorithms is ...
Optimizing the Weather Research and Forecasting Model with OpenMP Offload and CodeeChayanon, Wichitrnithed, Woo-Sun-Yang, Yun, He, Brad Richardson, Koichi Sakaguchi, Manuel Arenaz, William I. Gustafson, Jacob Shpund, Ulises Costi Blanco, Alvaro Goldar Dieste2024-09-11下载Currently, the Weather Research and Forecasting model (WRF) utilizes shared memory (OpenMP) and distributed memory (MPI) parallelisms. To take advantage of GPU resources on the Perlmutter supercompute...
Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPUZhenyu Ning, Jieru Zhao, Qihao Jin, Wenchao Ding, Minyi Guo2024-09-11下载Multimodal Large Language Models (MLLMs) are distinguished by their multimodal comprehensive ability and widely used in many real-world applications including GPT-4o, autonomous driving and robotics.
Data Backup System with No Impact on Business Processing Utilizing Storage and Container TechnologiesSatoru Watanabe2024-09-11下载Data backup is a core technology for improving system resilience to system failures. Data backup in enterprise systems is required to minimize the impacts on business processing, which can be categori...
Distributed Convolutional Neural Network Training on Mobile and Edge ClustersPranav Rama, Madison Threadgill, Andreas Gerstlauer2024-09-11下载The training of deep and/or convolutional neural networks (DNNs/CNNs) is traditionally done on servers with powerful CPUs and GPUs. Recent efforts have emerged to localize machine learning tasks fully...
Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional GeneratorKangyang Luo, Shuai Wang, Xiang Li, Yunshi Lan, Ming Gao, Jinlong Shu2024-09-11下载Federated Learning (FL) is gaining popularity as a distributed learning framework that only shares model parameters or gradient updates and keeps private data locally.
FreeRide: Harvesting Bubbles in Pipeline ParallelismJiashu Zhang, Zihan Pan, Molly, Xu, Khuzaima Daudjee, Sihang Liu2024-09-11下载The occurrence of bubbles in pipeline parallelism is an inherent limitation that can account for more than 40% of the large language model (LLM) training time and is one of the main reasons for the un...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Echoes of Privacy: Uncovering the Profiling Practices of Voice AssistantsTina Khezresmaeilzadeh, Elaine Zhu, Kiersten Grieco, Daniel J. Dubois, Konstantinos Psounis, David Choffnes2024-09-11下载Many companies, including Google, Amazon, and Apple, offer voice assistants as a convenient solution for answering general voice queries and accessing their services.
Extracting TCPIP Headers at High Speed for the Anonymized Network Traffic Graph ChallengeZhaoyang Han, Andrew Briasco-Stewart, Michael Zink, Miriam Leeser2024-09-11下载Field Programmable Gate Arrays (FPGAs) play a significant role in computationally intensive network processing due to their flexibility and efficiency.
Extensions to BIER Tree Engineering (BIER-TE) for Large Multicast Domains and 1:1 Protection: Concept, Implementation and PerformanceMoritz Flüchter, Steffen Lindner, Fabian Ihle, Toerless Eckert, Michael Menth2024-09-11下载Bit Index Explicit Replication (BIER) has been proposed by the IETF as a stateless multicast transport technology. BIER adds a BIER header containing a bitstring indicating receivers of an IP multicas...
Synchronization Control-Plane Protocol for Quantum Link LayerBrandon Ru, Winston K. G. Seah, Alvin C. Valera2024-09-11下载Heralded entanglement generation between nodes of a future quantum internet is a fundamental operation that unlocks the potential for quantum communication.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
SafeBPF: Hardware-assisted Defense-in-depth for eBPF Kernel ExtensionsSoo Yee Lim, Tanya Prasad, Xueyuan Han, Thomas Pasquier2024-09-11下载The eBPF framework enables execution of user-provided code in the Linux kernel. In the last few years, a large ecosystem of cloud services has leveraged eBPF to enhance container security, system obse...

cs.PF - Performance

标题作者发布日期PDF摘要
Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPUZhenyu Ning, Jieru Zhao, Qihao Jin, Wenchao Ding, Minyi Guo2024-09-11下载Multimodal Large Language Models (MLLMs) are distinguished by their multimodal comprehensive ability and widely used in many real-world applications including GPT-4o, autonomous driving and robotics.

基于 VitePress 构建