Skip to content

2025-10-29

cs.AR - Architecture

标题作者发布日期PDF摘要
Detecting Anomalies in Machine Learning Infrastructure via Hardware TelemetryZiji Chen, Steven W. D. Chien, Peng Qian, Noa Zilberman2025-10-29下载Modern machine learning (ML) has grown into a tightly coupled, full-stack ecosystem that combines hardware, software, network, and applications.
CHIPSIM: A Co-Simulation Framework for Deep Learning on Chiplet-Based SystemsLukas Pfromm, Alish Kanani, Harsh Sharma, Janardhan Rao Doppa, Partha Pratim Pande, Umit Y. Ogras2025-10-29下载Due to reduced manufacturing yields, traditional monolithic chips cannot keep up with the compute, memory, and communication demands of data-intensive applications, such as rapidly growing deep neural...
Accurate Leakage Speculation for Quantum Error CorrectionChaithanya Naik Mude, Swamit Tannu2025-10-29下载Quantum Error Correction (QEC) protects qubits against bit- and phase-flip errors in the |0> or |1> subspace, but physical qubits can also leak into higher energy levels (e.g., |2>).
PDA-LSTM: Knowledge-driven page data arrangement based on LSTM for LCM supression in QLC 3D NAND flash memoriesQianhui Li, Weiya Wang, Qianqi Zhao, Tong Qu, Jing He, Xuhong Qiang, Jingwen Hou, Ke Chen, Bao Zhang, Qi Wang2025-10-29下载Quarter level cell (QLC) 3D NAND flash memory is emerging as the predominant storage solution in the era of artificial intelligence. QLC 3D NAND flash stores 4 bit per cell to expand the storage densi...
DIRC-RAG: Accelerating Edge RAG with Robust High-Density and High-Loading-Bandwidth Digital In-ReRAM ComputationKunming Shao, Zhipeng Liao, Jiangnan Yu, Liang Zhao, Qiwei Li, Xijie Huang, Jingyu He, Fengshi Tian, Yi Zou, Xiaomeng Wang, Tim Kwang-Ting Cheng, Chi-Ying Tsui2025-10-29下载Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by integrating external knowledge retrieval but faces challenges on edge devices due to high storage, energy, and latency dem...
Silicon-based Josephson junction field-effect transistors enabling cryogenic logic and quantum technologiesYusheng Xiong, Kaveh Delfanazari2025-10-29下载The continuous miniaturisation of metal-oxide-semiconductor field-effect transistors (MOSFETs) from long- to short-channel architectures has advanced beyond the predictions of Moore's Law.
Large Language Model for Verilog Code Generation: Literature Review and the Road AheadGuang Yang, Wei Zheng, Xiang Chen, Dong Liang, Peng Hu, Yukui Yang, Shaohang Peng, Zhenghan Li, Jiahui Feng, Xiao Wei, Kexin Sun, Deyuan Ma, Haotian Cheng, Yiheng Shen, Xing Hu, Terry Yue Zhuo, David Lo2025-10-29下载Code generation has emerged as a critical research area at the intersection of Software Engineering (SE) and Artificial Intelligence (AI), attracting significant attention from both academia and indus...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Detecting Anomalies in Machine Learning Infrastructure via Hardware TelemetryZiji Chen, Steven W. D. Chien, Peng Qian, Noa Zilberman2025-10-29下载Modern machine learning (ML) has grown into a tightly coupled, full-stack ecosystem that combines hardware, software, network, and applications.
Foundations of Fiat-Denominated Loans Collateralized by CryptocurrenciesPavel Hubáček, Jan Václavek, Michelle Yeo2025-10-29下载The rising importance of cryptocurrencies as financial assets pushed their applicability from an object of speculation closer to standard financial instruments such as loans.
Holon Streaming: Global Aggregations with Windowed CRDTsJonas Spenger, Kolya Krafeld, Ruben van Gemeren, Philipp Haller, Paris Carbone2025-10-29下载Scaling global aggregations is a challenge for exactly-once stream processing systems. Current systems implement these either by computing the aggregation in a single task instance, or by static aggre...
Effect of Full Common Randomness Replication in Symmetric PIR on Graph-Based Replicated SystemsShreya Meel, Sennur Ulukus2025-10-29下载We revisit the problem of symmetric private information retrieval (SPIR) in settings where the database replication is modeled by a simple graph.
Distributed Q-learning-based Shortest-Path Tree Construction in IoT Sensor NetworksVan-Vi Vo, Tien-Dung Nguyen, Duc-Tai Le, Hyunseung Choo2025-10-29下载Efficient routing in IoT sensor networks is critical for minimizing energy consumption and latency. Traditional centralized algorithms, such as Dijkstra's, are computationally intensive and ill-suited...
Opt4GPTQ: Co-Optimizing Memory and Computation for 4-bit GPTQ Quantized LLM Inference on Heterogeneous PlatformsYaozheng Zhang, Wei Wang, Jie Kong, Jiehan Zhou, Xianwei Zhang, Huanqing Cui, Han Bao, Yuhai Liu2025-10-29下载The increasing adoption of large language models (LLMs) on heterogeneous computing platforms poses significant challenges to achieving high inference efficiency.
Can Like Attract Like? A Study of Homonymous Gathering in NetworksStéphane Devismes, Yoann Dieudonné, Arnaud Labourel2025-10-29下载A team of mobile agents, starting from distinct nodes of a network, have to meet at the same node and declare that they all met. Agents execute the same algorithm, which they start when activated by a...
Scheduling Data-Intensive Workloads in Large-Scale Distributed Systems: Trends and ChallengesGeorgios L. Stavrinides, Helen D. Karatza2025-10-29下载With the explosive growth of big data, workloads tend to get more complex and computationally demanding. Such applications are processed on distributed interconnected resources that are becoming large...
A Privacy-Preserving Ecosystem for Developing Machine Learning Algorithms Using Patient Data: Insights from the TUM.ai MakeathonSimon Süwer, Mai Khanh Mai, Christoph Klein, Nicola Götzenberger, Denis Dalić, Andreas Maier, Jan Baumbach2025-10-29下载The integration of clinical data offers significant potential for the development of personalized medicine. However, its use is severely restricted by the General Data Protection Regulation (GDPR), es...
MoEntwine: Unleashing the Potential of Wafer-scale Chips for Large-scale Expert Parallel InferenceXinru Tang, Jingxiang Hou, Dingcheng Jiang, Taiquan Wei, Jiaxin Liu, Jinyi Deng, Huizheng Wang, Qize Yang, Haoran Shang, Chao Li, Yang Hu, Shouyi Yin2025-10-29下载As large language models (LLMs) continue to scale up, mixture-of-experts (MoE) has become a common technology in SOTA models. MoE models rely on expert parallelism (EP) to alleviate memory bottleneck,...
mLR: Scalable Laminography Reconstruction based on MemoizationBin Ma, Viktor Nikitin, Xi Wang, Tekin Bicer, Dong Li2025-10-29下载ADMM-FFT is an iterative method with high reconstruction accuracy for laminography but suffers from excessive computation time and large memory consumption.
Machine Learning and CPU (Central Processing Unit) Scheduling Co-Optimization over a Network of Computing CentersMohammadreza Doostmohammadian, Zulfiya R. Gabidullina, Hamid R. Rabiee2025-10-29下载In the rapidly evolving research on artificial intelligence (AI) the demand for fast, computationally efficient, and scalable solutions has increased in recent years.
Multi-Resolution Model Fusion for Accelerating the Convolutional Neural Network TrainingKewei Wang, Claire Songhyun Lee, Sunwoo Lee, Vishu Gupta, Jan Balewski, Alex Sim, Peter Nugent, Ankit Agrawal, Alok Choudhary, Kesheng Wu, Wei-keng Liao2025-10-29下载Neural networks are rapidly gaining popularity in scientific research, but training the models is often very time-consuming. Particularly when the training data samples are large high-dimensional arra...
Timing Games in Responsive Consensus ProtocolsKaya Alpturer, Kushal Babel, Aditya Saraf2025-10-29下载Optimistic responsiveness -- the ability of a consensus protocol to operate at the speed of the network -- is widely used in consensus protocol design to optimize latency and throughput.
The Singularity Theory of Concurrent Programs: A Topological Characterization and Detection of Deadlocks and LivelocksDi Zhang2025-10-29下载This paper introduces a novel paradigm for the analysis and verification of concurrent programs -- the Singularity Theory. We model the execution space of a concurrent program as a branched topologica...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
A Zero Added Loss Multiplexing (ZALM) Source SimulationJerry Horgan, Alexander Nico-Katz, Shelbi L. Jenkins, Ashley N. Tittelbaugh, Vivek Visan, Rohan Bali, Marco Ruffini, Boulat A. Bash, Daniel C. Kilper2025-10-29下载Zero Added Loss Multiplexing (ZALM) offers broadband, per channel heralded EPR pairs, with a rich parameter space that allows its performance to be tailored for specific applications.
FedSelect-ME: A Secure Multi-Edge Federated Learning Framework with Adaptive Client ScoringHanie Vatani, Reza Ebrahimi Atani2025-10-29下载Federated Learning (FL) enables collaborative model training without sharing raw data but suffers from limited scalability, high communication costs, and privacy risks due to its centralized architect...
Identity Management for Agentic AI: The new frontier of authorization, authentication, and security for an AI agent worldTobin South, Subramanya Nagabhushanaradhya, Ayesha Dissanayaka, Sarah Cecchetti, George Fletcher, Victor Lu, Aldo Pietropaolo, Dean H. Saxe, Jeff Lombardo, Abhishek Maligehalli Shivalingaiah, Stan Bounev, Alex Keisner, Andor Kesselman, Zack Proser, Ginny Fahs, Andrew Bunyea, Ben Moskowitz, Atul Tulshibagwale, Dazza Greenwood, Jiaxin Pei, Alex Pentland2025-10-29下载The rapid rise of AI agents presents urgent challenges in authentication, authorization, and identity management. Current agent-centric protocols (like MCP) highlight the demand for clarified best pra...
MetaLore: Learning to Orchestrate Communication and Computation for Metaverse SynchronizationElif Ebru Ohri, Qi Liao, Anastasios Giovanidis, Francesca Fossati, Nour-El-Houda Yellas2025-10-29下载As augmented and virtual reality evolve, achieving seamless synchronization between physical and digital realms remains a critical challenge, especially for real-time applications where delays affect ...
Q-Learning-Based Time-Critical Data Aggregation Scheduling in IoTVan-Vi Vo, Tien-Dung Nguyen, Duc-Tai Le, Hyunseung Choo2025-10-29下载Time-critical data aggregation in Internet of Things (IoT) networks demands efficient, collision-free scheduling to minimize latency for applications like smart cities and industrial automation.
Resource Allocation in Hybrid Radio-Optical IoT Networks using GNN with Multi-task LearningAymen Hamrouni, Sofie Pollin, Hazem Sallouha2025-10-29下载This paper addresses the problem of dual-technology scheduling in hybrid Internet-of-Things (IoT) networks that integrate Optical Wireless Communication (OWC) with Radio Frequency (RF).
Deep Reinforcement Learning-Based Cooperative Rate Splitting for Satellite-to-Underground Communication NetworksKaiqiang Lin, Kangchun Zhao, Yijie Mao2025-10-29下载Reliable downlink communication in satellite-to-underground networks remains challenging due to severe signal attenuation caused by underground soil and refraction in the air-soil interface.
Device to Device Pairs Sharding based on DistanceK Prajwal, Tharun K, Navaneeth P, Ishwar Mandal, Kiran M2025-10-29下载In the conventional cellular system, devices are not allowed to communicate directly with each other in the licensed cellular bandwidth and all communications take place through the base stations.
Evaluating Learning Congestion control Schemes for LEO ConstellationsMihai Mazilu, Aiden Valentine, George Parisis2025-10-29下载Low Earth Orbit (LEO) satellite networks introduce unique congestion control (CC) challenges due to frequent handovers, rapidly changing round-trip times (RTTs), and non-congestive loss.
Energy consumption assessment of a Virtual Reality Remote Rendering application over 5G networksRoberto Viola, Mikel Irazola, José Ramón Juárez, Minh Nguyen, Alexander Zoubarev, Alexander Futasz, Louay Bassbouss, Amr A. AbdelNabi, Javier Fernández Hidalgo2025-10-29下载This paper investigates the energy implications of remote rendering for Virtual Reality (VR) applications within a real 5G testbed. Remote rendering enables lightweight devices to access high-performa...
Is Protective DNS Blocking the Wild West?David Plonka, Branden Palacio, Debbie Perouli2025-10-29下载We perform a passive measurement study investigating how a Protective DNS service might perform in a Research & Education Network serving hundreds of member institutions.
Adversarial Pre-Padding: Generating Evasive Network Traffic Against Transformer-Based ClassifiersQuanliang Jing, Xinxin Fan, Yanyan Liu, Jingping Bi2025-10-29下载To date, traffic obfuscation techniques have been widely adopted to protect network data privacy and security by obscuring the true patterns of traffic.
TCP ROCCET: An RTT-Oriented CUBIC Congestion Control Extension for 5G and Beyond NetworksLukas Prause, Mark Akselrod2025-10-29下载The behavior of loss-based TCP congestion control algorithms like TCP CUBIC continues to be a challenge in modern cellular networks. Due to the large RLC layer buffers required to deal with short-term...
Adaptive Design of mmWave Initial Access Codebooks using Reinforcement LearningSabrine Aroua, Christos Anastasios Bovolis, Bo Göransson, Anastasios Giovanidis, Mathieu Leconte, Apostolos Destounis2025-10-29下载Initial access (IA) is the process by which user equipment (UE) establishes its first connection with a base station. In 5G systems, particularly at millimeter-wave frequencies, IA integrates beam man...
ML-Based Preamble Collision Detection in the Random Access Procedure of Cellular IoT NetworksGiancarlo Maldonado Cardenas, Diana C. Gonzalez, Judy C. Guevara, Carlos A. Astudillo, Nelson L. S. da Fonseca2025-10-29下载Preamble collision in the random access channel (RACH) is a major bottleneck in massive machine-type communication (mMTC) scenarios, typical of cellular IoT (CIoT) deployments.
Time-Series Foundation Models for ISP Traffic ForecastingFan Liu, Behrooz Farkiani, Patrick Crowley2025-10-29下载Accurate network-traffic forecasting enables proactive capacity planning and anomaly detection in Internet Service Provider (ISP) networks. Recent advances in time-series foundation models (TSFMs) hav...
Learning-Based vs Human-Derived Congestion Control: An In-Depth Experimental StudyMihai Mazilu, Luca Giacomoni, George Parisis2025-10-29下载Learning-based congestion control (CC), including Reinforcement-Learning, promises efficient CC in a fast-changing networking landscape, where evolving communication technologies, applications and tra...
Performance Evaluation of Multimedia Traffic in Cloud Storage Services over Wi-Fi and LTE NetworksAlbert Espinal, V. Sanchez Padilla, Yesenia Cevallos2025-10-29下载The performance of Dropbox, Google Drive, and OneDrive cloud storage services was evaluated under Wi-Fi and LTE network conditions during multimedia file uploads.

cs.PF - Performance

标题作者发布日期PDF摘要
Detecting Anomalies in Machine Learning Infrastructure via Hardware TelemetryZiji Chen, Steven W. D. Chien, Peng Qian, Noa Zilberman2025-10-29下载Modern machine learning (ML) has grown into a tightly coupled, full-stack ecosystem that combines hardware, software, network, and applications.
Outperforming Multiserver SRPT at All LoadsIzzy Grosof, Daniela Hurtado-Lange2025-10-29下载A well-designed scheduling policy can unlock significant performance improvements with no additional resources. Multiserver SRPT (SRPT-kk) is known to achieve asymptotically optimal mean response tim...
Opt4GPTQ: Co-Optimizing Memory and Computation for 4-bit GPTQ Quantized LLM Inference on Heterogeneous PlatformsYaozheng Zhang, Wei Wang, Jie Kong, Jiehan Zhou, Xianwei Zhang, Huanqing Cui, Han Bao, Yuhai Liu2025-10-29下载The increasing adoption of large language models (LLMs) on heterogeneous computing platforms poses significant challenges to achieving high inference efficiency.
The influence of the random numbers quality on the results in stochastic simulations and machine learningBenjamin A. Antunes2025-10-29下载Pseudorandom number generators (PRNGs) are ubiquitous in stochastic simulations and machine learning (ML), where they drive sampling, parameter initialization, regularization, and data shuffling.
mLR: Scalable Laminography Reconstruction based on MemoizationBin Ma, Viktor Nikitin, Xi Wang, Tekin Bicer, Dong Li2025-10-29下载ADMM-FFT is an iterative method with high reconstruction accuracy for laminography but suffers from excessive computation time and large memory consumption.
A Study on Inference Latency for Vision Transformers on Mobile DevicesZhuojin Li, Marco Paolieri, Leana Golubchik2025-10-29下载Given the significant advances in machine learning techniques on mobile devices, particularly in the domain of computer vision, in this work we quantitatively study the performance characteristics of ...

基于 VitePress 构建