2025-07-26

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| AxOSyn: An Open-source Framework for Synthesizing Novel Approximate Arithmetic Operators | Siva Satyendra Sahoo, Salim Ullah, Akash Kumar | 2025-07-26 | Download | Edge AI deployments are becoming increasingly complex, necessitating energy-efficient solutions for resource-constrained embedded systems. Approximate computing, which allows for controlled inaccuraci... |
| A Scalable Resource Management Layer for FPGA SoCs in 6G Radio Units | Nikolaos Bartzoudis, José Rubio Fernández, David López-Bueno, Antonio Román Villarroel | 2025-07-26 | Download | This work presents a perspective on addressing the underutilization of computing resources in FPGA SoC devices deployed in 5G radio and edge computing infrastructure. |
| ChipletPart: Cost-Aware Partitioning for 2.5D Systems | Alexander Graening, Puneet Gupta, Andrew B. Kahng, Bodhisatta Pramanik, Zhiang Wang | 2025-07-26 | Download | Industry adoption of chiplets has been growing as chiplets are a cost-effective option for making large, high-performance systems. Consequently, partitioning large systems into chiplets is increasingl... |
| Smaller, Faster, Cheaper: Architectural Designs for Efficient Machine Learning | Steven Walton | 2025-07-26 | Download | Major advancements in the capabilities of computer vision models have been primarily fueled by rapid expansion of datasets, model parameters, and computational budgets, leading to ever-increasing dema... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Racing to Idle: Energy Efficiency of Matrix Multiplication on Heterogeneous CPU and GPU Architectures | Mufakir Qamar Ansari, Mudabir Qamar Ansari | 2025-07-26 | Download | The paradigm shift towards multi-core and heterogeneous computing, driven by the fundamental power and thermal limits of single-core processors, has established energy efficiency as a first-class desi... |
| $K^4$: Online Log Anomaly Detection Via Unsupervised Typicality Learning | Weicong Chen, Vikash Singh, Zahra Rahmani, Debargha Ganguly, Mohsen Hariri, Vipin Chaudhary | 2025-07-26 | Download | Existing Log Anomaly Detection (LogAD) methods are often slow, dependent on error-prone parsing, and use unrealistic evaluation protocols. We introduce $K^4$, an unsupervised and parser-independent fr... |
| Parallel Hierarchical Agglomerative Clustering in Low Dimensions | MohammadHossein Bateni, Laxman Dhulipala, Willem Fletcher, Kishen N Gowda, D Ellis Hershkowitz, Rajesh Jayaram, Jakub Łącki | 2025-07-26 | Download | Hierarchical Agglomerative Clustering (HAC) is an extensively studied and widely used method for hierarchical clustering in $\mathbb{R}^k$ based on repeatedly merging the closest pair of clusters acco... |
| MTASet: A Tree-based Set for Efficient Range Queries in Update-heavy Workloads | Daniel Manor, Mor Perry, Moshe Sulamy | 2025-07-26 | Download | In concurrent data structures, the efficiency of set operations can vary significantly depending on the workload characteristics. Numerous concurrent set implementations are optimized and fine-tuned t... |
| Offloading tracing for real-time systems using a scalable cloud infrastructure | David Jannis Schmidt, Grigory Fridman, Florian von Zabiensky | 2025-07-26 | Download | Real-time embedded systems require precise timing and fault detection to ensure correct behavior. Traditional tracing tools often rely on local desktops with limited processing and storage capabilitie... |
| A Fast Parallel Median Filtering Algorithm Using Hierarchical Tiling | Louis Sugy | 2025-07-26 | Download | Median filtering is a non-linear smoothing technique widely used in digital image processing to remove noise while retaining sharp edges. It is particularly well suited to removing outliers (impulse n... |
| MegatronApp: Efficient and Comprehensive Management on Distributed LLM Training | Bohan Zhao, Guang Yang, Shuo Chen, Ruitao Liu, Tingrui Zhang, Yongchao He, Wei Xu | 2025-07-26 | Download | The rapid escalation in the parameter count of large language models (LLMs) has transformed model training from a single-node endeavor into a highly intricate, cross-node activity. |
| CleANN: Efficient Full Dynamism in Graph-based Approximate Nearest Neighbor Search | Ziyu Zhang, Yuanhao Wei, Joshua Engels, Julian Shun | 2025-07-26 | Download | Approximate nearest neighbor search (ANNS) has become a quintessential algorithmic problem for various other foundational data tasks for AI workloads. |
| Accelerating Matrix Multiplication: A Performance Comparison Between Multi-Core CPU and GPU | Mufakir Qamar Ansari, Mudabir Qamar Ansari | 2025-07-26 | Download | Matrix multiplication is a foundational operation in scientific computing and machine learning, yet its computational complexity makes it a significant bottleneck for large-scale applications. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Towards Next Generation Immersive Applications in 5G Environments | Rohail Asim, Ankit Bhardwaj, Lakshmi Suramanian, Yasir Zaki | 2025-07-26 | Download | The Multi-user Immersive Reality (MIR) landscape is evolving rapidly, with applications spanning virtual collaboration, entertainment, and training. |
| A Scalable Resource Management Layer for FPGA SoCs in 6G Radio Units | Nikolaos Bartzoudis, José Rubio Fernández, David López-Bueno, Antonio Román Villarroel | 2025-07-26 | Download | This work presents a perspective on addressing the underutilization of computing resources in FPGA SoC devices deployed in 5G radio and edge computing infrastructure. |
| Optimizing Spreading Factor Selection for Mobile LoRa Gateways Using Single-Channel Hardware | W. A. Sasindu Wijesuriya | 2025-07-26 | Download | The deployment of mobile LoRa gateways using low-cost single-channel hardware presents a significant challenge in maintaining reliable communication due to the lack of dynamic configuration support. |
| Predicting Locations of Cell Towers for Network Capacity Expansion | Sowmiyan Morri, Joy Bose, L Raghunatha Reddy, Sai Hareesh Anamandra | 2025-07-26 | Download | Network capacity expansion is a critical challenge for telecom operators, requiring strategic placement of new cell sites to ensure optimal coverage and performance. |