IEEE VIS 2025 Content: ASight: Fine-tuning Auto-Scheduling Optimizations for Model Deployment via Visual Analytics

ASight: Fine-tuning Auto-Scheduling Optimizations for Model Deployment via Visual Analytics

Laixin Xie -

Chenyang Zhang -

Ruofei Ma -

Xingxing Xing -

Wei Wan -

Quan Li -

Image not found
Screen-reader Accessible PDF

Room: Hall E1

Keywords

Optimization, Codes, Hardware, Schedules, Runtime, Measurement, Visual analytics, Optimal scheduling, Aerodynamics, Training

Abstract

Upon completing the design and training phases, deploying a deep learning model to specific hardware becomes necessary prior to its implementation in practical applications. To enhance the performance of the model, the developers must optimize it to decrease inference latency. Auto-scheduling, an automated approach that generates optimization schemes, offers a feasible option for large-scale auto-deployment. Nevertheless, the low-level code generated by auto-scheduling closely resembles hardware coding and may present challenges for human comprehension, thereby hindering future manual optimization efforts. In this study, we introduce ASight, a visual analytics system to assist engineers in identifying performance bottlenecks, comprehending the auto-generated low-level code, and obtaining insights from auto-scheduling optimizations. We develop a subgraph matching algorithm capable of identifying graph isomorphism among Intermediate Representations to track performance bottlenecks from low-level metrics to high-level computational graphs. To address the substantial profiling metrics involved in auto-scheduling and derive optimization design principles by summarizing commonalities among auto-scheduling optimizations, we propose an enhanced visualization for the large search space of auto-scheduling. We validate the effectiveness of ASight through two case studies, one focused on a local machine and the other on a data center, along with a quantitative experiment exploring optimization design principles.