IEEE VIS 2025 Content: ASight: Fine-tuning Auto-Scheduling Optimizations for Model Deployment via Visual Analytics

ASight: Fine-tuning Auto-Scheduling Optimizations for Model Deployment via Visual Analytics



 Laixin Xie -

 Chenyang Zhang -

 Ruofei Ma -

 Xingxing Xing -

 Wei Wan -

 Quan Li -

 Screen-reader Accessible PDF

 DOI: 10.1109/TVCG.2025.3574194

Room: Hall E1

2025-11-06T13:48:00.000ZGMT-0600Change your timezone on the schedule page
2025-11-06T13:48:00.000Z

Recorded video from this session can be viewed at the following link.
https://youtu.be/fkfdha6pCeI

Keywords

Optimization, Codes, Hardware, Schedules, Runtime, Measurement, Visual analytics, Optimal scheduling, Aerodynamics, Training

Abstract

Upon completing the design and training phases, deploying a deep learning model to specific hardware becomes necessary prior to its implementation in practical applications. To enhance the performance of the model, the developers must optimize it to decrease inference latency. Auto-scheduling, an automated approach that generates optimization schemes, offers a feasible option for large-scale auto-deployment. Nevertheless, the low-level code generated by auto-scheduling closely resembles hardware coding and may present challenges for human comprehension, thereby hindering future manual optimization efforts. In this study, we introduce ASight, a visual analytics system to assist engineers in identifying performance bottlenecks, comprehending the auto-generated low-level code, and obtaining insights from auto-scheduling optimizations. We develop a subgraph matching algorithm capable of identifying graph isomorphism among Intermediate Representations to track performance bottlenecks from low-level metrics to high-level computational graphs. To address the substantial profiling metrics involved in auto-scheduling and derive optimization design principles by summarizing commonalities among auto-scheduling optimizations, we propose an enhanced visualization for the large search space of auto-scheduling. We validate the effectiveness of ASight through two case studies, one focused on a local machine and the other on a data center, along with a quantitative experiment exploring optimization design principles.