Home  About misconduct   中文  
 
  • Search
  • lucene Search
  • Citation
  • Fig/Tab
  • Adv Search
Just Accepted  |  Current Issue  |  Archive  |  Featured Articles  |  Most Read  |  Most Download  |  Most Cited

Chinese Journal of Stomatological Research(Electronic Edition) ›› 2026, Vol. 20 ›› Issue (01): 17-24. doi: 10.3877/cma.j.issn.1674-1366.2026.01.003

• Digital & Intelligent Dentistry Column·Original Articles • Previous Articles    

Preliminary exploration on the application of the classification of dental smile lines via deep learning and multimodal large language model

Xiaofei Meng1, Zhuohong Gong2, Yaowen Wan3, Peida Li3, Qiqi Hu1, Longshiyu Qiu4, Hengyi Liu4, Weili Xie1,()   

  1. 1Department of Prosthodontics, The First Affiliated Hospital of Harbin Medical University, Harbin 150001, China
    2Restorative Dental Sciences, Faculty of Dentistry, The University of Hong Kong, Hong Kong 999077, China
    3School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou 510006, China
    4Hospital of Stomatology, Guanghua School of Stomatology, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Stomatology, Guangdong Provincial Clinical Research Center of Oral Diseases, Guangzhou 510055, China
  • Received:2025-12-09 Online:2026-02-01 Published:2026-02-24
  • Contact: Weili Xie

Abstract:

Objective

Smile aesthetic evaluation is a critical component of clinical treatment planning. As a key aesthetic indicator, the precise classification of the smile line is crucial for optimizing restorative and reconstructive treatment plans. However, classifying smile line requires accurate assessment of complex relationships among the lips, gingiva and teeth, and analysis by dentists involves a degree of subjectivity and chances for misdiagnosis. This study aimed to investigate smile line classification by comparing the performance of convolutional neural networks (CNNs) and large language models (LLMs) , as well as clinicians of varying expertise levels, in this task.

Methods

Based on the publicly available high-quality FFHQ facial dataset, a smile image annotation dataset comprising 1 000 samples was constructed following image preprocessing and standardized annotations of three types: high, medium and low smile line. Seven classic CNN models (VGG16, ResNet34, etc.) and five representative multimodal LLMs (Qwen series, LLaVA 1.5-7B) were employed for training, validation, and testing. Model performance was evaluated using accuracy, precision, recall, and F1 scores, and compared against assessments made by clinicians of different seniority levels.

Results

Among the seven commonly used CNN models, the ResNet152 model demonstrated optimal overall performance, achieving a mean classification accuracy of 83.30%, which significantly outperformed other CNN models and multimodal LLMs. Senior dentists achieved a classification accuracy of 83.00%, comparable to the performance of ResNet152. Heatmaps demonstrate similar attention regions between ResNet152 and dental practitioners.

Conclusions

CNN models demonstrated substantial clinical potential in smile line classification tasks, attaining expert-level performance. In contrast, large language models required further optimization for medical image fine-grained classification. This study provided experimental evidence and technical insights for developing intelligent aesthetic assessment systems in dentistry.

Key words: Smile esthetics, Classification of smile lines, Deep learning, Large language models

京ICP 备07035254号-28
Copyright © Chinese Journal of Stomatological Research(Electronic Edition), All Rights Reserved.
Tel: 020-87330582 E-mail: zhkqyxyj@163.com
Powered by Beijing Magtech Co. Ltd