Preliminary exploration on the application of the classification of dental smile lines via deep learning and multimodal large language model

doi:10.3877/cma.j.issn.1674-1366.2026.01.003

Abstract

Abstract:

Objective

Smile aesthetic evaluation is a critical component of clinical treatment planning. As a key aesthetic indicator, the precise classification of the smile line is crucial for optimizing restorative and reconstructive treatment plans. However, classifying smile line requires accurate assessment of complex relationships among the lips, gingiva and teeth, and analysis by dentists involves a degree of subjectivity and chances for misdiagnosis. This study aimed to investigate smile line classification by comparing the performance of convolutional neural networks (CNNs) and large language models (LLMs) , as well as clinicians of varying expertise levels, in this task.

Methods

Based on the publicly available high-quality FFHQ facial dataset, a smile image annotation dataset comprising 1 000 samples was constructed following image preprocessing and standardized annotations of three types: high, medium and low smile line. Seven classic CNN models (VGG16, ResNet34, etc.) and five representative multimodal LLMs (Qwen series, LLaVA 1.5-7B) were employed for training, validation, and testing. Model performance was evaluated using accuracy, precision, recall, and F₁ scores, and compared against assessments made by clinicians of different seniority levels.

Results

Among the seven commonly used CNN models, the ResNet152 model demonstrated optimal overall performance, achieving a mean classification accuracy of 83.30%, which significantly outperformed other CNN models and multimodal LLMs. Senior dentists achieved a classification accuracy of 83.00%, comparable to the performance of ResNet152. Heatmaps demonstrate similar attention regions between ResNet152 and dental practitioners.

Conclusions

CNN models demonstrated substantial clinical potential in smile line classification tasks, attaining expert-level performance. In contrast, large language models required further optimization for medical image fine-grained classification. This study provided experimental evidence and technical insights for developing intelligent aesthetic assessment systems in dentistry.

Key words: Smile esthetics, Classification of smile lines, Deep learning, Large language models

Xiaofei Meng, Zhuohong Gong, Yaowen Wan, Peida Li, Qiqi Hu, Longshiyu Qiu, Hengyi Liu, Weili Xie. Preliminary exploration on the application of the classification of dental smile lines via deep learning and multimodal large language model[J]. Chinese Journal of Stomatological Research(Electronic Edition), 2026, 20(01): 17-24.

/ / Recommend

Add to citation manager EndNote|Ris|BibTeX

URL: https://zhkqyxyjzz.cma-cmc.com.cn/EN/10.3877/cma.j.issn.1674-1366.2026.01.003

https://zhkqyxyjzz.cma-cmc.com.cn/EN/Y2026/V20/I01/17

Figures/Tables 6

References 20

[1]	Lukez A，Pavlic A，Trinajstic Zrinski M，et al. The unique contribution of elements of smile aesthetics to psychosocial well-being[J]. J Oral Rehabil，2015，42（4）：275-281. DOI：10.1111/joor.12250.
[2]	Pham TAV，Nguyen PA. Morphological features of smile attractiveness and related factors influence perception and gingival aesthetic parameters[J]. Int Dent J，2022，72（1）：67-75. DOI：10.1016/j.identj.2021.02.001.
[3]	Wang C，Hu WJ，Liang LZ，et al. Esthetics and smile-related characteristics assessed by laypersons[J]. J Esthet Restor Dent，2018，30（2）：136-145. DOI：10.1111/jerd.12356.
[4]	许砚耕，张艳玲，胡文杰，等.以微笑美观为导向的口腔软组织美学评价方法概述[J].口腔医学，2025，45（1）：18-24. DOI：10.13591/j.cnki.kqyx.2025.01.004.
[5]	Lee S，Jin G，Park JH，et al. Evaluation metric of smile classification by peri-oral tissue segmentation for the automation of digital smile design[J]. J Dent，2024，145：104871. DOI：10.1016/j.jdent.2024.104871.
[6]	Liu MQ，Xu ZN，Mao WY，et al. Deep learning-based evaluation of the relationship between mandibular third molar and mandibular canal on CBCT[J]. Clin Oral Investig，2022，26（1）：981-991. DOI：10.1007/s00784-021-04082-5.
[7]	Zeng P，Song R，Lin Y，et al. Abnormal maxillary sinus diagnosing on CBCT images via object detection and 'straight-forward' classification deep learning strategy[J]. J Oral Rehabil，2023，50（12）：1465-1480. DOI：10.1111/joor.13585.
[8]	Cui Z，Fang Y，Mei L，et al. A fully automatic AI system for tooth and alveolar bone segmentation from cone-beam CT images [J]. Nat Commun，2022，13（1）：2096. DOI：10.1038/s41467-022-29637-2.
[9]	Gong Z，Li X，Shi M，et al. Measuring the binary thickness of buccal bone of anterior maxilla in low-resolution cone-beam computed tomography via a bilinear convolutional neural network [J]. Quant Imaging Med Surg，2023，13（12）：8053-8066. DOI：10.21037/qims-23-744.
[10]	Chen X，Zhou C，Zhu Y，et al. Detecting glaucoma in highly myopic eyes from fundus photographs using deep convolutional neural networks[J]. Clin Exp Ophthalmol，2025，53（5）：502-515. DOI：10.1111/ceo.14498.
[11]	Biswas SS. Role of chat GPT in public health[J]. Ann Biomed Eng，2023，51（5）：868-869. DOI：10.1007/s10439-023-03172-7.
[12]	Russell BC，Torralba A，Murphy KP，et al. LabelMe：A database and web-based tool for image annotation[J]. Int J Comput Vision，2008，77（1）：157-173. DOI：10.1007/s11263-007-0090-8.
[13]	Tjan AH，Miller GD，The JG. Some esthetic factors in a smile[J]. J Prosthet Dent，1984，51（1）：24-28. DOI：10.1016/s0022-3913(84)80097-9.
[14]	Selvaraju RR，Cogswell M，Das A，et al. Grad-CAM：Visual explanations from deep networks via gradient-based localization[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision（ICCV）. Venice，2017：22-29. URL
[15]	Das N，Hussain E，Mahanta LB. Automated classification of cells into multiple classes in epithelial tissue of oral squamous cell carcinoma using transfer learning and convolutional neural network[J]. Neural Netw，2020，128：47-60. DOI：10.1016/j.neunet.2020.05.003.
[16]	Setzer FC，Shi KJ，Zhang Z，et al. Artificial intelligence for the computer-aided detection of periapical lesions in cone-beam computed tomographic images[J]. J Endod，2020，46（7）：987-993. DOI：10.1016/j.joen.2020.03.025.
[17]	Xu X，Liu C，Zheng Y. 3D tooth segmentation and labeling using deep convolutional neural networks[J]. IEEE Trans Vis Comput Graph，2019，25（7）：2336-2348. DOI：10.1109/tvcg.2018.2839685.
[18]	Khan M，Kazmi SMR，Khan FR，et al. Analysis of different characteristics of smile[J]. BDJ Open，2020，6：6. DOI：10.1038/s41405-020-0032-x.
[19]	Cunha J，Fernandes GVO，Fernandes JCH，et al. The interference of age and gender on smile characterization analyzed on six parameters：A clinical-photographic pilot study[J]. Medicina （Kaunas），2023，59（3）：595. DOI：10.3390/medicina59030595.
[20]	Shi M，Gong Z，Zeng P，et al. Multi-quantifying maxillofacial traits via a demographic parity-based AI model[J]. BME Front，2024，5：0054. DOI：10.34133/bmef.0054.

分析者		平均准确率（%）	平均精确率（%）	平均召回率（%）	平均F₁分数（%）	平均时间（s/张， ± s）
医师	医学生	83.50	82.55	79.31	78.31	5.060 0 ± 2.145 2
	初级医师	85.50	83.00	85.13	83.42	4.993 5 ± 1.987 2
	高级医师	83.00	84.50	80.55	80.35	4.132 8 ± 1.153 3
最优CNN	ResNet152	83.30	80.67	86.81	81.84	0.008 9 ± 0.002 0
最优LLM	LLaVA-1.5-7B	62.60	61.00	61.20	60.80	0.464 3 ± 0.017 0

分析者		平均准确率（%）	平均精确率（%）	平均召回率（%）	平均F₁分数（%）	平均时间（s/张， ± s）
医师	医学生	83.50	82.55	79.31	78.31	5.060 0 ± 2.145 2
	初级医师	85.50	83.00	85.13	83.42	4.993 5 ± 1.987 2
	高级医师	83.00	84.50	80.55	80.35	4.132 8 ± 1.153 3
最优CNN	ResNet152	83.30	80.67	86.81	81.84	0.008 9 ± 0.002 0
最优LLM	LLaVA-1.5-7B	62.60	61.00	61.20	60.80	0.464 3 ± 0.017 0

Please choose a citation manager

Content to export