Multi Modal Image Text Sentiment Analysis Method for Social Media

Authors

  • Zelin Wang Department of Computer Science and Technology, Taiyuan University of Science and Technology, Shanxi, China

DOI:

https://doi.org/10.54097/qhqhfk25

Keywords:

Multimodal Sentiment Analysis, Semantic Alignment, Synthetic Image Enhancement, Large Model Fusion, Cross-Category Technology Fusion.

Abstract

With the widespread application of social media, the multimodal fusion feature of "text + image" in user-generated content has become a dominant form. Therefore, sentiment analysis based on multimodal social media images and texts is an important research topic today. However, sentiment analysis faces three major challenges: enhancing image modality expression, bridging the cross-modal semantic gap, and improving the ability to model complex texts. This paper reviews five typical multimodal sentiment analysis methods proposed in the past two years. These methods are categorized into image enhancement, semantic alignment, and large model integration based on the types of problems they primarily address. A comparative analysis of their technical paths, performance, and application scenarios is conducted to understand the research focus of each type of model, identify their shortcomings and challenges, and propose a cross-category fusion technology solution by integrating existing model methods. An integrated framework of "universal semantic encoding + dynamic modality adaptation + cross-scenario calibration" is constructed to improve the generalization and robustness of models, providing directions for future research on multimodal social media image-text sentiment analysis.

Downloads

Download data is not yet available.

References

[1] Liu Y, Wang Z, Fang J, et al. Multimodal public opinion analysis based on image-text fusion. Journal of Computer Science and Exploration, 2022, 16(6): 1260-1274.

[2] Yadav A, Vishwakarma D K. A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis. Proceedings of the 2023 ACM on Conference on Information and Knowledge Management, 2023: 1-11. DOI: https://doi.org/10.1145/3517139

[3] Zhou H. The Application of Artificial Intelligence-based Multimodal Emotion Analysis. In: Wang Y (Ed.). Proceedings of the 2024 International Conference on Artificial Intelligence and Communication (ICAIC 2024), Advances in Intelligent Systems Research 185, 2024: 286-294. DOI: https://doi.org/10.2991/978-94-6463-512-6_32

[4] Zhao K, Zheng M, Li Q, et al. Multimodal Sentiment Analysis-A Comprehensive Survey From a Fusion Methods Perspective. IEEE Access, 2025, 13: 64556-64571. DOI: https://doi.org/10.1109/ACCESS.2025.3554665

[5] Isa Z O, Adewumi S E, Yemi-Peters V I. Depression Detection Using Sentiment Analysis of Social Media Text. FUW Trends in Science & Technology Journal, 2025, 10(1): 227-231.

[6] He X. Research on public attitude towards AI portrait generation technology based on social media big data and its application potential in e-commerce. E-Commerce Review, 2025, 14(4): 176-186. DOI: https://doi.org/10.12677/ecl.2025.144876

[7] Wang C, Konpang J, Sirikham A, et al. Enhancing Weibo Sentiment Analysis with Multi-Modal Learning: Integrating Text and Synthesized Images with Contrastive Learning. IEEE Access, 2024, 11: 1-11. DOI: https://doi.org/10.1109/ACCESS.2025.3579782

[8] Xiang Y, Wu D, Cai Y, et al. Entity Knowledge-Guided Image-Text Alignment for Joint Multimodal Aspect-Based Sentiment Analysis. IEICE Transactions on Information and Systems, 2024, E107-A(1): 1-10.

[9] Yan Z D, Guo J J, Yu Z T. Aspect-Guided Progressive Fusion of Text and Image for Multimodal Aspect-Based Sentiment Analysis. Proceedings of the 23rd Chinese Computational Linguistics Conference, 2024: 454-466.

[10] Mao K B, Dai W, Guo Z H, et al. Review on the evolution and application of AI knowledge distillation technology. Journal of Agricultural Big Data, 2025, 7(2): 144-154.

Downloads

Published

23-12-2025

How to Cite

Wang, Z. (2025). Multi Modal Image Text Sentiment Analysis Method for Social Media. Highlights in Science, Engineering and Technology, 159, 48-58. https://doi.org/10.54097/qhqhfk25