TY - GEN
T1 - Optimization of Multimodal Generative Models for Creative Content Generation
AU - Bhowmik, Ayushman
AU - Sharma, Ruchi
AU - Yang, Tiansheng
AU - Wang, Lu
AU - Rathore, Bharati
AU - Tripathy, Hrudaya Kumar
PY - 2024/11/6
Y1 - 2024/11/6
N2 - This research paper presents an in-depth examination of recent developments in multimodal generative models with a specific focus on enhancing creative content generation. We introduce a novel architectural framework that seamlessly integrates text, image, and audio modalities, enabling cross-modal translation of creative concepts. Extensive empirical evaluations demonstrate the model’s proficiency in generating creative content, characterized by high quality, coherence, and diversity. The applications in day-to-day life are extensive, including digital and physical marketing and the personalization of everyday choices.
AB - This research paper presents an in-depth examination of recent developments in multimodal generative models with a specific focus on enhancing creative content generation. We introduce a novel architectural framework that seamlessly integrates text, image, and audio modalities, enabling cross-modal translation of creative concepts. Extensive empirical evaluations demonstrate the model’s proficiency in generating creative content, characterized by high quality, coherence, and diversity. The applications in day-to-day life are extensive, including digital and physical marketing and the personalization of everyday choices.
U2 - 10.1007/978-981-97-6726-7_23
DO - 10.1007/978-981-97-6726-7_23
M3 - Conference contribution
SN - 978-981-97-6725-0
T3 - Lecture Notes in Networks and Systems
SP - 291
EP - 299
BT - Proceedings of Fifth Doctoral Symposium on Computational Intelligence
A2 - Swaroop, Abhishek
A2 - Kansal, Vineet
A2 - Fortino, Giancarlo
A2 - Hassanien, Aboul Ella
PB - Springer
T2 - 5th Doctoral Symposium on Computational Intelligence
Y2 - 10 May 2024 through 10 May 2024
ER -