LARGE LANGUAGE MODELS FOR GENERATING CREATIVE CONCEPTS IN VISUAL ART PRE-PRODUCTION PROCESSES

A. Vijayalakahmi; Vichitra M; M. Ulagammai; Anshun Cai; Manoranjan Parhi; Pawan Wawage

doi:10.29121/shodhkosh.v7.i4s.2026.7502

Authors

Dr. A. Vijayalakahmi, Assistant Professor, Department of English, Chaitanya Bharathi Institute of Technology, Hyderabad-500075, Telangana, India
Dr. Vichitra M., Assistant Professor, Department of Civil Engineering, Faculty of Engineering and Technology, JAIN (Deemed-to-be University), Bengaluru, Karnataka, India
Dr. M. Ulagammai, Associate Professor, SRM Institute of Science and Technology, Vadapalani Campus, Chennai, India
Anshun Cai, Faculty of Education, Shinawatra University, Thailand
Dr. Manoranjan Parhi, Professor, Department of Centre for Data Science, Institute of Technical Education and Research, Siksha 'O' Anusandhan (Deemed to be University), Bhubaneswar, Odisha, India
Pawan Wawage, Assistant Professor, Department of Information Technology, Vishwakarma Institute of Technology, Pune, Maharashtra 411037, India

Keywords:

Large Language Models, Computational Creativity, Visual Art Pre-Production, Concept Generation, Human–AI Collaboration, Prompt Engineering, Creative AI, Digital Art Workflow

Abstract [English]

The pre-production phase of visual art is a critical stage because it involves conceptual ideation, story development, and design exploration. As demand for fast and diverse creative output grows, so does the need for intelligent systems that can augment traditional ideation work. This article examines the application of Large Language Models (LLMs) to creative concept generation in visual art pre-production. Because they can process and generate semantically rich text, LLMs are well placed to support early-stage artistic processes. The study proposes a structured methodology that integrates prompt engineering, concept generation, and evaluation into a human-AI collaborative workflow. A system architecture is developed that converts user-specified inputs into structured creative concepts such as character designs, scene descriptions, and thematic scripts. The paper also explains how LLMs can be integrated with digital art tools so that text-based ideation carries over into visual representation without interruption. The experimental results show that LLM-assisted workflows improve the diversity, originality, and quality of idea generation compared with traditional ideation methods. The generated concepts are assessed with a detailed evaluation framework that measures coherence, relevance, aesthetic potential, and diversity. In addition, a user study with artists and designers offers insight into the practical applicability and usability of the proposed approach.
The findings indicate that LLMs can serve as effective co-creative partners that help users overcome creative block and broaden their conceptual exploration without giving up artistic control. Despite these advantages, problems of originality, bias, and creativity evaluation remain, which calls for further research. The paper concludes with a discussion of future directions, including multimodal integration, personalization of AI tools, and the development of standardized methods for measuring creativity. Overall, the work contributes to the field of computational creativity by demonstrating how LLMs can enhance the visual art pre-production process and reshape human-AI collaboration in the creative industries.
