LARGE LANGUAGE MODELS FOR GENERATING CREATIVE CONCEPTS IN VISUAL ART PRE-PRODUCTION PROCESSES
DOI:
https://doi.org/10.29121/shodhkosh.v7.i4s.2026.7502Keywords:
Large Language Models, Computational Creativity, Visual Art Pre-Production, Concept Generation, Human–AI Collaboration, Prompt Engineering, Creative AI, Digital Art WorkflowAbstract [English]
Pre-production phase of visual art is a very significant stage because it entails intellectual ideation, story development and search of design. The need to possess smart looking systems that can be utilized to augment the traditional ideation work is growing as well as requirements of fast and diverse creative effort are escalating. The article dwells upon the application of Large Language Models (LLMs) to generate creative concepts in pre-production in visual art. With their abilities to manipulate and generate semantically rich textual data, LLCs are in a good position to be utilized to assist in supporting the early-stage artistic processes. The study proposes a formal methodology that would involve timely engineering, notion generation, and evaluation into a human-AI work system. System architecture is a developed system that assists in the conversion of user specified inputs to structured creative concepts like design of characters, descriptions of scenes and thematic scripts. The paper also explains how LLC can be incorporated with digital art tools in such a way that the ideation process through text may be incorporated into a visual representation without interruption. The obtained outcomes of the experiment show that the workflows that are assisted by LLM have a positive influence on the diversity, originality, and the quality of idea generation as compared to the traditional methods of idea generation. The generated concepts are evaluated using a detailed evaluation framework to assess the quality of the concepts generated by using various measures such as coherence, relevance, aesthetic potential and diversity. In addition, the user study, which will be carried out with artists and designers, will assist in receiving the concept of the practical applicability and usability of the offered approach. The findings demonstrate that LLMs can be regarded as efficient co-creative partners that help users overcome the issue of creative paralysis and expand the scope of their conceptual exploration without losing their artistic control. Despite these advantages, the originality, bias and creative evaluation problems are still present, which proves the need of more research. The discussion of the future directions, including multimodal integration, personalization of AI tools, and the development of the standardized ways of creativity measurement, conclude the paper. Overall, the work is applicable to the field of computational creativity as it demonstrates the possibility of using LLM to enhance the pre-production process related to the visual art and rebrand the human-AI collaboration in the creative industries.
References
Biswas, S. S. (2023). Role of ChatGPT in Public Health. Annals of Biomedical Engineering, 51, 868–869. https://doi.org/10.1007/s10439-023-03172-7
Blunsom, P. (2004). Hidden Markov Models. Lecture Notes, 15, 48.
Bommasani, R., Hudson, D. A., Adeli, E., Altman, R., Arora, S., von Arx, S., ... and Liang, P. (2021). On The Opportunities and Risks of Foundation Models. arXiv preprint arXiv:2108.07258.
Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., ... and Amodei, D. (2020). Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems (Vol. 33, 1877–1901).
Driess, D., Xia, F., Sajjadi, M. S. M., Touvron, H., Yin, H., Bachem, O., ... and Rus, D. (2023). PaLM-E: An Embodied Multimodal Language Model. arXiv preprint arXiv:2303.03378.
Kashyap, S. V., Purohit, S., Kumar, D. A., Jawaid, F. I. M., Kumar, J. R. R., and Ajani, S. N. (2025). Visual Storytelling and Explainable Intelligence in Organizational Change Communication. Shodhkosh Journal of Visual and Performing Arts, 6(5s), 696–707. https://doi.org/10.29121/shodhkosh.v6.i5s.2025.6965
Kasneci, E., Sessler, K., Küchemann, S., Bannert, M., Dementieva, D., Fischer, F., ... and Kasneci, G. (2023). ChatGPT for Good? On Opportunities and Challenges of Large Language Models for Education. Learning and Individual Differences, 103, 102274. https://doi.org/10.1016/j.lindif.2023.102274
Kosmyna, N., Luu, T., Bouchard, K., and Cummings, M. L. (2025). Your Brain on ChatGPT: Accumulation of Cognitive Debt When Using an AI Assistant for Essay Writing Task. arXiv Preprint arXiv:2506.08872.
Magesh, V., Sundararajan, R., and Krishnan, A. (2025). Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools. Journal of Empirical Legal Studies, 22, 216–242. https://doi.org/10.1111/jels.12413
Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:1301.3781.
Mikolov, T., Karafiát, M., Burget, L., Černocký, J., and Khudanpur, S. (2010). Recurrent Neural Network Based Language Model. In Proceedings of Interspeech (1045–1048). Chiba, Japan. https://doi.org/10.21437/Interspeech.2010-343
OpenAI, Achiam, J., Adler, S., Agarwal, S., Ahmad, L., Akkaya, E., ... and Zoph, B. (2023). GPT-4 Technical Report. arXiv preprint arXiv:2303.08774.
Raiaan, M. A. K., Rahman, M. M., Hossain, M. S., Al Nahian, M. J., and Al Nahian, M. S. (2024). A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges. IEEE Access, 12, 26839–26874. https://doi.org/10.1109/ACCESS.2024.3365742
Weidinger, L., Uesato, J., Rauh, M., Mellor, J., Anderson, K., Gabriel, I., ... and Kohli, P. (2022). Taxonomy of Risks Posed by Language Models. In Proceedings of the ACM Conference on Fairness, Accountability, and Transparency (FAccT) (214–229). Seoul, South Korea. https://doi.org/10.1145/3531146.3533088
Yu, Y., Si, X., Hu, C., and Zhang, J. (2019). A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures. Neural Computation, 31, 1235–1270. https://doi.org/10.1162/neco_a_01199
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Yogesh

This work is licensed under a Creative Commons Attribution 4.0 International License.
With the licence CC-BY, authors retain the copyright, allowing anyone to download, reuse, re-print, modify, distribute, and/or copy their contribution. The work must be properly attributed to its author.
It is not necessary to ask for further permission from the author or journal board.
This journal provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.























