Bhatt, Dvijesh, and Priyank Thakkar. “Improving Narrative Coherence in Dense Video Captioning through Transformer and Large Language Models”. Journal of Innovative Image Processing, vol. 7, no. 2, June 2025, pp. 333-61, https://doi.org/10.36548/jiip.2025.2.005.