f8870fb0-f148-4c10-bd9f-a400bfcfe7d9 beie director@blueeyesintelligence.org 10.35940/ijrte.D7332.1111422 10.1109/TPAMI.2016.2587640O. Vinyals, A. Toshev, S. Bengio and D. Erhan. Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 4, pp. 652-663, 1 April 2017, doi: 10.1109/TPAMI.2016.2587640.[CrossRef]10.1109/CVPRW.2016.61Kenneth Tran, Xiaodong He, Lei Zhang, Jian Sun, Cornelia Carapcea, Chris Thrasher, Chris Buehler, Chris Sienkiewicz. Rich Image Captioning in the Wild. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2016.Martinez Gutierrez, Maria Fernanda. Automated Image Captioning: Exploring the Potential of Microsoft Computer Vision for English and Spanish. Université de Genève. Master, 2019. https://archive-ouverte.unige.ch/unige:132748.Oriol Vinyals, Alexander Toshev, SamyBengio, Dumitru Erhan. Show and Tell: A Neural Image Caption Generator. Computer Vision and Pattern Recognition https://arxiv.org/abs/1411.4555.10.1007/978-3-030-21935-2_26Hasnine, Mohammad Nehal,Flanagan, Brendan, Akcapinar, Gokhan, Ogata, Hiroaki Mouri, Kousuke, Uosaki, Noriko. Distributed, Ambient and Pervasive Interactions" (LNCS, volume 11587) (2019):346-358. http://hdl.handle.net/ 2433/243253.[CrossRef]10.1109/ICDIS.2018.00020F. Ahmed, M. S. Mahmud, R. Al-Fahad, S. Alam and M. Yeasin. Image Captioning for Ambient Awareness on a Sidewalk. 2018 1st International Conference on Data Intelligence and Security (ICDIS), 2018, pp. 85-91, doi: 10.1109/ICDIS.2018.00020.[CrossRef]Michalik, Samuel. Deep learning and visualization of models for image captioning and multimodal translation. Praha, 2020. Bakalářskápráce. UniverzitaKarlova, Matematicko-fyzikálnífakulta, Ústavformálníaaplikovanélingvistiky. VedoucípráceHelcl, Jindřich. http://hdl.handle.net/20.500.11956/11937.10.1016/j.cmpb.2020.105796Alain Jungo, Olivier Scheidegger, Mauricio Reyes, Fabian Balsiger. pymia: A Python package for data handling and evaluation in deep learning-based medical image analysis. Computer Methods and Programs in Biomedicine,Volume 198, 2021. https://doi.org/10.1016/j.cmpb.2020.105796.[CrossRef]10.1007/978-1-4842-3925-4_3Hajba G.L. (2018). Using Beautiful Soup. In: Website Scraping with Python. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3925-4_3.[CrossRef]10.1109/IPTA50016.2020.9286602Y. Bounab, M. Oussalah and A. Ferdenache. Reconciling Image Captioning and User's Comments for Urban Tourism. 2020 Tenth International Conference on Image Processing Theory, Tools and Applications (IPTA), 2020, pp. 1-6, doi: 10.1109/IPTA50016.2020.9286602.[CrossRef]10.1109/IGARSS39084.2020.9323183A. V. Potnis, R. C. Shinde and S. S. Durbha. Towards Natural Language Question Answering Over Earth Observation Linked Data Using Attention-Based Neural Machine Translation. IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium, 2020, pp. 577-580, doi: 10.1109/IGARSS39084.2020.9323183.[CrossRef]Naeha Sharif1, Lyndon White1, Mohammed Bennamoun1, and Syed Afaq Ali Shah .NNEval: Neural Network based Evaluation Metric for Image Captioning.https://openaccess.thecvf.com/content_ECCV_2018/papers/Naeha_Sharif_NNEval_Neural_Network_ECCV_2018_paper.pdf.[CrossRef]10.3115/1073083.1073135Papineni, Kishore &Roukos, Salim & Ward, Todd & Zhu, Wei Jing. BLEU: A Method for Automatic Evaluation of Machine Translation. https://doi.org/10.3115/1073083.1073135. 4236.[CrossRef]10.18653/v1/2021.naacl-srw.8H Ahsan, N Bhalla, D Bhatt, K Shah. Multi-Modal Image Captioning for the Visually Impaired. arXiv preprint arXiv:2105.08106 [cs.CL], 2021 - arxiv.org.[CrossRef]10.1145/3123266.3123275Fuhai Chen, Rongrong Ji, JinsongSu, Yongjian Wu, and Yunsheng Wu. 2017. StructCap: Structured Semantic Embedding for Image Captioning. In Proceedings of the 25th ACM international conference on Multimedia (MM '17). Association for Computing Machinery, New York, NY, USA, 46-54. DOI:https://doi.org/10.1145/3123266.3123275.[CrossRef]10.1109/CVPR.2019.00425Yang Feng, Lin Ma, Wei Liu, Jiebo Luo. Unsupervised Image Captioning. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 4125-4134. https://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Nguyen_Deep_Neural_Networks_2015_CVPR_paper.pdf[CrossRef]10.1007/978-3-319-38791-8_11David Bermbach and Erik Wittern. 2016. Benchmarking Web API Quality. In Web Engineering, Springer International Publishing, Cham, 188-206. DOI:https://doi.org/10.1007/978-3-319-38791-8_11[CrossRef]10.1007/978-1-4842-3342-9_2Del Sole A. (2018). Getting Started with the Computer Vision API. In: Microsoft Computer Vision APIs Distilled. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3342-9_2[CrossRef]10.1007/978-1-4842-3342-9_1Del Sole A. (2018). Introducing Microsoft Cognitive Services. In: Microsoft Computer Vision APIs Distilled. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3342-9_1[CrossRef]10.1007/978-1-4842-3342-9_3Del Sole A. (2018) Invoking the Computer Vision API from C#. In: Microsoft Computer Vision APIs Distilled. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3342-9_3[CrossRef]Altseeker - PIP Package for python for automating Alt text https://github.com/ksg98/altseekerDataset created and Evaluation implementation https://github.com/ksg98/Model-Evaluatiion-with-BLEU-Confidemce-and-Latency-with-dataset-usedyes