<?xml version="1.0" encoding="UTF-8"?>
<doi_batch version="4.3.0" xmlns="http://www.crossref.org/doi_resources_schema/4.3.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.crossref.org/doi_resources_schema/4.3.0 http://www.crossref.org/schema/deposit/doi_resources4.3.0.xsd">
<head>
<doi_batch_id>f8870fb0-f148-4c10-bd9f-a400bfcfe7d9</doi_batch_id>
<depositor>
<name>beie</name>
<email_address>director@blueeyesintelligence.org</email_address>
</depositor>
</head>
<body>
<doi_citations>
<doi>10.35940/ijrte.D7332.1111422</doi>
<citation_list>
<citation key="ref0"><doi>10.1109/TPAMI.2016.2587640</doi><unstructured_citation>O. Vinyals, A. Toshev, S. Bengio and D. Erhan. Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 4, pp. 652-663, 1 April 2017, doi: 10.1109/TPAMI.2016.2587640.</unstructured_citation></citation>
<citation key="ref1"><doi>10.1109/CVPRW.2016.61</doi><unstructured_citation>Kenneth Tran, Xiaodong He, Lei Zhang, Jian Sun, Cornelia Carapcea, Chris Thrasher, Chris Buehler, Chris Sienkiewicz. Rich Image Captioning in the Wild. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2016.</unstructured_citation></citation>
<citation key="ref2"><unstructured_citation>Martinez Gutierrez, Maria Fernanda. Automated Image Captioning: Exploring the Potential of Microsoft Computer Vision for English and Spanish. Université de Genève. Master, 2019. https://archive-ouverte.unige.ch/unige:132748.</unstructured_citation></citation>
<citation key="ref3"><unstructured_citation>Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan. Show and Tell: A Neural Image Caption Generator. Computer Vision and Pattern Recognition. https://arxiv.org/abs/1411.4555.</unstructured_citation></citation>
<citation key="ref4"><doi>10.1007/978-3-030-21935-2_26</doi><unstructured_citation>Hasnine, Mohammad Nehal, Flanagan, Brendan, Akcapinar, Gokhan, Ogata, Hiroaki, Mouri, Kousuke, Uosaki, Noriko. Distributed, Ambient and Pervasive Interactions (LNCS, volume 11587) (2019): 346-358. http://hdl.handle.net/2433/243253.</unstructured_citation></citation>
<citation key="ref5"><doi>10.1109/ICDIS.2018.00020</doi><unstructured_citation>F. Ahmed, M. S. Mahmud, R. Al-Fahad, S. Alam and M. Yeasin. Image Captioning for Ambient Awareness on a Sidewalk. 2018 1st International Conference on Data Intelligence and Security (ICDIS), 2018, pp. 85-91, doi: 10.1109/ICDIS.2018.00020.</unstructured_citation></citation>
<citation key="ref6"><unstructured_citation>Michalik, Samuel. Deep learning and visualization of models for image captioning and multimodal translation. Praha, 2020. Bachelor's thesis. Univerzita Karlova, Matematicko-fyzikální fakulta, Ústav formální a aplikované lingvistiky. Supervisor: Helcl, Jindřich. http://hdl.handle.net/20.500.11956/11937.</unstructured_citation></citation>
<citation key="ref7"><doi>10.1016/j.cmpb.2020.105796</doi><unstructured_citation>Alain Jungo, Olivier Scheidegger, Mauricio Reyes, Fabian Balsiger. pymia: A Python package for data handling and evaluation in deep learning-based medical image analysis. Computer Methods and Programs in Biomedicine, Volume 198, 2021. https://doi.org/10.1016/j.cmpb.2020.105796.</unstructured_citation></citation>
<citation key="ref8"><doi>10.1007/978-1-4842-3925-4_3</doi><unstructured_citation>Hajba G.L. (2018). Using Beautiful Soup. In: Website Scraping with Python. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3925-4_3.</unstructured_citation></citation>
<citation key="ref9"><doi>10.1109/IPTA50016.2020.9286602</doi><unstructured_citation>Y. Bounab, M. Oussalah and A. Ferdenache. Reconciling Image Captioning and User's Comments for Urban Tourism. 2020 Tenth International Conference on Image Processing Theory, Tools and Applications (IPTA), 2020, pp. 1-6, doi: 10.1109/IPTA50016.2020.9286602.</unstructured_citation></citation>
<citation key="ref10"><doi>10.1109/IGARSS39084.2020.9323183</doi><unstructured_citation>A. V. Potnis, R. C. Shinde and S. S. Durbha. Towards Natural Language Question Answering Over Earth Observation Linked Data Using Attention-Based Neural Machine Translation. IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium, 2020, pp. 577-580, doi: 10.1109/IGARSS39084.2020.9323183.</unstructured_citation></citation>
<citation key="ref11"><unstructured_citation>Naeha Sharif, Lyndon White, Mohammed Bennamoun, and Syed Afaq Ali Shah. NNEval: Neural Network based Evaluation Metric for Image Captioning.</unstructured_citation></citation>
<citation key="ref12"><unstructured_citation>https://openaccess.thecvf.com/content_ECCV_2018/papers/Naeha_Sharif_NNEval_Neural_Network_ECCV_2018_paper.pdf.</unstructured_citation></citation>
<citation key="ref13"><doi>10.3115/1073083.1073135</doi><unstructured_citation>Papineni, Kishore &amp; Roukos, Salim &amp; Ward, Todd &amp; Zhu, Wei Jing. BLEU: A Method for Automatic Evaluation of Machine Translation. https://doi.org/10.3115/1073083.1073135. 4236.</unstructured_citation></citation>
<citation key="ref14"><doi>10.18653/v1/2021.naacl-srw.8</doi><unstructured_citation>H Ahsan, N Bhalla, D Bhatt, K Shah. Multi-Modal Image Captioning for the Visually Impaired. arXiv preprint arXiv:2105.08106 [cs.CL], 2021.</unstructured_citation></citation>
<citation key="ref15"><doi>10.1145/3123266.3123275</doi><unstructured_citation>Fuhai Chen, Rongrong Ji, Jinsong Su, Yongjian Wu, and Yunsheng Wu. 2017. StructCap: Structured Semantic Embedding for Image Captioning. In Proceedings of the 25th ACM international conference on Multimedia (MM '17). Association for Computing Machinery, New York, NY, USA, 46-54. DOI:https://doi.org/10.1145/3123266.3123275.</unstructured_citation></citation>
<citation key="ref16"><doi>10.1109/CVPR.2019.00425</doi><unstructured_citation>Yang Feng, Lin Ma, Wei Liu, Jiebo Luo. Unsupervised Image Captioning. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 4125-4134. https://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Nguyen_Deep_Neural_Networks_2015_CVPR_paper.pdf</unstructured_citation></citation>
<citation key="ref17"><doi>10.1007/978-3-319-38791-8_11</doi><unstructured_citation>David Bermbach and Erik Wittern. 2016. Benchmarking Web API Quality. In Web Engineering, Springer International Publishing, Cham, 188-206. DOI:https://doi.org/10.1007/978-3-319-38791-8_11</unstructured_citation></citation>
<citation key="ref18"><doi>10.1007/978-1-4842-3342-9_2</doi><unstructured_citation>Del Sole A. (2018). Getting Started with the Computer Vision API. In: Microsoft Computer Vision APIs Distilled. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3342-9_2</unstructured_citation></citation>
<citation key="ref19"><doi>10.1007/978-1-4842-3342-9_1</doi><unstructured_citation>Del Sole A. (2018). Introducing Microsoft Cognitive Services. In: Microsoft Computer Vision APIs Distilled. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3342-9_1</unstructured_citation></citation>
<citation key="ref20"><doi>10.1007/978-1-4842-3342-9_3</doi><unstructured_citation>Del Sole A. (2018). Invoking the Computer Vision API from C#. In: Microsoft Computer Vision APIs Distilled. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3342-9_3</unstructured_citation></citation>
<citation key="ref21"><unstructured_citation>Altseeker - pip package for Python for automating alt text. https://github.com/ksg98/altseeker</unstructured_citation></citation>
<citation key="ref22"><unstructured_citation>Dataset created and evaluation implementation. https://github.com/ksg98/Model-Evaluatiion-with-BLEU-Confidemce-and-Latency-with-dataset-usedyes</unstructured_citation></citation>
</citation_list>
</doi_citations>
</body>
</doi_batch>
