[1] |
Green Jr, Bert F and Wolf, Alice K and Chomsky, Carol and Laughery, Kenneth. Baseball: an automatic question-answerer. Papers presented at the May 9-11, 1961, western joint IRE-AIEE-ACM computer conference. https://doi.org/10.1145/1460690.1460714. 219--224, 1961. [DOI ] |
[2] |
Woods, William A. Progress in natural language understanding: an application to lunar geology. Proceedings of the June 4-8, 1973, national computer conference and exposition. https://doi.org/10.1145/1499586.1499695. 441--450, 1973. [DOI ] |
[3] |
Voorhees, Ellen M. The TREC question answering track. Natural Language Engineering. 7(4): 361--378, 2001. [DOI ] |
[4] |
Zhu, Fengbin and Lei, Wenqiang and Wang, Chao and Zheng, Jianming and Poria, Soujanya and Chua, Tat-Seng. Retrieving and reading: A comprehensive survey on open-domain question answering. arXiv preprint arXiv:2101.00774. https://arxiv.org/abs/2101.00774. 2021. [DOI ] |
[5] |
Rajpurkar, Pranav and Zhang, Jian and Lopyrev, Konstantin and Liang, Percy. SQuAD: 100,000+ questions for machine comprehension of text. arXiv preprint arXiv:1606.05250. 2016. [DOI ] |
[6] |
Rajpurkar, Pranav and Jia, Robin and Liang, Percy. Know what you don't know: Unanswerable questions for SQuAD. arXiv preprint arXiv:1806.03822. 2018. [DOI ] |
[7] |
Kazemi, Arefeh and Mozafari, Jamshid and Nematbakhsh, Mohammad Ali. PersianQAD: The native question answering dataset for the Persian language. IEEE Access. 10: 26045--26057, 2022. [DOI ] |
[8] |
Abadani, Negin and Mozafari, Jamshid and Fatemi, Afsaneh and Nematbakhsh, Mohammad Ali and Kazemi, Arefeh. ParSQuAD: Machine translated SQuAD dataset for Persian question answering. 2021 7th International Conference on Web Research (ICWR). 163--168, 2021. [DOI ] |
[9] |
Kazemi, Arefeh and Zojaji, Zahra and Malverdi, Mahdi and Mozafari, Jamshid and Ebrahimi, Fatemeh and Abadani, Negin and Varasteh, Mohammad Reza and Nematbakhsh, Mohammad Ali. FarsNewsQA: A deep learning-based question answering system for the Persian news articles. Information Retrieval Journal. 26(1): 3, 2023. [DOI ] |
[10] |
Mozafari, Jamshid and Kazemi, Arefeh and Moradi, Parham and Nematbakhsh, Mohammad Ali. PerAnSel: A novel deep neural network-based system for Persian question answering. Computational Intelligence and Neuroscience. 2022(1): 3661286, 2022. [DOI ] |
[11] |
Nguyen, Tri and Rosenberg, Mir and Song, Xia and Gao, Jianfeng and Tiwary, Saurabh and Majumder, Rangan and Deng, Li. MS MARCO: A human-generated machine reading comprehension dataset. arXiv preprint arXiv:1611.09268. https://doi.org/10.48550/arXiv.1611.09268. 2016. [DOI ] |
[12] |
Yang, Yi and Yih, Wen-tau and Meek, Christopher. WikiQA: A challenge dataset for open-domain question answering. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 2013--2018, 2015. [DOI ] |
[13] |
Kwiatkowski, Tom and Palomaki, Jennimaria and Redfield, Olivia and Collins, Michael and Parikh, Ankur and Alberti, Chris and Epstein, Danielle and Polosukhin, Illia and Devlin, Jacob and Lee, Kenton and others. Natural Questions: A benchmark for question answering research. Transactions of the Association for Computational Linguistics. 7: 453--466, 2019. [DOI ] |
[14] |
Yamada, Ikuya and Asai, Akari and Shindo, Hiroyuki and Takeda, Hideaki and Matsumoto, Yuji. LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention. arXiv preprint arXiv:2010.01057. https://arxiv.org/abs/2010.01057. 2020. [DOI ] |
[15] |
Yang, Zhilin and Dai, Zihang and Yang, Yiming and Carbonell, Jaime and Salakhutdinov, Ruslan and Le, Quoc V. XLNet: Generalized Autoregressive Pretraining for Language Understanding. arXiv preprint arXiv:1906.08237. https://arxiv.org/abs/1906.08237. 2019. [DOI ] |
[16] |
Joshi, Mandar and Chen, Danqi and Liu, Yinhan and Weld, Daniel S. and Zettlemoyer, Luke and Levy, Omer. SpanBERT: Improving Pre-training by Representing and Predicting Spans. arXiv preprint arXiv:1907.10529. https://arxiv.org/abs/1907.10529. 2019. [DOI ] |
[17] |
Trischler, Adam and Wang, Tong and Yuan, Xingdi and Harris, Justin and Sordoni, Alessandro and Bachman, Philip and Suleman, Kaheer. NewsQA: A machine comprehension dataset. arXiv preprint arXiv:1611.09830. 2016. [DOI ] |
[18] |
Omar, Reham and Mangukiya, Omij and Kalnis, Panos and Mansour, Essam. ChatGPT versus traditional question answering for knowledge graphs: Current status and future directions towards knowledge graph chatbots. arXiv preprint arXiv:2302.06466v1. https://arxiv.org/abs/2302.06466v1. 2023. [DOI ] |
[19] |
Tan, Yiming and Min, Dehai and Li, Yu and Li, Wenbo and Hu, Nan and Chen, Yongrui and Qi, Guilin. Evaluation of ChatGPT as a question answering system for answering complex questions. arXiv preprint arXiv:2303.07992. 2023. [DOI ] |
[20] |
Guo, Biyang and Zhang, Xin and Wang, Ziyuan and Jiang, Minqi and Nie, Jinran and Ding, Yuxuan and Yue, Jianwei and Wu, Yupeng. How close is ChatGPT to human experts? Comparison corpus, evaluation, and detection. arXiv preprint arXiv:2301.07597. 2023. [DOI ] |
[21] |
Pichappan, Pit and Krishnamurthy, M and Vijayakumar, P. Analysis of ChatGPT as a question-answering tool. Journal of Digital Information Management. 21(2): 50--60, 2023. [DOI ] |
[22] |
Zhang, Zhuosheng and Yang, Junjie and Zhao, Hai. Retrospective reader for machine reading comprehension. arXiv preprint arXiv:2001.09694. 2020. [DOI ] |
[23] |
Lan, Zhenzhong and Chen, Mingda and Goodman, Sebastian and Gimpel, Kevin and Sharma, Piyush and Soricut, Radu. ALBERT: A lite BERT for self-supervised learning of language representations. International Conference on Learning Representations (ICLR). https://doi.org/10.48550/arXiv.1909.11942. 2020. [DOI ] |
[24] |
Siriwardhana, Shamane and Weerasekera, Rivindu and Wen, Elliott and Kaluarachchi, Tharindu and Rana, Rajib and Nanayakkara, Suranga. Improving the domain adaptation of retrieval augmented generation (RAG) models for open domain question answering. arXiv preprint arXiv:2210.02627. 2022. [DOI ] |
[25] |
Back, Seohyun and Kedia, Akhil and Chinthakindi, Sai Chetan and Lee, Haejun and Choo, Jaegul. Learning to generate questions by learning to recover answer-containing sentences. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. 1516--1529, 2021. [DOI ] |
[26] |
Tay, Yi and Tuan, Luu Anh and Hui, Siu Cheung and Su, Jian. Densely Connected Attention Propagation for Reading Comprehension. Proceedings of the 32nd International Conference on Neural Information Processing Systems. 2018. [DOI ] |
[27] |
Kundu, Souvik and Ng, Hwee Tou. A question-focused multi-factor attention network for question answering. Department of Computer Science, National University of Singapore. 2020. [DOI ] |
[28] |
Bang, Yejin and Cahyawijaya, Samuel and Lee, Nayeon and Dai, Wenliang and Su, Dan and Wilie, Bryan and Lovenia, Holy and Ji, Ziwei and Yu, Tiezheng and Chung, Willy and Do, Quyet V. and Xu, Yan and Fung, Pascale. A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity. Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers). https://aclanthology.org/2023.ijcnlp-main.45/. 675--718, Association for Computational Linguistics. 2023. [DOI ] |
[29] |
Qin, Chengwei and Zhang, Aston and Zhang, Zhuosheng and Chen, Jiaao and Yasunaga, Michihiro and Yang, Diyi. Is ChatGPT a General-Purpose Natural Language Processing Task Solver? arXiv preprint arXiv:2302.06476. https://arxiv.org/abs/2302.06476. 2023. [DOI ] |
[30] |
Frieder, Simon and Pinchetti, Luca and Chevalier, Alexis and Griffiths, Ryan-Rhys and Salvatori, Tommaso and Lukasiewicz, Thomas and Petersen, Philipp Christian and Berner, Julius. Mathematical Capabilities of ChatGPT. arXiv preprint arXiv:2301.13867. https://arxiv.org/abs/2301.13867. NeurIPS 2023 Datasets and Benchmarks. 2023. [DOI ] |
[31] |
Zhong, Qihuang and Ding, Liang and Liu, Juhua and Du, Bo and Tao, Dacheng. Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT. arXiv preprint arXiv:2302.10198. https://arxiv.org/abs/2302.10198. 2023. [DOI ] |
[32] |
Sobania, Dominik and Briesch, Martin and Hanna, Carol and Petke, Justyna. An Analysis of the Automatic Bug Fixing Performance of ChatGPT. arXiv preprint arXiv:2301.08653. https://doi.org/10.48550/arXiv.2301.08653. 2023. |
[33] |
Shafiq Surameery, Nigar M. and Shakor, Mohammed Y. Use Chat GPT to Solve Programming Bugs. International Journal of Information Technology and Computer Science (IJITC). https://doi.org/10.55529/ijitc.31.17.22. 3(1): 17--22, 2023. [DOI ] |
[34] |
Haque, Md. Asraful and Li, Shuai. The Potential Use of ChatGPT for Debugging and Bug Fixing. AI and Robotics. https://doi.org/10.4108/airo.v2i1.3276. 2(1), 2023. [DOI ] |
[35] |
Jiao, Wenxiang and Wang, Wenxuan and Huang, Jen-Tse and Tu, Zhaopeng. Is ChatGPT A Good Translator? A Preliminary Study. arXiv preprint arXiv:2301.08745. https://doi.org/10.48550/arXiv.2301.08745. 2023. [DOI ] |
[36] |
Borji, Ali. A Categorical Archive of ChatGPT Failures. arXiv preprint arXiv:2302.03494. https://arxiv.org/abs/2302.03494. 2023. [DOI ] |
[37] |
Ma, Wei and Liu, Shangqing and Wang, Wenhan and Hu, Qiang and Zhang, Cen and Liu, Ye and Nie, Liming and Liu, Yang. ChatGPT: Understanding Code Syntax and Semantics. arXiv preprint arXiv:2305.12138. https://arxiv.org/abs/2305.12138. 2023. [DOI ] |
[38] |
Zhang, Haopeng and Liu, Xiao and Zhang, Jiawei. SummIt: Iterative Text Summarization via ChatGPT. Findings of the Association for Computational Linguistics: EMNLP 2023. https://aclanthology.org/2023.findings-emnlp.714/. 10644--10657, Association for Computational Linguistics. 2023. [DOI ] |
[39] |
Wang, Jindong and Hu, Xixu and Hou, Wenxin and Chen, Hao and Zheng, Runkai and Wang, Yidong and Yang, Linyi and Huang, Haojun and Ye, Wei and Geng, Xiubo and Jiao, Binxing and Zhang, Yue and Xie, Xing. On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective. arXiv preprint arXiv:2302.12095. https://arxiv.org/abs/2302.12095. 2023. [DOI ] |
[40] |
Kocoń, Jan and Cichecki, Igor and Kaszyca, Oliwier and Kochanek, Mateusz and Szydło, Dominika and Baran, Joanna and Bielaniewicz, Julita and Gruza, Marcin and Janz, Arkadiusz and Kanclerz, Kamil and Kocoń, Anna and Koptyra, Bartłomiej and Mieleszczenko-Kowszewicz, Wiktoria and Miłkowski, Piotr and Oleksy, Marcin and Piasecki, Maciej and Radliński, Łukasz and Wojtasik, Konrad and Woźniak, Stanisław and Kazienko, Przemysław. ChatGPT: Jack of all trades, master of none. arXiv preprint arXiv:2302.10724. https://arxiv.org/abs/2302.10724. 2023. [DOI ] |
[41] |
Gao, Yuan and Wang, Ruili and Hou, Feng. How to Design Translation Prompts for ChatGPT: An Empirical Study. arXiv preprint arXiv:2304.02182. https://arxiv.org/abs/2304.02182. 2023. [DOI ] |
[42] |
Gao, Mingqi and Ruan, Jie and Sun, Renliang and Yin, Xunjian and Yang, Shiping and Wan, Xiaojun. Human-like summarization evaluation with ChatGPT. arXiv preprint arXiv:2304.02554. 2023. [DOI ] |
[43] |
Li, Bo and Fang, Gexiang and Yang, Yang and Wang, Quansen and Ye, Wei and Zhao, Wen and Zhang, Shikun. Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness. arXiv preprint arXiv:2304.11633. https://arxiv.org/abs/2304.11633. 2023. [DOI ] |
[44] |
Carrino, Casimiro Pio and Costa-jussà, Marta R. and Fonollosa, José A. R. Automatic Spanish Translation of the SQuAD Dataset for Multilingual Question Answering. arXiv preprint arXiv:1912.05200. https://arxiv.org/abs/1912.05200. 2019. [DOI ] |
[45] |
Mozannar, Hussein and Maamary, Elie and El Hajal, Karl and Hajj, Hazem. Neural Arabic Question Answering. Proceedings of the Fourth Arabic Natural Language Processing Workshop. https://aclanthology.org/W19-4612/. 108--118, Association for Computational Linguistics. 2019. [DOI ] |
[46] |
Lee, Kyungjae and Yoon, Kyoungho and Park, Sunghyun and Hwang, Seung-won. Semi-supervised Training Data Generation for Multilingual Question Answering. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). https://aclanthology.org/L18-1437/. European Language Resources Association (ELRA). 2018. |
[47] |
Croce, Danilo and Zelenanska, Alexandra and Basili, Roberto. Neural Learning for Question Answering in Italian. AI*IA 2018 – Advances in Artificial Intelligence. 389--402, Springer International Publishing. 2018. [DOI ] |
[48] |
Efimov, Pavel and Chertok, Andrey and Boytsov, Leonid and Braslavski, Pavel. SberQuAD -- Russian Reading Comprehension Dataset. Experimental IR Meets Multilinguality, Multimodality, and Interaction. Springer. 2020. [DOI ] |
[49] |
Shao, Yiming and Wang, Dong and Li, Jing and others. A Large-Scale Chinese Machine Reading Comprehension Dataset and Evaluation Platform. Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018). 443--453, 2018. [DOI ] |
[50] |
Lim, Seunghyun and Lee, Jinhyuk and Kang, Jaewoo. KorQuAD: Korean Question Answering Dataset. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 4689--4694, 2019. [DOI ] |
[51] |
Keraronb, Antoine and others. FQuAD: French Question Answering Dataset. Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). 2020. [DOI ] |
[52] |
Veisi, Hamed and Shandi, Hossein. Development of a Persian Medical Question Answering System for Diseases and Drugs. Journal of Biomedical Informatics. 108: 103493, Elsevier. 2020. [DOI ] |
[53] |
Boreshban, Yasaman and Yousefinasab, Hadi and Mirroshandel, Seyed Ali. Providing a Religious Corpus of Question Answering System in Persian. Journal of Statistical and Data Science. https://jsdp.rcisp.ac.ir/article-1-535-en.html. 6(1): 1--15, 2018. [DOI ] |