AI-Driven RAG Chatbot: Combining Information Retrieval with Generative AI

Venkatesh S.; Dhanya K R.; Kaniska P.

doi:10.36548/jismac.2024.4.005

Open Access

https://doi.org/10.36548/jismac.2024.4.005

Vol. 6, No. 4 (2024)

Published: 06 February, 2025

Pages: 364-373

Venkatesh S.

Computer Science with Data Analytics, Dr N.G.P Arts and Science College

Dhanya K R.

Computer Science with Data Analytics, Dr N.G.P Arts and Science College

Kaniska P.

Computer Science with Data Analytics, Dr N.G.P Arts and Science College

How to Cite

S., Venkatesh, Dhanya K R., and Kaniska P. 2025. “AI-Driven RAG Chatbot: Combining Information Retrieval With Generative AI”. Journal of ISMAC 6 (4): 364-73. https://doi.org/10.36548/jismac.2024.4.005.

Keywords

Generative AI

RAG Chatbot

Embeddings

LangChain

Chroma

E-learning

Abstract

Generative AI technologies are transforming how users interact with information, allowing systems to deliver accurate responses to user queries. This research focuses on building a Retrieval-Augmented Generation (RAG) chatbot as an e-learning assistant that fetches relevant data from the PDF documents loaded into it and gives precise responses to user queries. The assistant is built specifically for the subject of “Artificial Intelligence” and answers user queries related to that subject. The system uses Flask for the backend and React for the frontend. PDFs are loaded, split into smaller sections, and processed using LangChain. Embeddings are generated with Google’s AI models and stored in Chroma, a vector database. When a user submits a query, the system searches for similar content and uses Google Gemini-1.5-Pro to generate a response based on the retrieved data. Because answers are grounded in specific retrieved content rather than the model's broad general knowledge, this approach supports high accuracy. The solution is designed to scale and is well suited to education and other knowledge-based fields. It helps students, teachers, and professionals by providing fast, reliable answers, making learning more efficient and effective.
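
The retrieve-then-generate flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it uses only the Python standard library, replaces Google's embedding models with a toy word-count vector, replaces Chroma with an in-memory list, and stops at building the prompt that would be sent to a model such as Gemini-1.5-Pro. All function and class names (`split_text`, `embed`, `InMemoryStore`, `build_prompt`) are hypothetical and chosen for illustration only.

```python
import math
import re
from collections import Counter


def split_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character windows (assumed chunking scheme)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size].strip()
        if chunk:
            chunks.append(chunk)
        if start + chunk_size >= len(text):
            break  # the last window already reached the end of the text
    return chunks


def embed(text):
    """Toy embedding: lowercase word counts. A stand-in for a learned model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class InMemoryStore:
    """Hypothetical stand-in for a vector database such as Chroma."""

    def __init__(self):
        self._items = []  # list of (chunk_text, embedding) pairs

    def add(self, chunks):
        for chunk in chunks:
            self._items.append((chunk, embed(chunk)))

    def search(self, query, k=2):
        """Return the k stored chunks most similar to the query."""
        q = embed(query)
        ranked = sorted(
            self._items, key=lambda item: cosine(q, item[1]), reverse=True
        )
        return [chunk for chunk, _ in ranked[:k]]


def build_prompt(question, context_chunks):
    """Assemble a prompt that restricts the model to the retrieved context."""
    context = "\n---\n".join(context_chunks)
    return (
        "Answer using only the context below. "
        "If the answer is not in the context, say so.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


if __name__ == "__main__":
    store = InMemoryStore()
    store.add([
        "A heuristic function estimates the cost from a node to the goal.",
        "Backpropagation adjusts neural network weights using gradients.",
        "A knowledge base stores facts and rules for logical inference.",
    ])
    question = "How does backpropagation update network weights?"
    print(build_prompt(question, store.search(question, k=1)))
```

In a real deployment, `embed` would call an embedding model and `InMemoryStore` would be a persistent vector database, but the control flow (chunk, embed, store, search by similarity, build a grounded prompt) would stay the same.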
