Expedient Information Retrieval System for Web Pages Using the Natural Language Modeling
PDF

Keywords

Natural Language Modelling
Information Retrieval
LSA-Latent Semantic Analysis
Precision
Recall F-Score

How to Cite

Joby, P. P. 2020. “Expedient Information Retrieval System for Web Pages Using the Natural Language Modeling”. Journal of Artificial Intelligence and Capsule Networks 2 (2): 100-110. https://doi.org/10.36548/jaicn.2020.2.003.

Abstract

Retrieving of information from the huge set of data flowing due to the day to day development in the technologies has become more popular as it assists in searching for the valuable information in a structured, unstructured or a semi structured data set like text, database, multimedia, documents, and internet etc. The retrieval of information is performed employing any one of the models starting from the simple Boolean model for retrieving information, or using other frame works such as probabilistic, vector space and the natural language modelling. The paper is emphasis on using a natural language model based information retrieval to recover the meaning insights from the enormous amount of data. The method proposed in the paper uses the latent semantic analysis to retrieve significant information's from the question raised by the user or the bulk documents. The carried out method utilizes the fundamentals of semantic factor occurring in the data set to identify the useful insights. The experiment analysis of the proposed method is carried out with few state of art dataset such as TIME, LISA, CACM and the NPL etc. and the results obtained demonstrate the superiority of the method proposed in terms of precision, recall and F-score.

PDF

References

Miller, David RH, Tim Leek, and Richard M. Schwartz. "A hidden Markov model information retrieval system." In Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, pp. 214-221. 1999.

Fernandez, Eduardo B., and Xiaohong Yuan. "Semantic analysis patterns." In International Conference on Conceptual Modeling, pp. 183-195. Springer, Berlin, Heidelberg, 2000.

Rosenfeld, Ronald. "Two decades of statistical language modeling: Where do we go from here?." Proceedings of the IEEE 88, no. 8 (2000): 1270-1278.

Zhai, Jun, Yan Cao, and Yan Chen. "Semantic information retrieval based on fuzzy ontology for intelligent transportation systems." In 2008 IEEE International Conference on Systems, Man and Cybernetics, pp. 2321-2326. IEEE, 2008.

Thomo, Alex. "Latent semantic analysis (Tutorial)." Victoria, Canda (2009): 1-7.

Minnie, D., and S. Srinivasan. "Meta search engine with an intelligent interface for information retrieval on multiple domains." International Journal of Computer Science, Engineering and Information Technology (IJCSEIT) 1, no. 4 (2011): 37-45.

Dhingra, Vandana, and Komal Kumar Bhatia. "Towards Intelligent Information Retrieval on Web." International Journal on Computer Science and Engineering 3, no. 4 (2011): 1721-1726.

Dubey, Hema, and B. N. Roy. "An improved page rank algorithm based on optimized normalization technique." (2011).

Weston, Jason, Chong Wang, Ron Weiss, and Adam Berenzweig. "Latent collaborative retrieval." arXiv preprint arXiv:1206.4603 (2012).

Arora, Monika, Uma Kanjilal, and Dinesh Varshney. "Efficient and intelligent information retrieval using support vector machine (SVM)." Int. J. Soft Comput. Eng.(IJSCE) 1, no. 6 (2012): 39-43.

Babekr, Salah T., Khaled M. Fouad, and Naveed Arshad. "Personalized semantic retrieval and summarization of web based documents." International Journal of Advanced Computer Science and Applications 4, no. 1 (2013).

Pandian, A. Pasumpon, and S. Smys. "Effective Fragmentation Minimization by Cloud Enabled Back Up Storage." Journal of Ubiquitous Computing and Communication Technologies (UCCT) 2, no. 01 (2020): 1-9.

Weber, Ann M., Marta Rubio-Codina, Susan P. Walker, Stef van Buuren, Iris Eekhout, Sally M. Grantham-McGregor, Maria Caridad Araujo et al. "The D-score: a metric for interpreting the early development of infants and toddlers across global settings." BMJ global health 4, no. 6 (2019).

Jacob, I. Jeena. "Performance Evaluation of Caps-Net Based Multitask Learning Architecture for Text Classification." Journal of Artificial Intelligence 2, no. 01 (2020): 1-10.

Manoharan, Samuel. "A Smart Image Processing Algorithm for Text Recognition Information Extraction and Vocalization for the Visually Challenged." Journal of Innovative Image Processing (JIIP) 1, no. 01 (2019): 31-38.

Bindhu, V. "Biomedical Image Analysis using Semantic Segmentation." Journal of Innovative Image Processing (JIIP) 1, no. 02 (2019): 91-101.