Design of an Improved Model for Information Retrieval Using BERT and Weighted User Clicks

Authors

  • R.D.Bhoyar, Dr.D.N.Satange

Keywords:

Information Retrieval, Web Content Mining, BERT, Weighted Matching, NLP

Abstract

The need for enhancing information retrieval systems has become critical with the exponential growth of web content, which presents significant challenges in terms of data heterogeneity and user query satisfaction. Existing retrieval methods often suffer from limitations such as inadequate handling of unstructured data and suboptimal ranking of search results, leading to lower precision, accuracy, and recall.To address these challenges, we propose a novel framework for information retrieval that leverages web content mining and advanced natural language processing (NLP) techniques. The framework begins by preprocessing the input query to remove stop words using NLP, thereby refining the query for better relevance. We transform the unstructured web content into a structured format by systematically storing web content and user click data samples. This structured data serves as the foundation for our reranking mechanism.Our framework utilizes Bidirectional Encoder Representations from Transformers (BERT) to match web content effectively.

References

A. Jabbar, S. Iqbal, M. I. Tamimy, A. Rehman, S. A. Bahaj and T. Saba, "An Analytical Analysis of Text Stemming Methodologies in Information Retrieval and Natural Language Processing Systems," in IEEE Access, vol. 11, pp. 133681-133702, 2023, doi: 10.1109/ACCESS.2023.3332710.

keywords: {Natural language processing;Vocabulary;Information retrieval;Linguistics;Text categorization;Sentiment analysis;Tokenization;Text

stemming;information retrieval (IR) systems;text classification;stemmer evaluation;technological development;natural language processing (NLP)},

D. Wang, L. Liu and Y. Liu, "Normalized Storage Model Construction and Query Optimization of Book Multi-Source Heterogeneous Massive Data," in IEEE Access, vol. 11, pp. 96543-96553, 2023, doi: 10.1109/ACCESS.2023.3301134.

Downloads

Published

2024-12-17

How to Cite

R.D.Bhoyar, Dr.D.N.Satange. (2024). Design of an Improved Model for Information Retrieval Using BERT and Weighted User Clicks . Journal of Computational Analysis and Applications (JoCAAA), 33(08), 1124–1132. Retrieved from https://eudoxuspress.com/index.php/pub/article/view/1591

Issue

Section

Articles

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.