Retrieval of very large numbers of items in the Web of Science: an exercise to develop accurate search strategies
Ricardo Arencibia-Jorge, Loet Leydesdorff, Zaida Chinchilla-Rodriguez,, Ronald Rousseau, Soren W. Paris

TL;DR
This paper demonstrates a method to retrieve more than 100,000 items from Web of Science by combining multiple queries and Boolean logic, addressing a common interface limitation.
Contribution
It introduces a practical approach for retrieving large datasets from Web of Science beyond its default query limit, emphasizing team collaboration for strategy development.
Findings
Successful retrieval of entire US scientific output for a specific year
Use of Boolean logic to eliminate overlaps and improve accuracy
Highlighting teamwork in developing advanced search strategies
Abstract
The current communication presents a simple exercise with the aim of solving a singular problem: the retrieval of extremely large amounts of items in the Web of Science interface. As it is known, Web of Science interface allows a user to obtain at most 100,000 items from a single query. But what about queries that achieve a result of more than 100,000 items? The exercise developed one possible way to achieve this objective. The case study is the retrieval of the entire scientific production from the United States in a specific year. Different sections of items were retrieved using the field Source of the database. Then, a simple Boolean statement was created with the aim of eliminating overlapping and to improve the accuracy of the search strategy. The importance of team work in the development of advanced search strategies was noted.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsscientometrics and bibliometrics research · Biomedical Text Mining and Ontologies · Genetics, Bioinformatics, and Biomedical Research
