How are search results ranked?

ElasticSearch examines multiple fields (i.e., title, keywords, and full OCR text) and assigns a relevancy score based on:

The number of times a term appears in the field
The length of the field
How often that term appears

If a field is short, like the title, then a term appearing in that field is weighted higher.

If a field is very long, and a term appears many times in the field, and many times over all of the texts in the corpus, then that term is weighted lower. This helps give a lower weight to words like a, and, or the.

If a field is very long, and a term appears infrequently across all of the texts in the corpus, then it’s ranked higher. This helps give a stronger weight to words like hippopotamus or giraffe.

All three of these factors are combined to produce a score for each field in the document, and then the scores for each field are combined to assign a score to the entire document. On top of this, we can force certain fields to be given a greater “weight” than others.

Fields with a higher “weight” have more of an affect on the score than fields with a lower “weight”. The final document score is used to rank the document in the search results.

Nitty gritty details: https://www.elastic.co/guide/en/elasticsearch/guide/current/scoring-theory.html

Tags: user interface, browse

Permalink

Empowering Global Research

BHL supports research across the globe and in a variety of disciplines. Explore more BHL user testimonials.

Having access to this literature through BHL is a treasure. Being able to show the students the original publications...speaks volumes.

Dr. Tracey Hunter-Doniger

College of Charleston

BHL is a wonderful resource. I use a lot of old and obscure resources in my line of work, and BHL makes getting access to these sources a lot easier.

Dr. Paul D. Brinkman

North Carolina Museum of Natural Sciences

As a free, mobile archive for natural history literature, BHL is ideal for 21st century research, which can happen on the field, in a museum, or at a coffee shop, as long as there’s internet connectivity.

Dr. Nicholas Pyenson

Smithsonian National Museum of Natural History

The Biodiversity Heritage Library is an amazing resource for visual artists! Any artist interested in learning about natural history and science would consider these rare resources invaluable.

Emily Williams, MFA

Troy University

BHL is doing a wonderful service for researchers like me, who work with limited resources in developing countries like India. BHL has had a big, positive impact on my research.

Dr. Varad B. Giri

National Centre for Biological Sciences

BHL is an incredible resource. It provides access to material that is otherwise hard to get and enables me to undertake detailed searches of these sources.

Dr. Karen Sayer

Leeds Trinity University