I'm using Nominatim to reverse-geocode natural language location descriptions for a research project. I spent some time looking through the source code (in particular, website/search.php), but I can't seem to make heads or tails of how the "importance" score is calculated. From what I can tell, there is some baseline calculation and then numerous tweaks - one line, for example, says
I also noticed in the documentation that Nominatim will use Wikipedia to improve the ranking of results, but once again nothing specific beyond "the importance value is calculated as log(totalcount)/log(max totalcount)." I assume that "totalcount" is the number of internal links to an article about a specific location in the result set, and "max totalcount" is the maximum of that value across the entire result set. But this only tells me the scoring contribution from Wikipedia, and not how the baseline score is calculated. My question is, what properties of the OSM data go into the calculation, and then how is the importance score actually calculated? What special tweaks and thresholds should I be aware of? asked 03 Aug '13, 16:36 aweissman |
For in-depth technical discussion of nominatim, you'd be better off asking on the geocoding mailing list. answered 03 Aug '13, 18:01 Andy Allan |
From the mailing list: https://lists.openstreetmap.org/pipermail/geocoding/2013-August/000916.html
answered 04 Dec '18, 14:10 Potdeyaourt |