Note: this is crosspost of https://gis.stackexchange.com/questions/284608/difference-between-osm-regional-extracts-unexpectedly-large I want to create a difference (change file) between two Europe extracts from OSM data. The files I want to compare are:
As the files are quite large, before running on them I have tried the processing on smaller extracts:
The workflow I use is:
Or:
The process works fine with the Czech Republic data. The resulting I get strange results with the Europe extract. The resuling I have also tried doing the comparison using Osmium instead of Osmconvert, but the result was the same, the change file was huge. Am I doing something wrong, or are the Europe extracts unsuitable for the comparison for some reason? asked 30 May '18, 14:34 Ondrej Spanel |
You've likely become a casuality to GDPR-related changes on the Geofabrik download server where we've removed user information from download files. It is possible that the two files you are comparing have a different method of removing user data (one has NO user data, the other has fake user data with uid=0) and this confuses the program that computes the diffs. You could either try removing the user, uid and changeset fields from both files before you compare, or you could download the old-style, complete files from osm-internal.download.geofabrik.de. answered 30 May '18, 15:02 Frederik Ramm ♦ Given osmcompare diff does not use content, only version numbers, how can metadata removal affect this? (Cf. https://wiki.openstreetmap.org/wiki/Osmconvert#Retrieving_the_Differences_between_two_OSM_Files)
(30 May '18, 15:15)
Ondrej Spanel
Inspecting result of conversion to OSM answers this immediately: europe-180401 is completely missing
(30 May '18, 15:27)
Ondrej Spanel
Note: it seems it is only historical data is affected by this (files like europe-180401.osm.pbf). The file europe-latest.osm.pbf seems fine. I will try historical data from https://osm-internal.download.geofabrik.de/europe.html# and report the result here.
(30 May '18, 15:35)
Ondrej Spanel
I confirm data downloaded from this location contain the version id and work fine for my purpose.
(31 May '18, 09:41)
Ondrej Spanel
http://download.geofabrik.de/europe-180401.osm.pbf (MD5: 7cd103991af26a5299ccf8dd9577171f) definitely contains version numbers, I just checked.
(05 Jun '18, 15:09)
Frederik Ramm ♦
|
Have you downloaded the 20180401 file for Europe in April, or in May?
In May - just a few days ago, same as the Czech republic extract.