Hi everyone,
I’m working on a pipeline to ingest the stream of SNS generated by the Sentinel-1 open data registry, and I thought I saw something odd… multiple observations that had what appeared to be the same footprint. I checked the geojsons, and sure enough they actually are identical.
Here’s an example:
S1B_EW_GRDH_1SDH_20200330T214541_20200330T214641_020927_027B16_DD6E
and
S1B_EW_GRDH_1SDH_20200330T214541_20200330T214641_020927_027B16_CE5C
I downloaded the quick-look.png’s, and they look exactly the same, and the productinfo.json’s are also almost identical. However, they have different SciHub ingestion data:
DD6E has “sciHubId” : “a5164b21-08e6-4ca0-92ac-836fdd4c4bbf”
CE5C has “sciHubId” : “2f0705f0-06e3-4bfc-8957-c4a494c782ab”
So then I dug into SciHub, BUT I can only find one object ingested that starts “S1B_EW_GRDH_1SDH_20200330T214541*”, and that is DD6E.
I can’t find CE5C at all.
Does anyone know what’s going on, and how I can avoid these duplicates? I’m seeing roughly 50 pairs a day!