Skip to content

Download vs Data explorer preview differences in record counts #230

@Uchechukwu-Onye-Igbo

Description

@Uchechukwu-Onye-Igbo

Hello,

We had an external user point out the following difference in record counts which is usually only a difference of 100k or so:

https://opendata.nhsbsa.net/dataset/english-prescribing-data-epd/resource/6fe3818a-bff7-43d6-8ce5-655c3104f262 (EPD WITHOUT SNOMED)

Image

https://opendata.nhsbsa.net/dataset/english-prescribing-dataset-epd-with-snomed-code/resource/36c67aba-0ba4-402e-9182-f30a8cd9b138 (EPD with SNOMED)

Image

I discussed this with our DW team and they pointed out that when downloaded the EPD with SNOMED file had 18,029,396 rows. I therefore checked the datastore page and this was showing as upload to datastore completed without issue two years ago – I clicked upload to datastore again just to see if this would make a difference and now this is showing:

Image

I know we have checked for issues with pending upload to datastore files but how are we meant to identify instances wherein the above is happening?

Downloads are giving different values than compared to the data explorer preview – this again would need to be identified, rectified and communicated out.

Thank you so much,

Metadata

Metadata

Labels

Bugs & MaintenanceSomething isn't working as expected, we are getting it fixed.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions