Skip to content

Automatically read the data download export description to extract all value mappingsย #663

@pt-kkraemer

Description

@pt-kkraemer

Description of the issue

When downloading the bulk data some non-integer values are stored as integers in order to reduce file size. Most of these integer-text-mappings can be found in the Katalogwerte.xml from the bulk download. But some of the mapping values are not present there, see for example #653.

Ideas of solution

We could try to download the export description PDF from the bulk download which can be found here and use that.

Workflow checklist

  • Find out if the link of the export description has a pattern or if it changes
  • Find a suitable python package to read PDF or extract tables from PDF
  • Update the values in colums_to_replace.py
  • I am aware of the workflow in CONTRIBUTING.md

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions