Code for crawling the SKOS scheme for high-value dataset categories and creating alternative output representations.
- bin/crawl_scheme.py: Start with the URI of the scheme itself, then continue by download the top-level concepts and their sub-concepts. Output a complete Turtle file.
- bin/skos2mermaid.py: Convert the Turtle file with the SKOS scheme to a mermaid graph. Specify a language code to decide which language to use for the labels.
- queries/scheme_table_combined.rq: Query the Turtle file with the SKOS scheme to generate a CSV table. Requires Apache Jena command-line tools.
- Bulgarian (bg)
- Czech (cs)
- Danish (da)
- German (de)
- Greek (el)
- English (en)
- Spanish (es)
- Estonian (et)
- Finnish (fi)
- French (fr)
- Irish (ga)
- Croatian (hr)
- Hungarian (hu)
- Italian (it)
- Lithuanian (lt)
- Latvian (lv)
- Maltese (mt)
- Dutch (nl)
- Polish (pl)
- Portuguese (pt)
- Romanian (ro)
- Slovak (sk)
- Slovenian (sl)
- Swedish (sv)
- scheme: http://data.europa.eu/bna/asd487ae75
- regulation: https://eur-lex.europa.eu/eli/reg_impl/2023/138/oj?uri=CELEX:32023R0138#enc_1
- EU Vocabularies, High-value dataset categories: https://op.europa.eu/s/z4Lv
- data.europa.eu, Annotation of high-value datasets: https://dataeuropa.gitlab.io/data-provider-manual/hvd/annotation/
All code in this repository is published under the MIT License.