Commit 4365c16

Finland census data (#1790)
* adding finland census data
* refreshed pvmap and output files
* removing schema file
* adding readme file
* reverting previous PR1768
* adding Finland Census data for Google Review
* renaming files as per standard
* renaming files as per standard
* resolving comments
* resolving comments
* adding manifest.json
* changed values of the property placeOfBirth
* changed values of the property placeOfBirth
* updated Readme file
1 parent b9fcee7 commit 4365c16

11 files changed

Lines changed: 455 additions & 588 deletions
Lines changed: 27 additions & 5 deletions
@@ -1,18 +1,40 @@
 # Finland Demographics Dataset
-
 ## Overview
 
-This dataset contains demographic information from Finland sourced from Statistics Finland (Tilastokeskus). The data includes population statistics and census information from the official Finnish statistical database.
+This dataset contains demographic information from Finland sourced from Statistics Finland (Tilastokeskus). This dataset provides comprehensive longitudinal coverage of Finland’s national demographics over a 34-year span, featuring annual data from 1990 to 2024. The geographic scope is standardized according to the January 1, 2025 regional division, ensuring consistency across the time series despite historical administrative changes. It offers high-resolution granularity through 43 unique statistical metrics that encompass population growth, age distribution, linguistic diversity, religious affiliation and urban-rural classification. Data is reported in multiple units for versatile analysis, including absolute counts, percentages (%), and population density (persons/km²), allowing for both scale-based and proportional statistical modeling.
+
+## Data Acquisition
+
+To download the latest version of this data or refresh the dataset for new years:
 
-## Data Source
+1. Navigate to the source: https://pxdata.stat.fi/PxWeb/pxweb/en/StatFin/StatFin__vaerak/statfin_vaerak_pxt_11ra.px/table/tableViewLayout1/
 
-**Source URL:** https://pxdata.stat.fi/PxWeb/pxweb/en/StatFin/StatFin__vaerak/statfin_vaerak_pxt_11ra.px/table/tableViewLayout1/
+2. Selection Criteria:
+- Area: Select WHOLE COUNTRY (or all individual municipalities for more granularity).
+- Information: Select all variables (Population, Language, Religion, Urban/Rural, etc.).
+- Year: Select the full range from 1990 to the most recent year.
 
-The data comes from Finland's official statistical authority and includes comprehensive demographic variables such as population counts, age distributions, and other census-related metrics.
+3. Show and Save table Format: Choose "CSV (comma delimited)".
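
Review note on the acquisition steps: the year selection in step 2 could be confirmed after the download with a small coverage check. The sketch below is illustrative only. It assumes the exported CSV has a column literally named `Year`, which may not hold because the real PxWeb layout depends on the table view chosen. The `missing_years` helper and the sample rows are hypothetical and are not part of this commit.

```python
import csv
import io


def missing_years(csv_text, first_year, last_year, year_column="Year"):
    """Return the years in [first_year, last_year] that never appear in the CSV.

    Assumption: each data row carries its year in `year_column`.
    Adjust the column name to match the layout actually exported.
    """
    present = set()
    for row in csv.DictReader(io.StringIO(csv_text)):
        cell = (row.get(year_column) or "").strip()
        if cell.isdigit():
            present.add(int(cell))
    return [y for y in range(first_year, last_year + 1) if y not in present]


# Made-up sample rows for illustration; the values are not real statistics.
SAMPLE = """Area,Year,Population 31 Dec
WHOLE COUNTRY,1990,100
WHOLE COUNTRY,1991,110
WHOLE COUNTRY,1993,130
"""

print(missing_years(SAMPLE, 1990, 1993))  # prints [1992]
```

An empty list would suggest that the full range from 1990 onward was selected before saving the table.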
 
 ## Processing Instructions
 
 To process the Finland Census data and generate statistical variables, use the following command from the "data" directory:
 
 ```bash
 python ./data/tools/statvar_importer/stat_var_processor.py --input_data="./test_data/Finland_Census_input.csv" --pv_map=Finland_Census_pvmap.csv --config_file=Finland_Census_metadata.csv --output_path=Finland_Census_output
+
+## Data Refresh & Maintenance
+
+When Statistics Finland releases new annual updates (typically in the Spring), follow these steps:
+
+1. Execute the Data Acquisition steps to get the latest CSV.
+
+2. Check if new demographic categories were added. Update finland_census_pvmap.csv if new labels appear in the source.
+
+3. The source uses . to represent unavailable data (e.g., Economic dependency ratio for the current year). The processor is configured to skip these entries during import.
+
+4. Because the source applies the 2025 regional division retrospectively, check for municipal merger updates if downloading municipality-level data.
+
+5. Ensure total population counts match the previous year's trend to verify no rows were dropped.
+
+6. Execute the data processing step.

statvar_imports/finland_census/Finland_Census_metadata.csv renamed to statvar_imports/finland_census/finland_census_metadata.csv

Lines changed: 2 additions & 1 deletion
@@ -4,4 +4,5 @@ place_within,country/FIN
 mapped_rows,3
 mapped_columns,2
 word_delimiter,""""""
-output_columns,"observationDate,value,unit,observationAbout,variableMeasured"
+output_columns,"observationDate,value,unit,observationAbout,variableMeasured"
+

statvar_imports/finland_census/Finland_Census_pvmap.csv renamed to statvar_imports/finland_census/finland_census_pvmap.csv

Lines changed: 5 additions & 5 deletions
@@ -1,5 +1,5 @@
 key,p1,v1,p2,v2,p3,v3,p4,v4,p5,v5,p6,v6,p7,v7
-Population 31 Dec,,,measuredProperty,count,populationType,Person,,,,,,,,
+Population 31 Dec,,,measuredProperty,count,populationType,Person,,,,,,,,,,,
 Increase of population,,,measuredProperty,incrementalCount,populationType,Person,,,,,,,,
 "Increase of population, %",statType,growthRate,measuredProperty,count,populationType,Person,unit,Percent,,,,,,
 "Share of persons aged under 15, %",,,measuredProperty,count,populationType,Person,age,[- 15 Years],unit,Percent,measurementDenominator,Count_Person,,
@@ -29,9 +29,9 @@ Persons with foreign background,statType,measuredValue,measuredProperty,count,or
 "Land area, km²",statType,measuredValue,measuredProperty,area,,,populationType,LandCover,unit,SquareKilometer,,,,
 Population density,statType,measuredValue,measuredProperty,count,,,populationType,Person,unit,PersonPerSquareKilometer,measurementDenominator,area,,
 Persons born in the area belonging to the population,statType,measuredValue,measuredProperty,count,placeOfBirth,country/FIN,populationType,Person,,,,,,
-Persons born in their area of residence,statType,measuredValue,measuredProperty,count,placeOfBirth,BornInStateOfResidence,populationType,Person,,,,,,
-Persons born elsewhere than in their area of residence,statType,measuredValue,measuredProperty,count,placeOfBirth,NotBornInStateOfResidence,populationType,Person,,,,,,
-"Share of persons born in the area of residence, %",statType,measuredValue,measuredProperty,count,placeOfBirth,BornInStateOfResidence,populationType,Person,unit,Percent,measurementDenominator,Count_Person,,
+Persons born in their area of residence,statType,measuredValue,measuredProperty,count,placeOfBirth,BornInAreaOfResidence,populationType,Person,,,,,,
+Persons born elsewhere than in their area of residence,statType,measuredValue,measuredProperty,count,placeOfBirth,BornOutsideAreaOfResidence,populationType,Person,,,,,,
+"Share of persons born in the area of residence, %",statType,measuredValue,measuredProperty,count,placeOfBirth,BornInAreaOfResidence,populationType,Person,unit,Percent,measurementDenominator,Count_Person,,
 "Share of persons living in the area of birth, %",statType,measuredValue,measuredProperty,count,placeOfResidenceClassification,AreaOfBirth,populationType,Person,unit,Percent,measurementDenominator,Count_Person,,
 "Share of persons in urban areas, %",statType,measuredValue,measuredProperty,count,placeOfResidenceClassification,Urban,populationType,Person,unit,Percent,measurementDenominator,Count_Person,,
 "Share of persons in inner urban area, %",statType,measuredValue,measuredProperty,count,placeOfResidenceClassification,InnerUrban,populationType,Person,unit,Percent,measurementDenominator,Count_Person,,
@@ -433,4 +433,4 @@ MK16 Central Ostrobothnia,observationAbout,nuts/FI1D5,#Header,observationAbout,,
 MK17 North Ostrobothnia,observationAbout,nuts/FI1D6,#Header,observationAbout,,,,,,,,,,
 MK18 Kainuu,observationAbout,nuts/FI1D4,#Header,observationAbout,,,,,,,,,,
 MK19 Lapland,observationAbout,nuts/FI1D7,#Header,observationAbout,,,,,,,,,,
-MA2 ÅLAND,observationAbout,nuts/FI2,#Header,observationAbout,,,,,,,,,,
+MA2 ÅLAND,observationAbout,nuts/FI2,#Header,observationAbout,,,,,,,,,,
Lines changed: 25 additions & 0 deletions
@@ -0,0 +1,25 @@
+{
+  "import_specifications": [
+    {
+      "import_name": "Finland_Census",
+      "curator_emails": [
+        "support@datacommons.org"
+      ],
+      "provenance_url": "https://pxdata.stat.fi/PxWeb/pxweb/en/StatFin/StatFin__vaerak/statfin_vaerak_pxt_11ra.px/table/tableViewLayout1/",
+      "provenance_description": "Population data for Finland",
+      "scripts": [
+        "../../tools/statvar_importer/stat_var_processor.py --input_data=gs://unresolved_mcf/country/finland/input_files/*.csv --pv_map=finland_census_pvmap.csv --config_file=finland_census_metadata.csv --output_path=output/finland_census_output --existing_statvar_mcf=gs://unresolved_mcf/scripts/statvar/stat_vars.mcf"
+      ],
+      "import_inputs": [
+        {
+          "template_mcf": "output/finland_census_output.tmcf",
+          "cleaned_csv": "output/finland_census_output.csv"
+        }
+      ],
+      "source_files": [
+        "gs://unresolved_mcf/country/finland/input_files/*.csv"
+      ],
+      "cron_schedule": "30 5 1 * *"
+    }
+  ]
+}
