CH Impact on IT

Cultural heritage, tourism and local economies in Italy

This project explores how cultural heritage impacts tourism and economic performance across Italian regions by analyzing the relationship between cultural assets, tourism revenues, and employment indicators. Through interactive maps and data visualizations, the project provides insights to support more effective use of cultural heritage for sustainable economic growth and balanced regional development.

Created by University of Bologna Students
Image cardHero Image
Scenarios

Exploring the Economic Impact of Cultural Heritage Across Italian Regions

This project aligns data from various sources to analyze how cultural heritage, tourism, and economic development interact across Italy. By examining regional GDP, employment, tourism revenues, and household income, we aim to understand the contribution of cultural assets to local economies and reveal regional disparities.

Our investigation focuses on two key questions:

Key Research Areas

  • Correlation between cultural sites and tourism revenue
  • Impact of cultural heritage on employment and income

The results offer practical insights for policymakers, local authorities, tourism boards, and cultural institutions. The findings support strategic decisions to strengthen regional economies, enhance tourism potential, and advocate for increased investment in cultural heritage β€” promoting both cultural preservation and regional prosperity.

Cultural Heritage Image

Our Goal

To provide data-driven insights that promote sustainable growth and balanced regional development through the better utilization of Italy's cultural heritage.

Datasets Overview

This project integrates original datasets and mashup data sources to analyze the interplay between tourism, cultural heritage, and economic development across Italian regions. Below is a summary of the key datasets utilized.

D1: Accommodation Infrastructure

Tourist establishments, bedrooms, and bed-places across NUTS 2 regions (Eurostat).

D2: Peer-to-Peer Stays

Guest nights via collaborative platforms (NUTS 3 regions, Eurostat Experimental Data).

D3: Monthly Occupancy Trends

Occupancy rates by type of tourist accommodation with guest origin breakdown (ISTAT).

D4 & D5: Arrivals and Overnight Stays

Regional arrivals and nights spent by tourists at accommodation establishments (Eurostat).

D6: Occupancy Metrics

Net occupancy rates of hotels and similar accommodations by NUTS 2 regions (Eurostat).

D7: Cultural Sites Inventory

Luoghi della cultura dataset from MiC catalog, detailing cultural heritage locations in Italy.

D8: GDP Breakdown

Italy's GDP composition by economic sector (agriculture, industry, services) from ISTAT.

D9: Employment by Sector

Regional employment distribution by economic sector based on 2010 classification (ISTAT).

D10: Household Income

Household net income by source, analyzing regional disparities and income structure (ISTAT).

M1: Mashup 1

Mashup dataset of all the tourism data, along with the cultural heritage institutions data.

M2: Mashup 2

Mashup dataset of all the economic data, along with the cultural heritage institutions data.

Quality Checkpoints

Detailed Dataset Evaluation

Overview of completeness, consistency, and reliability indicators per dataset.

Question (Criterion) D1 D2 D3 D4 D5 D6 D7 D8 D9 D10
Time-Series Completeness 100% (2019-2023) 100% (2019-2023) 100% (2019-2023) 100% (2019-2023) 100% (2019-2023) 100% (2019-2023) 0.04% missing 23.57% (2023) 23.26% (2023) 71% (2021/22)
Geographical Coverage 100% NUTS-2 100% NUTS-3 100% NUTS-2 100% NUTS-2 100% NUTS-2 100% NUTS-2 All regions present All NUTS levels All NUTS levels + NACE Rev.2 All NUTS levels
Completeness Eval βœ… βœ… βœ… βœ… βœ… βœ… βœ… βœ… βœ… βœ…
Accuracy βœ… Syntactic βœ… Syntactic βœ… Syntactic βœ… Syntactic βœ… Syntactic βœ… Syntactic βœ… Syntactic βœ… Syntactic βœ… Syntactic βœ… Syntactic
Semantic Accuracy / Data Consistency βœ… Semantic βœ… Semantic βœ… Semantic βœ… Semantic βœ… Semantic βœ… Semantic Province boundaries outdated 3 bil diff 2022/23 Cross-check Eurostat incomplete 4 thousand diff 2021/22
Coherence βœ… No contradictions βœ… No contradictions 12% violated rule βœ… No contradictions βœ… No contradictions βœ… No contradictions βœ… No contradictions βœ… No contradictions βœ… No contradictions βœ… No contradictions
Timeliness βœ… βœ… βœ… βœ… βœ… βœ… Daily Annual Annual Annual
Ethical Checkpoints

Ethical Analysis by Dataset

Evaluation of human-centric principles, data control, and user empowerment aspects per dataset.

Question D1 D2 D3 D4 D5 D6 D7 D8 D9 D10
Is your data processing based on the fact that you borrow data from the users (not owner of their data)? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
Do you ensure that the user’s rights are prioritised, rather than commercial or institutional interests? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…
Do you ensure that primarily users benefit from their own data – not just the organisation? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…
Do you use privacy-by-design principles, and can you describe them clearly and transparently? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…

Question D1 D2 D3 D4 D5 D6 D7 D8 D9 D10
Do you ensure that users’ data – as far as possible – is processed directly on the users’ own device(s)? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
When the processing of data is necessary other than on the user’s own devices, such as your server or a cloud solution, is collected data not related to an identifiable person? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…
Do you use profiling? If so, do you allow the user to influence and determine the values, rules and input that underlie the profiling? ❌❌❌❌❌❌❌❌❌❌
Do you use data to predict individual-level behaviour or only patterns? ❌❌❌❌❌❌❌❌❌❌

Question D1 D2 D3 D4 D5 D6 D7 D8 D9 D10
In which country is your data stored? LuxemburgLuxemburgItalyLuxemburgLuxemburgLuxemburgItalyItaly, under ISTAT infrastructureItaly, under ISTAT infrastructureItaly, under ISTAT infrastructure
Where is the storage solutions provider headquartered? Brussels/Lux.Brussels/Lux.RomeBrussels/Lux.Brussels/Lux.Brussels/Lux.N/AISTATISTATISTAT
Does the transmission of data go through countries outside of the EU? ❌❌❌❌❌❌❌❌❌❌
Do you use machine learning / artificial intelligence? If so, can you explain the algorithms – the criteria and parameters? ❌❌❌❌❌❌❌❌❌❌
Do you use personal data to influence user behaviour? ❌❌❌❌❌❌❌❌❌❌
Do you ensure that it is transparent when the use of personal data may influence a user’s behaviour? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
Do you ensure that the design does not create addiction and thus influences the person’s self-determination and empowerment? βœ…βœ…βœ…βœ…βœ…βœ…βœ…βœ…βœ…βœ…
Do you operate with open source software, so others can use it and possibly develop it further? βœ… Eurostat publishes API code snippetsβœ… Eurostat publishes API code snippetsβŒβœ… Eurostat publishes API code snippetsβœ… Eurostat publishes API code snippetsβœ… Eurostat publishes API code snippetsβœ…βœ… the project uses open source tools (e.g., Python, Jupyter) and publishes code openlyβœ… the project uses open source tools (e.g., Python, Jupyter) and publishes code openlyβœ… the project uses open source tools (e.g., Python, Jupyter) and publishes code openly

Question D1 D2 D3 D4 D5 D6 D7 D8 D9 D10
When do you anonymise personal data? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
Do you use end-to-end encryption of data? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…
Do you minimise the use of metadata and explain how it is done? βœ…βœ…βœ…βœ…βœ…βœ…βŒβœ…βœ…βœ…
Do you use zero knowledge as a design principle? N/AN/AN/AN/AN/AN/A❌N/AN/AN/A
Do you sell data to third parties? ❌❌❌❌❌❌❌❌❌❌
Do you sell data as personal identifiable data? ❌❌❌❌❌❌❌❌❌❌
Do you sell data as patterns on an aggregated level? ❌❌❌❌❌❌❌❌❌❌
If you sell data, are you making sure that it is fully anonymised information only describing patterns, not individuals? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
Do you use third-party cookies? ❌❌❌❌❌❌❌❌❌❌
Does this include SoMe (social media) cookies and SoMe logins? ❌❌❌❌❌❌❌❌❌❌
Do you use Google Analytics or similar tracking tools? ❌❌❌❌❌❌❌❌❌❌
If you use third-party cookies, are your users fully aware that your cookie use leads to sharing of data about your users with third parties and do they agree with it? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
Do you enrich data with external data, such as social media data, bought data or web scraping? ❌❌❌❌❌❌❌❌❌❌
Does this enrichment occur in response to, or in cooperation with, your users? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
Do you have an individual or a department responsible for the ethical managing of data? βœ…βœ…βœ…βœ…βœ…βœ…βŒβœ…βœ…βœ…
How is the work with data ethics embedded in the organisation? Managed by Eurostat within the European Statistical System (ESS) Identical framework to D1 Legislative Decree 322/1989 (Italian statistical law) and ISTAT’s Code of Conduct. Identical framework to D1 Identical framework to D1 Identical framework to D1 N/A ISTAT integrates ethics through institutional data policies, transparency rules and compliance checks ISTAT integrates ethics through institutional data policies, transparency rules and compliance checks ISTAT integrates ethics through institutional data policies, transparency rules and compliance checks
How do you ensure that your data ethics guidelines are respected? automated disclosure-control software, dual sign-off workflows, audit trails, external peer reviews, and supervisory bodies automated disclosure-control software, dual sign-off workflows, audit trails, external peer reviews, and supervisory bodies version control & audit log automated disclosure-control software, dual sign-off workflows, audit trails, external peer reviews, and supervisory bodies automated disclosure-control software, dual sign-off workflows, audit trails, external peer reviews, and supervisory bodies automated disclosure-control software, dual sign-off workflows, audit trails, external peer reviews, and supervisory bodies N/A Internal audits, legal compliance teams, and mandatory adherence to GDPR and open data principles Internal audits, legal compliance teams, and mandatory adherence to GDPR and open data principles Internal audits, legal compliance teams, and mandatory adherence to GDPR and open data principles
Can the processing of data be audited by an independent third party? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…
Do you require and control the data ethics of your subcontractors and partners? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…

Question D1 D2 D3 D4 D5 D6 D7 D8 D9 D10
Do you engage in dialogue with your users on a public platform? βœ…βœ…βœ…βœ…βœ…βœ…βŒβœ…βœ…βœ…
Do you have guidelines for using the platform? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…
Do you moderate the platform in order to remove sensitive personal data? N/AN/AN/AN/AN/AN/AN/Aβœ…βœ…βœ…
If your services are offered to children, do you ensure parental consent? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
Is data used to develop or train an algorithm? ❌❌❌❌❌❌N/A❌❌❌
Do you ensure that the use of data does not lead to discrimination? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…
Do you ensure that the use of data does not expose the vulnerabilities of individuals? βœ…βœ…βœ…βœ…βœ…βœ…N/Aβœ…βœ…βœ…
Do you ensure that the use of artificial intelligence / machine learning is to the benefit of the individual and does not cause physical, psychological, social or financial harm to the individual? N/AN/AN/AN/AN/AN/AN/AN/AN/AN/A
Technical Checkpoints

Detailed Technical Analysis

Overview of formats, metadata, provenance, and accessibility details per dataset.

Dataset D1 D2 D3 D4 D5 D6 D7 D8 D9 D10
Format XLSX, TSV, CSV, XML, JSON XLSX, TSV, CSV, XML, JSON XLS, CSV, JSON XLSX, TSV, CSV, XML, JSON XLSX, TSV, CSV, XML, JSON XLSX, TSV, CSV, XML, JSON JSON-LD, RDF/TTL, RDF/XML CSV CSV CSV

Dataset D1 D2 D3 D4 D5 D6 D7 D8 D9 D10
Metadata Covers: scope, purpose, variables, definitions, coverage, legal basis, statistical units, data sources Same as for D1 SIQual, quality notes, scope, concepts, classifications Same as for D1 Same as for D1 Same as for D1 Cultural site categorization, province/region breakdown, MiC updates Sectoral & regional breakdown, national accounts alignment Employment by sector, territorial detail, NACE/ATECO classification Household income breakdown, source categories, regional aggregation

  • D1: Eurostat
  • D2: Eurostat
  • D3: ISTAT
  • D4: Eurostat
  • D5: Eurostat
  • D6: Eurostat
  • D7: MiC (Ministry of Culture)
  • D8: ISTAT
  • D9: ISTAT
  • D10: ISTAT
Serialization

RDF Serialization of Metadata

All datasets and the project catalogue have been described with metadata, following the DCAT Application profile for data portals in Europe (DCAT-AP).

Visualizations

Visualizations

Visualizations to better explain the datasets and how they relate to each other in a visually engaging way.

Considerations Mashup Used
What is the relationship between available tourist accomodations and occupancy rates? What can this imply about the presence of cultural heritage institutions? Mashup 1
What is the relationship between tourist arrival numbers and the number of cultural heritage institutions present? Mashup 1
What is the relationship between the number of nights spent per region? Is there any correlation between this and the number of cultural heritage institutions? Mashup 1
What is the trend of total accomodation establishments versus cultural heritage institutions between 2021 and 2023? Mashup 1
What is the overall correlation between all touristic data and the number of cultural heritage institutions? Mashup 1
What is the overall average trend of the economic data between 2021 and 2023? Mashup 2
What is the relationship between cultural heritage institutions and the economic information? Mashup 2
What are the regional averages of economic data and are there any visible correlations with the number of cultural heritage institutions? Mashup 2
What is the overall correlation between all economic data and the number of cultural heritage institutions? Mashup 2
Discussion

Discussion of Findings

For the first resarch topic, as we intended to investigate the relationship between the presence of cultural heritage institutions and tourism activity, we looked at the number of available accomodation and their occupancy rates per region. From this comparison, it could be said that the regions with higher numbers of available accomodations do not necessarily have higher rates of occupancy (i.e. Veneto and Toscana regions). This could mean that the occupancy rate is not necessarily related to the total number of available places, which is supported by the low rate of correlation (r = 0.35).

In addition to the occupancy rate, we investigated the number of tourist arrivals against the number of cultural heritage institutions per region. There seems to be a reasonable relationship between these two parameters, as the higher number of arrivals are accompanied with the higher number of institutions, and the lower number of arrivals occur in the regions with lower number of institutions. Even though there are exceptions to this trend, such as Veneto and Trentino-Alto Adige regions, we believe it contributes to the initial hypothesis.

While investigating the nights spent based on the aggregated numbers from collaborative platforms and accomodation establishements per region, we found a trend similar to our second point, such that there seems to be a relationship between the higher number of cultural institutions and the higher number of nightly stays. The seperate correlation coefficients of r = 0.61 for the collaborative platforms data and r = 0.53 for the accomodation establishments seem to support the existence of a relationship.

In order to have an overall look, we used the biggest data we have for the tourism activity and we plotted the number of cultural institutions against it. As the graph suggests, there seems to be some correlation between the two data, although there is a bit more discrepancy in this point compared to other points. The correlation coefficient is r = 0.5, which seems to suggest the existince of a relationship, although it is not a strong one.

For our second research topic, we investigated if there is a relationship between the existence of cultural heritage institutions and economic data. For that, we looked at the trends between only the economic data before plotting it against the institution data. Overall, GDP and employment data seem to be highly related to each other, however as the household net income data has a large amount of missing data, based on the data we have, it seems to not have a visible relationship with the other categories of data.

According to our correlation matrix, the number of cultural heritage institutions seems to have a reasonable relation with GDP (r = 0.59) and a higher relation with the number of employment (r = 0.72).

Overall, there seems to be some overlap between tourism activity and the number of cultural heritage institutions, as well as the presence of cultural heritage institutions and their impact on the local economies. However, the data we currently have is not enough to make a solid conclusion, more data and more investigation of data is required based on the findings of our project.

Sustainability Checkpoints

Dataset Update Sustainability

Ensuring long-term relevance and reliability of datasets through institutional sources and transparent update cycles.

The sustainability of updating the datasets over time is ensured through the use of official, institutional data sources with established update cycles.

For datasets D1 to D6, provided by Eurostat and ISTAT, updates follow a predictable schedule, typically annual or quarterly, depending on the indicator. These updates are transparently published on official portals, accompanied by metadata that informs users about version changes and data currency.

For D7, maintained by the Italian Ministry of Culture (MiC), the dataset is part of the national DBUnico system, which is continuously managed and updated as new cultural sites are added or existing records change.

For economic datasets D8 to D10, ISTAT guarantees regular revisions aligned with national accounting practices, labour statistics releases, and household income surveys, ensuring data remains relevant for long-term research and policy analysis.

The reliance on standardized data sources, open data platforms, and clear versioning policies contributes to the long-term sustainability and reliability of these datasets for research, policy-making, and open-access projects.

About the Project

This website was developed by students of the University of Bologna as part of an academic project exploring the relationship between cultural heritage, tourism, and regional economic development.

© Scinarios Project. All rights reserved.