Projekt:WFD-data till Wikidata 2016/2016-06-09 Reference group meeting
WFD-data to Wikidata | |
---|---|
Documentation | About the project · Living documentation · Demo object · Final report |
Meeting notes | 2016-04-12 start-up meeting · 2016-06-09 Reference group meeting · 2016-10-17 Reference group meeting |
Properties | P2856 <EU Surface Water Body Code> (EU_CD) · P761 <Lake ID (Sweden)>P2965 <EU River Basin District code> (euRBDCode) · P3643 <Significant environmental impact types> (swSignificantImpactType) · P4002 <Ecological status> (swEcologicalStatusOrPotentialValue) · P4202 <Chemical status> (swChemicalStatusValue) |
Mappings | swSignificantImpactType (Annex 1b) · swEcologicalStatusOrPotentialValue · surfaceWaterBodyCategory · swChemicalStatusValue Work in progress: QualityElement |
Code | github:lokal-profil/WFD_import |
Notes from kick off meeting regarding Wikidata and WFD reporting
- Place: European Environment Agency,, Denmark
- Time: 10.00-15.30 9th of June 2016
Background
EU:s Water Framework Directive (WFD) requires reporting of water data in XML-data standard from all member states at the end of each 6-year water cycle. The data is stored in Reportnet at EEA and will be exposed to the public through WFS-services and RDF.
In 2012 a knowledge base called Wikidata www.wikidata.org (“the sum of all human knowledge”) was created to support the Wikimedia family. It replaces Google's attempt to make the same (called Freebase). By storing all data in one database and show data from one place will simplify updating, homogenization and translating to all languages in all Wikipedia articles.
The structure of the database is still quite immature and requires definitions of statements to be able to use. This is easily done by anyone having a Wikimedia account and is sanctioned by the Wikidata community (all other members).
General information about the project (Niklas)
One thing that was raised was the issue of demand for exposing WFD data in Wikipedia. One way to show this is to see what the usage statistics are for the 56,000 lakes in Sweden which are exposed through the Swedish Wikipedia. Another argument is that WFD is about raising environmental awareness to be able to improve water status - see WFD (14) "The success of this Directive relies on close cooperation and coherent action at Community, Member State and local level as well as on information, consultation and involvement of the public, including users.”
Introduction to the Wiki-world (André)
A recurring question was how we deal with the fact that statements cannot be locked down once they are in Wikidata, even if they are sourced and that a change to the data (although unlikely) would reflect badly on the data providers. The main thing to identify is that it is the responsibility of the downstream user of the information to verify that the information is correct, something we wouldn’t question if the information would have been embedded in a Wikipedia article.
Discussions regarding the project
Demo: https://test.wikidata.org/wiki/Q2442
In this dictionary there are description on the elements reported to EEA through Reportnet according to voluntary och mandatory reportings. The URL:s to each definition will be kept alive for a long time. For example RiverBasinDistrict is defined here: http://dd.eionet.europa.eu/tables/9140
A discussion regarding how to use different ID:s on Water bodies and the reference group stressed the importance of adding national ID:s to be able to link to a national system. Regarding INSPIRE I, it will have to be done as well within 3 years.
Future name of the project might be "Wiki Loves Waters" - paraphrasing "Wiki Loves Monuments".
Priority of data to add:
- Base facts about a water body incl. water bodies.
- Status – Ecological, Chemical, Quantitative
- Impact
- Pressure
- Drivers
- Measures
Inflow-Outflow – not possible to do through the reported data, but might be a challenge for an interested Wikipedian.
Priority regarding water bodies:
- Lakes
- Coastal waters (and transitional)
- Rivers
- Groundwaters
Make a difference in between a water body and water body as a water management unit.
Deliveries from EEA
Member states now reports to ReportNet but there will be a lot of resubmissions before the datasets are stable and EEA will deliver it to the Data Service Center. Some part will go to the DataDictionary. Identifier, names etc in the DataDictionary – status and else information. It´s a reference dataset – it should be stable – only change once every year or rather every 6th year.
To do list:
- Documentation from the meeting – find a nice place with our presentations and rough notes for CC0 usage. (André)
- Define RBD and Competent Authority in Wikidata – use descriptions from RBDSUCA. (Niklas and André)
- Develop test.wikidata.org according to advices today. Divide the Statement "EU Ecological status or potential" for global as one statement and another one for quality elements. How to explain that 5 is "High"? Check it there are used for 5,4,3,2,1, - High. Add HMWB (Heavily Modified Water Bodies) as a statement? Propose different solutions. (Niklas and André)
- CC0 – the reported data free to use without adding the source? Maybe we can ask each country to add a document saying that it is CC0…? Maybe use the description field before uploading. (Lars, Duncan, Niklas and Lasse check for each member states)
- Test data – can we use from some member state? Maybe not Finland due to http://www.jarviwiki.fi/ which have other sources and purposes.
- Next reference group meeting beginning in connection to 18-19th October WISE Technical group meeting. (Fernanda)
- Reference data – how should it be handled? Every 6th year.
Participant list
Niklas Holmgren, South Baltic Water Authority, Sweden
André Costa, Wikimedia Sweden
Fernanda Néry, EEA
Stéphane Isoard, EEA
Sören Roug, EEA
Lars Stalsberg, NVE Norway
Aslak Eide, NVE Norway
Duncan Taylor, GB SEPA
Lasse Järvenpää, SYKE, Finland