Projekt:WFD-data till Wikidata 2016/2016-06-09 Reference group meeting

Notes from the kick-off meeting regarding Wikidata and WFD reporting

Place:            European Environment Agency, Denmark
Time:            10.00-15.30, 9 June 2016


The EU's Water Framework Directive (WFD) requires all member states to report water data in an XML data standard at the end of each 6-year water cycle. The data is stored in Reportnet at the EEA and will be exposed to the public through WFS services and RDF.

In 2012 a knowledge base called Wikidata (“the sum of all human knowledge”) was created to support the Wikimedia family. It replaces Freebase, Google's attempt to build the same kind of knowledge base. Storing all data in one database and serving it from one place simplifies updating, homogenizing and translating the data across all Wikipedia articles in all languages.

The structure of the database is still quite immature, and statements have to be defined before they can be used. Anyone with a Wikimedia account can easily do this, and the definitions are sanctioned by the Wikidata community (all other members).

General information about the project (Niklas)


One issue raised was whether there is demand for exposing WFD data in Wikipedia. One way to show this is to look at the usage statistics for the 56,000 lakes in Sweden that are exposed through the Swedish Wikipedia. Another argument is that the WFD is about raising environmental awareness in order to improve water status; see WFD recital (14): “The success of this Directive relies on close cooperation and coherent action at Community, Member State and local level as well as on information, consultation and involvement of the public, including users.”

Introduction to the Wiki-world (André)


A recurring question was how to deal with the fact that statements cannot be locked once they are in Wikidata, even if they are sourced, and that a change to the data (although unlikely) would reflect badly on the data providers. The main point is that the downstream user of the information is responsible for verifying that it is correct, something we would not question had the information been embedded in a Wikipedia article.

Discussions regarding the project


EEA Eionet – Data Dictionary

This dictionary contains descriptions of the elements reported to the EEA through Reportnet under voluntary and mandatory reporting. The URLs to each definition will be kept alive for a long time. For example RiverBasinDistrict is defined here:

There was a discussion on how to use different IDs for water bodies, and the reference group stressed the importance of adding national IDs to make it possible to link to a national system. Regarding INSPIRE I, this will also have to be done within 3 years.
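The point about carrying national IDs alongside the reported identifier can be sketched as a small record type plus a lookup table. This is only an illustration: the class name, field names and example values below are hypothetical and are not taken from the WFD reporting schema or from Wikidata.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class WaterBodyRecord:
    """Hypothetical record; field names are illustrative, not from the WFD schema."""
    name: str
    eu_code: str                       # identifier as reported to the EEA (assumed format)
    national_id: Optional[str] = None  # identifier used in the national system, if known


def index_by_national_id(records: List[WaterBodyRecord]) -> Dict[str, WaterBodyRecord]:
    """Build a lookup from national ID to record, skipping records without one."""
    return {r.national_id: r for r in records if r.national_id is not None}


# Made-up example values.
records = [
    WaterBodyRecord(name="Example lake", eu_code="XX-0001", national_id="0001"),
    WaterBodyRecord(name="Example river", eu_code="XX-0002"),
]
lookup = index_by_national_id(records)
print(sorted(lookup))
```

A record without a national ID simply drops out of the lookup, which is one way to see which water bodies cannot yet be linked back to a national system.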

Future name of the project might be "Wiki Loves Waters" - paraphrasing "Wiki Loves Monuments".

Priority of data to add:

  1. Base facts about a water body incl. water bodies.
  2. Status – Ecological, Chemical, Quantitative
  3. Impact
  4. Pressure
  5. Drivers
  6. Measures

Inflow-Outflow – not possible to do through the reported data, but might be a challenge for an interested Wikipedian.

Priority regarding water bodies:

  1. Lakes
  2. Coastal waters (and transitional)
  3. Rivers
  4. Groundwaters

Distinguish between a water body as such and a water body as a water management unit.

Deliveries from EEA

Member states now report to Reportnet, but there will be a lot of resubmissions before the datasets are stable and the EEA delivers them to the Data Service Center. Some parts will go to the DataDictionary. Identifiers, names etc. in the DataDictionary – status and other information. It's a reference dataset and should be stable, changing only once every year or rather every 6th year.

To do list:

  •   Documentation from the meeting – find a nice place with our presentations and rough notes for CC0 usage. (André)
  •   Define RBD and Competent Authority in Wikidata – use descriptions from RBDSUCA. (Niklas and André)
  •   Develop according to today's advice. Split the statement "EU Ecological status or potential" into one statement for the global status and another one for quality elements. How to explain that 5 is "High"? Check how 5, 4, 3, 2, 1 are used (5 = High). Add HMWB (Heavily Modified Water Bodies) as a statement? Propose different solutions. (Niklas and André)
  •   CC0 – is the reported data free to use without citing the source? Maybe we can ask each country to add a document saying that it is CC0…? Maybe use the description field before uploading. (Lars, Duncan, Niklas and Lasse check for each member state)
  •   Test data – can we use data from some member state? Maybe not Finland due to  which have other sources and purposes.
  •   Next reference group meeting to be held in connection with the WISE Technical Group meeting on 18-19 October. (Fernanda)
  •   Reference data – how should it be handled? Every 6th year.
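The to-do item about explaining that 5 is "High" could be handled by storing the class label explicitly, not a bare number. The sketch below is a minimal illustration of that idea. The five class names are the ecological status classes of the WFD; the direction of the numeric scale (5 = High) is taken from the notes above and treated as an assumption, and the function and parameter names are hypothetical.

```python
from enum import Enum


class StatusClass(Enum):
    HIGH = "High"
    GOOD = "Good"
    MODERATE = "Moderate"
    POOR = "Poor"
    BAD = "Bad"


# Classes ordered from best to worst.
ORDERED = [
    StatusClass.HIGH,
    StatusClass.GOOD,
    StatusClass.MODERATE,
    StatusClass.POOR,
    StatusClass.BAD,
]


def decode_status(code: int, high_is_five: bool = True) -> StatusClass:
    """Translate a numeric code 1-5 into a status class.

    high_is_five=True follows the meeting notes (5 = High); this direction
    is an assumption and should be checked against the reported data.
    """
    if not isinstance(code, int) or not 1 <= code <= 5:
        raise ValueError(f"status code must be an integer from 1 to 5, got {code!r}")
    index = 5 - code if high_is_five else code - 1
    return ORDERED[index]


print(decode_status(5).value)
print(decode_status(1, high_is_five=False).value)
```

Making the scale direction an explicit parameter means the same function can be reused if the check shows that the scale runs the other way.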

Participant list

Niklas Holmgren, South Baltic Water Authority, Sweden

André Costa, Wikimedia Sweden

Fernanda Néry, EEA

Stéphane Isoard, EEA

Sören Roug, EEA

Lars Stalsberg, NVE Norway

Aslak Eide, NVE Norway

Duncan Taylor, GB SEPA

Lasse Järvenpää, SYKE, Finland