Projekt:Wikispeech – Talresursinsamlaren 2019/Project log

Från Wikimedia
Hoppa till navigering Hoppa till sök
This is intended to serve as a log over developments and decisions to make it easier to follow what is going on in the project. If you have any questions don't hesitate to get in touch at andre.costa(at)

April 2021

Mars 2021

  • The report on the fourth phase of the project is published [Swedish]
  • Added specialpage for editing lexicon. It can be used to add new lexicon entries.

February 2021

  • Recording annotations are now stored in wiki pages (using MCR) rather than directly in the database. This adds edit interface, revision editing and rollback features with traceability to authors as in any other standard wiki article.

January 2021

  • Wikispeech presented at Norwegian Wikitreff
  • Wikispeech demoed for Norwegian Nasjonalbiblioteket

November 2020

  • Speechoid docker images are automatically built and deployed to the WMF Docker Registry. One result of this is that the process for hosting Speechoid as a service on your own infrastructure is a simple (and documented) task.
  • It is now possible to hide the player again after you have activated it. This also disables the selection player.
  • The report on the third phase of the project is published [Swedish]
  • A benchmarking script is now available to measure the processing and storage requirements for segmenting and synthesising a given page.
  • The Wikispeech page on Meta has undergone a much needed update turning it into a portal for all things Wikispeech.

October 2020

  • Wikispeech hidden in views where it cannot be used (e.g. History or Edit view).
  • Released version 0.1.7 of Wikispeech. Primarily internal changes in how Wikispeech works. You can test it on

September 2020

  • Released version 0.1.6 of Wikispeech. Primarily internal changes in how Speechoid and Wikispeech interact. You can test it on
  • Listening to a page is no longer interrupted if another user edits the page.
  • Released version 0.1.5 of Wikispeech. Highlights include better control of when Wikispeech is activated and an updated UI. You can test it on
  • Speechoid is now versioned to make it easier to map compatibility with the extension.

August 2020

  • A new version of is now up including up-to-date changes in Wikispeech and Speechoid.
  • Speechoid has now been integrated in the Continuous Integration (CI) workflow. Final images are not being published but Speechoid, as deployed on is now being built in the same way as CI will do in the future.

July 2020

June 2020

May 2020

  • Re-implementing Speechoid packaging to be done using Blubber

April 2020

  • PTS published a video about the project
  • Initiating work on the march 2020 decision to store generated speech in Wikispeech extension
  • Initiating work on converting Pronlex databases from SQLite to MySQL
  • Received feedback that the proposed re-architecture of Pronlex will need to be modified to avoid the service-dependancy-loop

March 2020

February 2020

  • On-boarded new developer User:Karl Wettin (WMSE)
  • Analysed how to implement the Speech Data Collector (in Swedish). In particular if it should be a MediaWiki extension running on Wikipedia or a stand-alone tool. The conclusion was to implement it as a MediaWiki extension but to not require it to run on Wikipedia. That way it can either be used as a stand-alone tool (a separate wiki) or implemented directly on the wiki.

January 2020

  • Site visit took place at WMDE offices, in part focusing on requirments needed to bring Wikispeech to production. See notes here (in Swedish).
Events prior to 2020 ar not included in the log