Greetings,
Quite a few updates from across Discovery this week. As always, feedback and suggestions welcome. 

== Highlights ==
* Our wikis now implement the Open Graph standard, so sharing links on social media now include the appropriate imagery. Thanks to Ladsgroup for this work! [0] [1] [2]
* For those interested in how we test changes to search, there's now a page on Testing Search.[3]
* Requested more text to be translated on the wikipedia.org portal page via translatewiki. [4]
* "Hiring a data scientist", a post on Wikimedia Blog that provides an in-depth look into Discovery's hiring process for the Analysis team. [5]

== Discussions ==

=== Search ===
* Work continues to upgrade to Elasticsearch 5 [6] [7]
* A lot of work on TextCat (language identification) has been deployed; configuration to enable it in production should go out next week. [8] [9]
* Added document content model into the search index and contentmodel: keyword. [10] [11]
* Added more aliases for filetype: keyword [12]
* Nearly done with getting things ready for a new A/B test to be launched on a few wikipedias for sister project search results [13] [14]
* Fixed a timeout issue with advanced searches [15] [16] (not yet deployed, will be deployed with Elasticsearch 5 upgrade)
* Delayed updates from previous weeks:
** Created a list of languages for which we want to investigate analysers [17]
** After analysis, decided to use Stempel as our new Polish language analyser [18]; analysis of Stempel is underway [19]
** Fixed issue with ICU folding that caused problems with the search index [20] [21]

=== Analysis === 
* Wrapping up migrating a significant amount of data using the ReportUpdater infrastructure - almost done! Updating dashboards now to use the new datasets, including NEW datasets (like LDF endpoint usage for WDQS) [22]

=== Portal === 
* Added new text to translatewiki for wikipedia.org portal page for app links and legal language in footer [23] [24]

=== Other Noteworthy Stuff ===
* San Francisco by Maxime Le Forestier [25] [26]

[0] https://en.wikipedia.org/wiki/Open_Graph
[1] https://www.mediawiki.org/wiki/User:Ladsgroup
[2] https://phabricator.wikimedia.org/T142048
[3] https://www.mediawiki.org/wiki/Wikimedia_Discovery/Search/Testing_Search
[4] https://lists.wikimedia.org/pipermail/translators-l/2017-January/003810.html
[5] https://blog.wikimedia.org/2017/02/02/hiring-data-scientist/
[6] https://phabricator.wikimedia.org/T155671
[7] https://phabricator.wikimedia.org/T151224
[8] https://www.mediawiki.org/w/index.php?title=User:TJones_(WMF)/Notes/TextCat_Improvements
[9] https://phabricator.wikimedia.org/T149324
[10] https://phabricator.wikimedia.org/T156371
[11] https://www.mediawiki.org/wiki/Help:CirrusSearch#contentmodel
[12] https://phabricator.wikimedia.org/T156413
[13] https://phabricator.wikimedia.org/T149806
[14] https://phabricator.wikimedia.org/T156299
[15] https://phabricator.wikimedia.org/T152895
[16] https://phabricator.wikimedia.org/T134157
[17] https://phabricator.wikimedia.org/T155549
[18] https://phabricator.wikimedia.org/T154516
[19] https://phabricator.wikimedia.org/T154517
[20] https://www.elastic.co/guide/en/elasticsearch/plugins/current/analysis-icu-folding.html
[21] https://phabricator.wikimedia.org/T156234
[22] https://phabricator.wikimedia.org/T150915
[23] https://phabricator.wikimedia.org/T154350
[24] https://phabricator.wikimedia.org/T153764
[25] https://www.youtube.com/watch?v=tDtXXlD98kw&feature=youtu.be&t=1m30s
[26] https://en.wikipedia.org/wiki/Maxime_Le_Forestier

----

The archive of all past updates can be found on MediaWiki.org:

https://www.mediawiki.org/wiki/Discovery/Status_updates

Interested in getting involved? See tasks marked as "Easy" or "Volunteer needed" in Phabricator.

[1] https://phabricator.wikimedia.org/maniphest/query/qW51XhCCd8.7/#R
[2] https://phabricator.wikimedia.org/maniphest/query/5KEPuEJh9TPS/#R


Yours,
Chris Koerner
Community Liaison - Discovery
Wikimedia Foundation