Hi everyone,
Last week I attended and presented at the virtual Celtic Knot Conference
<https://meta.wikimedia.org/wiki/Celtic_Knot_Conference_2020>. There were
plenty of interesting talks, some live, some pre-recorded, all now
available on YouTube; links are available on the “Main/Live program
<https://meta.wikimedia.org/wiki/Celtic_Knot_Conference_2020/Live_program>”
page, and the “Videos pool
<https://meta.wikimedia.org/wiki/Celtic_Knot_Conference_2020/Videos_pool>”
page.
I wanted to point out some of presentations and other things that might be
interesting:
- You can see a demo <https://www.youtube.com/watch?v=WIeJ_0aqgPg> of
what the Growth Team has been up to with their newcomer task work that our
team has been supporting.
- There’s a workshop-like demo of the Lexeme project on Wikidata
<https://www.youtube.com/watch?v=oDM5QJAJzNc>, which still has a long
way to go, but already has a *lot* of data.
- There’s also Lexeme-related tool in ToolForge called Ordia
<https://ordia.toolforge.org/>, which has all sorts of nifty
capabilities. A nice one is looking to see how many lexemes each
language has <https://ordia.toolforge.org/language/>.
- I had not previously heard of Wikidata Bridge
<https://www.mediawiki.org/wiki/Wikidata_Bridge>, which aims to allow
people to edit Wikidata from infoboxes!
- A recent article from *Java Magazine* lists the 25 greatest Java apps
ever written
<https://blogs.oracle.com/javamagazine/the-top-25-greatest-java-apps-ever-wr…>,
and #6 is “Wikipedia Search”, even though the Java bit is mostly
Elasticsearch and the “Wikipedia” part is mostly PHP. Still, it’s nice to
be appreciated.
- Amir has some nice ideas about how to make the Wikimedia Incubator
better <https://www.youtube.com/watch?v=DdyzrDzD0qg>. One positive side
effect of his proposal might be better search on new wikis.
I don’t particularly recommend my talk
<https://www.youtube.com/watch?v=Pi3-w9ne3zg> since it is a short version
of the same old overview of the basic kinds of text processing we can do
for search—unless you want to see a few more examples in Irish (I don’t try
to *pronounce* any of the Irish words, though, so it isn’t as entertaining
as it could have been).
I already got a line on some Breton stop words, and I’m going to look into
what we are doing for Breton as a 10% project.
—Trey
Trey Jones
Sr. Computational Linguist, Search Platform
Wikimedia Foundation
UTC-4 / EDT
Search Platform Office Hours will be in just a bit less than 24 hours from
now.
On Wed, Jun 24, 2020 at 12:16 PM Trey Jones <tjones(a)wikimedia.org> wrote:
> The Search Platform Team
> <https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds
> office hours the first Wednesday of each month. Come talk to us about
> anything related to Wikimedia search!
>
>
> Feel free to add your items to the Etherpad Agenda for the next meeting.
>
>
> Details for our next meeting:
>
> Date: Wednesday, July 1st, 2020
>
> Time: 15:00-16:00 GMT / 08:00-09:00 PDT / 11:00-12:00 EDT / 17:00-18:00
> CEST
>
> Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
>
> Google Meet link: https://meet.google.com/vyc-jvgq-dww
>
> Join by phone in the US: +1 786-701-6904 PIN: 262 122 849#
>
>
> Hope to talk to you in a week!
>
> —Trey
>
>
> Trey Jones
> Sr. Software Engineer, Search Platform
> Wikimedia Foundation
> UTC-4 / EDT
>
The Search Platform Team
<https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds
office hours the first Wednesday of each month. Come talk to us about
anything related to Wikimedia search!
Feel free to add your items to the Etherpad Agenda for the next meeting.
Details for our next meeting:
Date: Wednesday, July 1st, 2020
Time: 15:00-16:00 GMT / 08:00-09:00 PDT / 11:00-12:00 EDT / 17:00-18:00 CEST
Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
Google Meet link: https://meet.google.com/vyc-jvgq-dww
Join by phone in the US: +1 786-701-6904 PIN: 262 122 849#
Hope to talk to you in a week!
—Trey
Trey Jones
Sr. Software Engineer, Search Platform
Wikimedia Foundation
UTC-4 / EDT
The Search Platform Team
<https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds
office hours the first Wednesday of each month. Come talk to us about
anything related to Wikimedia search!
Feel free to add your items to the Etherpad Agenda for the next meeting.
Details for our next meeting:
Date: Wednesday, June 3rd, 2020
Time: 15:00-16:00 GMT / 08:00-09:00 PDT / 11:00-12:00 EDT / 17:00-18:00 CEST
Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
Google Meet link: https://meet.google.com/vyc-jvgq-dww
Join by phone in the US: +1 786-701-6904 PIN: 262 122 849#
Hope to talk to you in a week!
—Trey
Trey Jones
Sr. Software Engineer, Search Platform
Wikimedia Foundation
UTC-4 / EDT
The Search Platform Team
<https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds
office hours the first Wednesday of each month. Come talk to us about
anything related to Wikimedia search!
Feel free to add your items to the Etherpad Agenda for the next meeting.
Details for our next meeting:
Date: Wednesday, May 6th, 2020
Time: 15:00-16:00 GMT / 08:00-09:00 PDT / 11:00-12:00 EDT / 17:00-18:00 CEST
Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
Google Meet link: https://meet.google.com/vyc-jvgq-dww
Join by phone in the US: +1 786-701-6904 PIN: 262 122 849#
Hope to talk to you in about a week!
—Trey
Trey Jones
Sr. Software Engineer, Search Platform
Wikimedia Foundation
UTC-4 / EDT
Routing to discovery@ where all the search people are.
On Fri, Apr 10, 2020 at 2:43 PM Kaartic Sivaraam <
kaarticsivaraam91196(a)gmail.com> wrote:
> Hi,
>
> Is there any MediaWiki API that could be used for a proper
> case-insensitive category title search? Or is this still something that
> doesn't exist yet?
>
> For some context, I'm asking this for a feature request[1] in the
> Commons Android app that asks for a case insensitive category title
> search. In case you're wondering where category search comes into
> picture in the app, adding appropriate categories for an image is part
> of the upload flow.
>
> I wonder that such an API doesn't exist yet for the following reasons:
>
> A) The exhaustive search for such an API wasn't fruitful. The closest we
> got was using `generator=search` with `srsearch=intitle:$SEARCH_TERM`
> and `srnamespace=14` (14 is the id for category namespace in Commons).
> But it's not a proper category search as it is essentially a search for
> pages existing in Category namespace. See [2] for why it's not a proper
> category search.
>
> B) I saw "T59302 Suggest case insensitive results when searching for
> categories to add"[3] which is still open. In particular the comment in
> the ticket pointed to by [4].
>
> So, is my understanding that an API for case-insensitive category title
> search doesn't exist correct? Or am I missing something?
>
> Footnotes:
> [1]: https://github.com/commons-app/apps-android-commons/issues/3179
> [2]: https://phabricator.wikimedia.org/T59302#2707969
> [3]: https://phabricator.wikimedia.org/T59302
> [4]: https://phabricator.wikimedia.org/T59302#3977813
>
> Hoping you're safe,
> Sivaraam
>
> _______________________________________________
> Wikitech-l mailing list
> Wikitech-l(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
--
Guillaume Lederrey
Engineering Manager, Search Platform
Wikimedia Foundation
UTC+1 / CET
The Search Platform Team
<https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds
office hours the first Wednesday of each month. Come talk to us about
anything related to Wikimedia search!
Feel free to add your items to the Etherpad Agenda for the next meeting.
Details for our next meeting:
Date: Wednesday, April 1st, 2020
Time: 15:00-16:00 GMT / 08:00-09:00 PDT / 11:00-12:00 EDT / 17:00-18:00 CEST
Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
Google Meet link: https://meet.google.com/vyc-jvgq-dww
Join by phone in the US: +1 786-701-6904 PIN: 262 122 849#
Hope to talk to you tomorrow!
Trey Jones
Sr. Software Engineer, Search Platform
Wikimedia Foundation
UTC-4 / EDT
Hi everyone,
The Search Platform Team
<https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds
office hours the first Wednesday of each month. Come talk to us about
anything related to Wikimedia search!
Feel free to add your items to the Etherpad Agenda for the next meeting.
Details for our next meeting:
Date: Wednesday, March 4th, 2020
Time: 16:00-17:00 GMT / 08:00-09:00 PST / 11:00-12:00 EST / 17:00-18:00 CET
Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
Google Meet link: https://meet.google.com/vyc-jvgq-dww
Join by phone in the US: +1 786-701-6904 PIN: 262 122 849#
Hope to talk to you in a week!
—Trey
Trey Jones
Sr. Software Engineer, Search Platform
Wikimedia Foundation
UTC-5 / EST
---------- Forwarded message ---------
From: Jason Linehan <jlinehan(a)wikimedia.org>
Date: Wed, Feb 19, 2020 at 4:12 PM
If your team uses mw.user.sessionId() for instrumentation, a recent change
to MediaWiki could impact your numbers.
The new patch <https://gerrit.wikimedia.org/r/#/c/mediawiki/core/+/572011/>
changes
the way that session IDs work, bringing their behavior closer to other
platforms that many of us are familiar with.
The value returned from mw.user.sessionId() will now:
- be the same in different tabs of the same browser process
- be the same in different windows of the same browser process
- be forgotten once the browser process ends
Since 2017, values returned from mw.user.sessionId() have only been
constant within the same browser tab, and only lasted until the tab was
closed. This had gone unnoticed until recently. See T223931
<https://phabricator.wikimedia.org/T223931> for more details. This patch
restores pre-2017 behavior.
If you have any questions about the change, or if you notice any
irregularities in your data or instrumentation, reach out or tag jlinehan,
mpopov, or the Better Use of Data topic on Phabricator.
-Jason
The Search Platform Team
<https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds
office hours the first Wednesday of each month. Come talk to us about
anything related to Wikimedia search!
Feel free to add your items to the Etherpad Agenda for the next meeting.
Details for our next meeting:
Date: Wednesday, Feb 5th, 2020
Time: 16:00-17:00 GMT / 08:00-09:00 PST / 11:00-12:00 EST / 17:00-18:00 CET
Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
Google Meet link: https://meet.google.com/vyc-jvgq-dww
Join by phone in the US: +1 786-701-6904 PIN: 262 122 849#
Hope to talk to you tomorrow!
—Trey
Trey Jones
Sr. Software Engineer, Search Platform
Wikimedia Foundation
UTC-5 / EST