> It would indeed be interesting to see which percentage of proposals are
> being approved (and stay in Wikidata after a while), and whether there
> is a pattern (100% approval on some type of fact that could then be
> merged more quickly; or very low approval on something else that would
> maybe better be revisited for mapping errors or other systematic problems).
+1, I think that's your best bet. Specific properties were much better
maintained than others -- identify those that meet the bar for wholesale
import and leave the rest to the primary sources tool.
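
To make the idea concrete, here is a rough sketch of how per-property approval rates could be tallied and used to sort properties into "wholesale import", "leave to the primary sources tool", and "check the mapping". Everything in it is an assumption for illustration: the review records are invented, the (property, decision) log format is hypothetical and not an actual Primary Sources tool export, and the 95% / 20% thresholds are placeholders rather than agreed bars.

```python
from collections import defaultdict

# Hypothetical review log: (property, decision) pairs.
# Invented for illustration; NOT real Primary Sources tool data.
REVIEWS = [
    ("architect", "approved"),
    ("architect", "rejected"),
    ("postal code", "rejected"),
    ("postal code", "rejected"),
    ("date of birth", "approved"),
    ("date of birth", "approved"),
]


def approval_rates(reviews):
    """Return {property: fraction of reviewed proposals that were approved}."""
    counts = defaultdict(lambda: [0, 0])  # property -> [approved, total]
    for prop, decision in reviews:
        counts[prop][1] += 1
        if decision == "approved":
            counts[prop][0] += 1
    return {prop: approved / total for prop, (approved, total) in counts.items()}


def triage(rates, import_bar=0.95, review_bar=0.20):
    """Sort properties into three buckets by approval rate.

    import_bar and review_bar are placeholder thresholds (assumptions).
    """
    result = {}
    for prop, rate in rates.items():
        if rate >= import_bar:
            result[prop] = "wholesale import"
        elif rate <= review_bar:
            result[prop] = "check mapping"
        else:
            result[prop] = "primary sources tool"
    return result


if __name__ == "__main__":
    rates = approval_rates(REVIEWS)
    for prop, bucket in sorted(triage(rates).items()):
        print(f"{prop}: {rates[prop]:.0%} approved -> {bucket}")
```

In practice one would also want a minimum number of reviewed proposals per property before trusting the rate, and a check that approved statements are still in Wikidata after a while.
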
On Thu, Sep 24, 2015 at 4:03 PM Markus Krötzsch <markus(a)semantic-mediawiki.org> wrote:
> On 24.09.2015 23:48, James Heald wrote:
> > Has anybody actually done an assessment on Freebase and its reliability?
> >
> > Is it *really* too unreliable to import wholesale?
>
> From experience with the Primary Sources tool proposals, the quality is
> mixed. Some things it proposes are really very valuable, but other
> things are also just wrong. I added a few very useful facts and fitting
> references based on the suggestions, but I also rejected others. Not
> sure what the success rate is for the cases I looked at, but my feeling
> is that some kind of "supervised import" approach is really needed when
> considering the total amount of facts.
>
> An issue is that it is often fairly hard to tell if a suggestion is true
> or not (mainly in cases where no references are suggested to check). In
> other cases, I am just not sure if a fact is correct for the property
> used. For example, I recently ended up accepting "architect: Charles
> Husband" for Lovell Telescope (Q555130), but to be honest I am not sure
> that this is correct: he was the leading engineer contracted to design
> the telescope, which seems different from an architect; no official web
> site uses the word "architect" it seems; I could not find a better
> property though, and it seemed "good enough" to accept it (as opposed to
> the post code of the location of this structure, which apparently was
> just wrong).
>
> >
> > Are there any stats/progress graphs as to how the actual import is in
> > fact going?
>
> It would indeed be interesting to see which percentage of proposals are
> being approved (and stay in Wikidata after a while), and whether there
> is a pattern (100% approval on some type of fact that could then be
> merged more quickly; or very low approval on something else that would
> maybe better be revisited for mapping errors or other systematic problems).
>
> Markus
>
>
> >
> > -- James.
> >
> >
> > On 24/09/2015 19:35, Lydia Pintscher wrote:
> >> On Thu, Sep 24, 2015 at 8:31 PM, Tom Morris <tfmorris(a)gmail.com> wrote:
> >>>> This is to add MusicBrainz to the primary source tool, not anything
> >>>> else?
> >>>
> >>>
> >>> It's apparently worse than that (which I hadn't realized until I
> >>> re-read the transcript). It sounds like it's just going to generate
> >>> little warning icons for "bad" facts and not lead to the recording of
> >>> any new facts at all.
> >>>
> >>> 17:22:33 <Lydia_WMDE> we'll also work on getting the extension
> >>> deployed that will help with checking against 3rd party databases
> >>> 17:23:33 <Lydia_WMDE> the result of constraint checks and checks
> >>> against 3rd party databases will then be used to display little
> >>> indicators next to a statement in case it is problematic
> >>> 17:23:47 <Lydia_WMDE> i hope this way more people become aware of
> >>> issues and can help fix them
> >>> 17:24:35 <sjoerddebruin> Do you have any names of databases that are
> >>> supported? :)
> >>> 17:24:59 <Lydia_WMDE> sjoerddebruin: in the first version the german
> >>> national library. it can be extended later
> >>>
> >>>
> >>> I know Freebase is deemed to be nasty and unreliable, but is MusicBrainz
> >>> considered trustworthy enough to import directly or will its facts
> >>> need to be dripped through the primary source soda straw one at a
> >>> time too?
> >>
> >> The primary sources tool and the extension that helps us check against
> >> other databases are two independent things.
> >> Imports from MusicBrainz have been happening for a very long time
> >> already.
> >>
> >>
> >> Cheers
> >> Lydia
> >>
> >
> >
> > _______________________________________________
> > Wikidata mailing list
> > Wikidata(a)lists.wikimedia.org
> >
> > https://lists.wikimedia.org/mailman/listinfo/wikidata