We tried to honor ELE[1] for extraction, so most likely, if the the
Wiktionary page deviates from ELE, then results are not so good for it.
I assume you are familiar with SPARQL, because of your D2R mapping for
wikokit. Here is the query:
Select ?g ?p count(?p) as ?count where { Graph ?g { ?s ?p ?o } } group by
?p ?g order by desc (?g) desc(?count)
It takes to long to run over http. If you are interested in more difficult
statistics and calculations, I can also give you better access to our
service (maybe even ssh access).
All the best,
Sebastian
[1]
Thank Sebastian, for quick reply.
> But these do not occur frequently. For senses
these seem to be available
> however...
Can you count - how many senses and synonyms were successfully
extracted from English Wiktionary and Russian Wiktionary,
i.e. how many senses and synonyms are available now in DBpedia Wiktionary?
It will be interesting to compare with number of senses and synonyms
extracted from Wiktionaries by wikokit parser,
seehttp://code.google.com/p/wikokit/#Statistics
Best regards,
Andrew.
On Fri, Apr 5, 2013 at 5:57 PM, Sebastian Hellmann
<hellmann(a)informatik.uni-leipzig.de> wrote:
>
> Hi Andrew,
> actually the tools to solve this problem are in place:
>
http://en.wiktionary.org/wiki/house#English-abode
> links to a sense, the highlighting is there, also if you go to Editing
> Gadgets you can enable "Enable definition editing options." to add
> glosses.
> This was created by Yair_rand and it allows you to connect senses with
> the
> help of glosses such as "abode".
>
> However, this has not received any uptake by the Wiktionary community.
>
> The idea is to have something like (on
>
http://en.wiktionary.org/wiki/house#English-establishment)
> # {{senseid|en|establishment}}An [[establishment]], whether actual, as a
> pub, or virtual, as a website. Particularly restaurant, casino, or
> financial
> or trading company.
> ...
> * {{sense|establishment}} [[shop]]
> ...
> {{trans-top|an establishment}}
>
> But these do not occur frequently. For senses these seem to be available
> however:
>
>
http://wiktionary.dbpedia.org/resource/as_soon_as_possible-English-Adverb-1…
>
> Query:
>
http://wiktionary.dbpedia.org/sparql
> select * where {Graph ?g {?s
> <http://wiktionary.dbpedia.org/terms/hasSynonym> ?o } } limit 100
>
> All the best,
> Sebastian
>
> Am 05.04.2013 11:23, schrieb Andrew Krizhanovsky:
>
>> DBpedia Wiktionary - is very interesting project!
>>
>> Is it possible to get list of synonyms for the first meaning of the
>> noun "dog" now?
>>
http://en.wiktionary.org/wiki/dog#Synonyms
>>
>> Best regards,
>> Andrew Krizhanovsky.
>>
>> On Fri, Apr 5, 2013 at 11:05 AM, Dimitris Kontokostas
>> <kontokostas(a)informatik.uni-leipzig.de> wrote:
>>>
>>> Hi Moutupsi,
>>>
>>> You should definitely take look at DBpedia Wiktionary (
>>>
http://dbpedia.org/Wiktionary).
>>> It supports everything you want and can be easily configured for other
>>> languages.
>>>
>>> Best,
>>> Dimitris
>>>
>>>
>>> On Thu, Apr 4, 2013 at 4:21 AM, Moutupsi Paul
>>> <mopaul(a)cs.stonybrook.edu>wrote;wrote:
>>>
>>>> Hi All,
>>>>
>>>>
>>>>
>>>> Greeting,
>>>>
>>>>
>>>>
>>>> I am a CS grad student from Data Science Lab Stony Brook<
>>>>
https://sites.google.com/site/datascienceslab/> and I am dropping
>>>> this
>>>> mail to request information about parsing multi-lingual Wiktionary
>>>> data.
>>>> Our lab has been using Wikipedia data for quite a while now but we are
>>>> really interested in taking advantage of the massive Wiktionary
>>>> content
>>>> which we feel , after proper parsing, can become an rich muti-language
>>>> corpus.
>>>>
>>>>
>>>>
>>>> But the big hurdle is a parsing tool. We have tried a few Wiktionary
>>>> parsing tools
>>>>
>>>>
>>>>
>>>>
1.https://github.com/clbecker/perl-wiktionary-parser/
>>>>
>>>> 2.
>>>>
https://code.google.com/p/wikokit/wiki/GettingStartedWiktionaryParser
>>>>
>>>> 3.
>>>>
>>>>
>>>>
https://github.com/benreynwar/wiktionary-parser/tree/master/wiktionary_pars…
>>>>
>>>> 4.http://www.ukp.tu-darmstadt.de/software/jwktl/
>>>>
>>>>
>>>>
>>>> but none of them are available in a ready-to-use or easy-to-extend in
>>>> multiple language mode. (I am currently trying to work with wikokit
>>>> (parser
>>>> 2 above) )
>>>>
>>>>
>>>>
>>>> I request for some advice, suggestion or redirection towards best
>>>> available Wiktionary parser. We are mainly looking to extract
>>>> meanings,
>>>> POS, examples, translations etc. (more can never hurt).
>>>>
>>>>
>>>>
>>>> Any help is appreciated. Kindly let know if further information is
>>>> needed.
>>>>
>>>>
>>>>
>>>> Regards,
>>>>
>>>> Moutupsi
>>>>
>>>> _______________________________________________
>>>> Wiktionary-l mailing list
>>>> Wiktionary-l(a)lists.wikimedia.org
>>>>
https://lists.wikimedia.org/mailman/listinfo/wiktionary-l
>>>>
>>>> --
>>>> Dimitris Kontokostas
>>>> Department of Computer Science, University of Leipzig
>>>> Research
Group:http://aksw.org
>>>>
Homepage:http://aksw.org/DimitrisKontokostas
>>>> <https://lists.wikimedia.org/mailman/listinfo/wiktionary-l>
>>>
>>> _______________________________________________
>>> Wiktionary-l mailing list
>>> Wiktionary-l(a)lists.wikimedia.org
>>>
https://lists.wikimedia.org/mailman/listinfo/wiktionary-l
>>
>> _______________________________________________
>> Wiktionary-l mailing list
>> Wiktionary-l(a)lists.wikimedia.org
>>
https://lists.wikimedia.org/mailman/listinfo/wiktionary-l
>>
> --
> Dipl. Inf. Sebastian Hellmann
>
> Department of Computer Science, University of Leipzig
>
Projects:http://nlp2rdf.org ,http://linguistics.okfn.org ,
>
http://dbpedia.org/Wiktionary ,http://dbpedia.org
> Homepage:http://bis.informatik.uni-leipzig.de/SebastianHellmann
> Research
Group:http://aksw.org
--
Dipl. Inf. Sebastian Hellmann
Department of Computer Science, University of Leipzig
Projects: